简体   繁体   中英

reading .doc, .docx, .pdf, .rtf documents in .net without Word

so far it's only aspose words but which is very pricey

other are to convert to .pdf or to print to .pdf

I am looking for a way to read the contents of these doc types without installing office or pdf app ie get the text of these documents for parsing

You want to use components that plug into the IFilter framework, which is what windows uses to index documents for its text search.

For office documents you can use Office 2010 Filter Pack For pdf, you can use a commercial offering such as FoxIt IFilter , which seems fairly priced.

DevExpress现在提供了一个文档服务器组件,其价格远低于Aspose。

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM