I would like to take an image, and index the words in that image, let the user search the image and then display the image highlighting around keywords.
So, what I need is an OCR class that takes an image and that gives me a list of words with their coordinates? Possible?
You can use these projects
Tessnet2 (a .NET 2.0 Open Source OCR assembly using Tesseract engine)
Tesseract-ocr (An OCR Engine that was developed at HP Labs between 1985 and 1995... and now at Google)
You can do it with Microsoft Office Document Imaging 12.0 Type Library.
Sample: Converting Images to Text using Office 2007 OCR, OpenXML and Speech Recognition
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.