Extract Text from Images, PDF Ebooks

by Hami on April 10, 2009

ADVERTISEMENTS

pdf-to-textStudents, Office workers or similar jobs require handling of printed text and most of times feeding the written data into pc for some assignments, or sometimes to enhance the presentation of the text. But last week I encountered a problem in my research when I needed some text from a Encyclopedia available on Google Books, but didn’t wanted to type all those pages instead selected this way to get my work done for my assignments.

I used OCR (Optical Character Recognition) which let you scan the text from the images forms, as available in pdf files or like in Google Books. Output would be in a fully text form, which can be then selected and can be inserted into assignments easily.

For this you will need JOCR. This app is very lite and easy to download, and let you convert the images into text. Images can also be captured with this app, with this function you will be able to capture images of protected text pages in the form of images where text can not be selected, or take a snap of error messages.

This app uses “Micorosoft Office Document Imaging” (MODI) which comes along with the Microsoft Office 2003 and higher versions (Under Office Tools of setup file), or you can manually download MODI for this purpose.

JOCR is capable of extracting text in various languages from Chinese, Dutch, Japanese, Italian etc

Screenshot

images-to-text-converter

Download JOCR




Subscribe Now

If you enjoyed this post, you will definitely enjoy our others. Subscribe to the feed to get instantly updated for those awesome posts soon to come.

Leave a Comment

CommentLuv Enabled

Previous post:

Next post: