Running text-conversion on PDF files

Adobe Acrobat: Progress bar on OCR pdf batch

Evernote runs text-recognition on all the images uploaded to their service. This great feature allows you to search through photos taken of notes and printouts. How does Evernote do it? It sounds like a fairy land.

I’m using Adobe Acrobat Pro to OCR a batch of 32 PDF files that I scanned on the printer at 300 dpi. It takes a good amount of time for it to process each PDF. 388 pages in 32 PDF files took 37 minutes running on a 2011 Macbook Pro (Intel Core i7 Processor 2GHz with 4GB SDRAM RAM).

That’s 10.4 pages per minute. (6.5 seconds per page)

I don’t know how Evernote manages to do text recognition in all those photos uploaded to the service. Kudos to Evernote for providing this service and all the processor power it requires.

Do you use the text recognition feature in Evernote? If so, let me know your thoughts about the service in the comments. Thanks!

Enjoyed this blog post?

Receive notifications of new posts via email

Subscribe
Notify of
guest

This site uses Akismet to reduce spam. Learn how your comment data is processed.

1 Comment
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Matt Maldre
6 years ago

Also, the 32 pre-OCR PDF files totaled 26.9 MB. The OCR PDF files grew up to 141.6 MB. That’s 5.2x times as large!

1
0
Would love your thoughts, please comment.x
()
x