OmniPage is a PDF tool that advertises itself as "the most accurate OCR solution." It can process scanned documents and save PDFs in multiple formats, including Excel spreadsheets.
From the product's website:
OmniPage can convert almost all image formats into editable files for your existing PC applications, including Microsoft Word and Excel. Better still, OmniPage can convert documents from one format to another (e.g. TIF to PDF) and make static image documents searchable on your PC and within content-management applications.
In terms of its optical character recognition and PDF-to-spreadsheet conversion capabilities, the standard and professional versions do not have significant differences.
It also boasts recognition dictionaries for a variety of industry-specific jargon from law to medicine and features an "IntelliTrain proofing system" that allows it to "learn" the user's editing style to improve results.
April 23, 2009//
If you're a reporter who regularly deals with tricky documents that need text recognition or conversion into sortable spreadsheets, OmniPage is worth the investment of money and time to conquer its quirks. But despite great performance on many of our toughest tests, it's not as ideal for time-strapped journalists looking for a quick solution.READ OUR FULL REVIEW »
OmniPage did a poor job converting this list of Bernie Madoff's customers with the software's default setting. But because the document was so uniform, quickly defining and applying a template resulted in a spreadsheet that looked identical to the original PDF.READ OUR FULL TEST RESULT »
OmniPage had no trouble with these scanned memos from the Obama-Biden transition team, flawlessly recognizing the text despite a variety of different scan qualities.READ OUR FULL TEST RESULT »
It took an hour or so, but OmniPage is the only software we've tested to date that converted this 1,000-page PDF of border fence contributors into a spreadsheet -- reproducing the original almost perfectly. Its OCR feature was even able to handle the weird font that's consistently foiled other programs.READ OUR FULL TEST RESULT »
OmniPage's guesswork on the formatting of this PDF database report will only go so far toward converting the document into a sortable table. But a little manual help from the user before the conversion saves hours of data wrangling later and results in a perfect table.READ OUR FULL TEST RESULT »
OmniPage Pro whipped through these transcripts from combatant tribunals fast, recognizing the text in all but a few areas with lower quality scans.READ OUR FULL TEST RESULT »
Processing these scanned disclosure forms from North Carolina legislators is time-consuming with OmniPage, and although it recognized most of the text accurately, a software hangup prompted by this 1,700-page file made the results more difficult to wrangle.READ OUR FULL TEST RESULT »
Although it does give users some ability to customize the results of the spreadsheets it outputs, OmniPage's options aren't much help when it comes to making sense of this unlined PDF of Clinton administration political appointments. Results would take a long time to clean up without programming knowledge.READ OUR FULL TEST RESULT »
OmniPage completely ignores the relevant text in this low-resolution PDF index of Congress reports containing partial text. Even with ample options for recognizing text, the software only manages to capture page numbers and annotations -- worthless in this context.READ OUR FULL TEST RESULT »