Document: Madoff customers

Convert PDF with lined tables into spreadsheet

Difficulty:

Good formatting, lined tables make things simple

This list of customers of disgraced financier Bernie Madoff must be converted into a sortable spreadsheet, a fairly straightforward task for PDF tools. Because the data is already laid out as a table, most tools should be able to handle it.

DESIRED OUTCOME: Sort and filter the resulting spreadsheet to find all 29 entries for the Shapiro family's trusts run out of Palm Beach, Fla.
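As a rough illustration of that outcome, the sort-and-filter step might look like the Python sketch below once a converter has produced a CSV export. The column names (`name`, `city`, `state`) and the sample rows are assumptions made up for illustration; the real spreadsheet's headers and values may differ.

```python
import csv
import io

def find_entries(csv_text, surname, city, state):
    """Return rows whose name contains `surname` and whose city and state match, sorted by name."""
    reader = csv.DictReader(io.StringIO(csv_text))
    matches = [
        row for row in reader
        if surname.lower() in row["name"].lower()
        and row["city"].strip().lower() == city.lower()
        and row["state"].strip().lower() == state.lower()
    ]
    return sorted(matches, key=lambda row: row["name"])

# Made-up sample rows, not taken from the actual document.
SAMPLE = """name,city,state
SHAPIRO FAMILY TRUST B,PALM BEACH,FL
EXAMPLE CUSTOMER,NEW YORK,NY
SHAPIRO FAMILY TRUST A,PALM BEACH,FL
SHAPIRO EXAMPLE,BOSTON,MA
"""

results = find_entries(SAMPLE, "Shapiro", "Palm Beach", "FL")
for row in results:
    print(row["name"], row["city"], row["state"])
```

The same filter can be applied by hand in any spreadsheet program; the sketch only shows that a clean, single-sheet conversion makes the task a few lines of work.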

 

Test Results

Verdict:

Processing with template results in flawless spreadsheet

User-defined templates help OmniPage perfect results

OmniPage did a poor job converting this list of Bernie Madoff's customers using the software's default settings. But because the document was so uniform, quickly defining and applying a template resulted in a spreadsheet that looked identical to the original PDF.


Verdict:

Table converts easily, perfectly to sortable spreadsheet; features make inconsistencies easy fixes

Simple table poses no conversion problem for Monarch

The consistent formatting of this listing of disgraced financier Bernie Madoff's customers made converting the document into a sortable spreadsheet an easy task for Monarch.


Verdict:

Turns PDF into exact replica in Excel

Lined table an easy conversion for Able2Extract

Conversion of this simple table of Bernie Madoff's customers is quick, easy and accurate with Able2Extract. Both the column alignment and the text conversion are perfect.


Verdict:

Handles basic table perfectly

PDF2XL quickly turns lined tables into spreadsheet

PDF2XL OCR had no trouble with this document, a simple lined table of Bernie Madoff's customers and their addresses. Using the program's auto-detect feature, the 163-page document took only seconds to convert.


Verdict:

Great accuracy, perfect formatting, almost no cleanup

Acrobat produces almost perfect spreadsheet from simple PDF

It takes less than a minute for Adobe Acrobat to transform this lined PDF table of Bernie Madoff's customers into an almost perfect spreadsheet reproduction. The resulting Excel file is clean and complete, requiring only a little effort before journalists can dive into the data.


Verdict:

Converts PDF to near-perfect text file in seconds; some post-conversion cleanup required

Pdftotext quickly, fairly accurately converts lined spreadsheet into text file

Pdftotext easily converts this list of former Madoff customers into a text file suited to fixed-width import in Excel. The program's -layout option, which makes the output file preserve the layout of the original, is key to this result.
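For readers who prefer a script to Excel's fixed-width import, a minimal Python sketch of the same idea follows. It assumes the pdftotext binary is installed and on the PATH, and it assumes columns in the layout-preserved text are separated by at least two spaces; the helper names and the sample line are hypothetical.

```python
import re
import subprocess

def pdf_to_layout_text(pdf_path, txt_path):
    """Run pdftotext with -layout so columns stay aligned in the text output.

    Assumes the pdftotext binary is installed and on the PATH.
    """
    subprocess.run(["pdftotext", "-layout", pdf_path, txt_path], check=True)

def split_columns(line):
    """Split one layout-preserved line on runs of two or more whitespace characters.

    Assumption: no cell contains two consecutive spaces, and no cell is empty.
    """
    return re.split(r"\s{2,}", line.strip())

# Made-up line in the style of -layout output, not taken from the actual document.
sample_line = "SHAPIRO FAMILY TRUST      123 EXAMPLE RD     PALM BEACH   FL"
columns = split_columns(sample_line)
print(columns)
```

Splitting on whitespace runs breaks down when a cell is blank, because the neighboring columns collapse together. In that case, slicing each line at fixed character offsets, as Excel's fixed-width import does, is the more robust choice.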


Verdict:

Simple housekeeping required

Despite junk data, Cometdocs spits out usable spreadsheet

Cometdocs did a good job converting this simple, lined table into a sortable spreadsheet requiring minimal cleanup. Nothing standout here, but certainly fast and easy if you're looking for something quick.


Verdict:

Clean data; multiple sheets make it difficult to handle

Zamzar yields clean data, but too many spreadsheets

Even though Zamzar did a decent job converting the data in this 163-page file of Bernie Madoff's customers into spreadsheet format, the entries were spread across 163 separate sheets.


Verdict:

Perfectly captured data split over dozens of sheets, requiring automated cleanup

Nitro spreads data from simple table across multiple sheets

Although it captures the data and formats it perfectly, Nitro splits this table of Bernie Madoff's customers over more than 100 sheets. Cleaning that up takes either a lot of manual work or a little programming knowledge.
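The programming involved can be modest. The Python sketch below stacks per-page sheets into a single table and drops repeated header rows. It assumes each sheet has already been read into a list of rows (for example with a spreadsheet library, which is not shown here) and that the first non-blank row of the first sheet is the header; the function name and sample data are hypothetical.

```python
def merge_sheets(sheets):
    """Combine per-page sheets into one table, keeping a single header row.

    `sheets` is an iterable of sheets, each a list of rows (lists of cell values).
    Assumption: the first non-blank row encountered is the header.
    """
    merged = []
    header = None
    for rows in sheets:
        for row in rows:
            if not any(str(cell).strip() for cell in row):
                continue  # skip blank rows
            if header is None:
                header = row
                merged.append(row)
            elif row == header:
                continue  # drop the header repeated on later sheets
            else:
                merged.append(row)
    return merged

# Made-up sheets standing in for two converted pages.
sheet_1 = [["name", "city"], ["EXAMPLE CUSTOMER A", "NEW YORK"]]
sheet_2 = [["name", "city"], ["", ""], ["EXAMPLE CUSTOMER B", "PALM BEACH"]]

table = merge_sheets([sheet_1, sheet_2])
for row in table:
    print(row)
```

Once the rows sit in one table, they can be written back out as a single sheet and sorted or filtered as usual.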


Verdict:

Results badly formatted, split across multiple sheets

DeskUNPDF fails with simple spreadsheet conversion

Tackling a 163-page, lined document results in sluggish conversion time and multiple cluttered spreadsheets that prove impossible to sort and filter.

