OmniPage completely ignores the relevant text in this PDF index of Congress reports containing partial text. Even with ample options for recognizing text, the software only manages to capture page numbers and annotations -- worthless in this context.
Results were the same no matter which OCR options we selected. Some failed to recognize the text areas as text at all, choosing instead to interpret them as images. Even when we defined a custom template to tell OmniPage where to look for text, it returned only a blank page.
The low resolution of the text on the page, the small type and the mixed format of this document all contributed to this failure to parse.