esegura
October 13th, 2008, 01:29 PM
Hi there!
I'm in need of advice. I'm new to Monarch and I need to process a 1000-pages pdf file that looks like this:
Screenshot (http://farm4.static.flickr.com/3166/2938859794_aaf8eaf9dd_b.jpg)
As you can imagine, this layout represents a challenge. To start with, those vertical lines seem like they won't make it to the text version in any form. Second, as you can see in the picture, even the headers are not aligned properly. Third, I'm seeing some variance in the number of spaces that Monarch is inserting when I create the text version. Like in the following example:
(on page 3)
[5 white spaces here]Doe, John
...
(and then on page 629)
[6 white spaces here]Doe, John
So, I can't even rely on the number of spaces. Can you folks here help me develop a strategy to address this problem?
Thanks in advance!
Ed.
I'm in need of advice. I'm new to Monarch and I need to process a 1000-pages pdf file that looks like this:
Screenshot (http://farm4.static.flickr.com/3166/2938859794_aaf8eaf9dd_b.jpg)
As you can imagine, this layout represents a challenge. To start with, those vertical lines seem like they won't make it to the text version in any form. Second, as you can see in the picture, even the headers are not aligned properly. Third, I'm seeing some variance in the number of spaces that Monarch is inserting when I create the text version. Like in the following example:
(on page 3)
[5 white spaces here]Doe, John
...
(and then on page 629)
[6 white spaces here]Doe, John
So, I can't even rely on the number of spaces. Can you folks here help me develop a strategy to address this problem?
Thanks in advance!
Ed.