View Full Version : Anyone know how to convert a pdf table into a table in Word or HTML?


BasilC
08-03-2003, 07:52 PM
I'm trying to convert Internet train timetables into a form which will work with iSilo. A lot of UK online timetables are in pdf format, which is a real pain.

I try selecting the text and copying it into Word or Excel, but no joy, it gets completely garbled.

Anyone know of an alternative. I thought about saving it as an image and then running a sophisticated OCR against it, like Textbridge Pro, but I suspect that's an expensive program and I'm not sure it would work anyway

Alexander Turcic
08-04-2003, 02:45 AM
I recommend using Gemini (see post http://www.turcic.com/forums/showthread.php?postid=1772#post1772) which is the only one that can specifically convert pdf tables to html. It is not cheap though :(

BasilC
08-04-2003, 06:30 PM
Thanks, Alex, I might give it a go.

Last night I downloaded a trial version of ABBYY FineReader 6.0 Professional (http://www.abbyy.com/) , which is an optical character recognition program which can convert pdf files.

I tried it on a sample timetable, but it didn't make a very good job of it. The problem seems to be that it's actually working on the basis of an image of the pdf page, and this particular image is very misleading as to where the column dividers are. However, I haven't read the extensive instructions and tutorials, so there might be a way to improve the performance.

palmtungsten
10-04-2003, 01:16 AM
Not sure if this would work, but if you have documents to go premium edition (from dataviz, http://www.dataviz.com/products/documentstogo/index.html) it has pdf support, just drag and drop the pdf file to DTG window and the conversion is automatic, and you'll be able to read the pdf file on your palm.

Good luck!

Novelwriter
07-16-2009, 09:45 PM
I'm trying to convert Internet train timetables into a form which will work with iSilo. A lot of UK online timetables are in pdf format, which is a real pain.

I try selecting the text and copying it into Word or Excel, but no joy, it gets completely garbled.

Anyone know of an alternative. I thought about saving it as an image and then running a sophisticated OCR against it, like Textbridge Pro, but I suspect that's an expensive program and I'm not sure it would work anyway

Here is a quick and easy (FREE) online way to convert files:

http://www.convertpdftoword.net/




http://img217.imageshack.us/img217/1820/284683972870e30m3.gif (http://www.amazon.com/PRAY-DEATH-Bestseller-Cash-Pawley/dp/B002HREKZA)

carlobee
07-20-2009, 08:47 PM
Here is a quick and easy (FREE) online way to convert files:

http://www.convertpdftoword.net/




http://img217.imageshack.us/img217/1820/284683972870e30m3.gif (http://www.amazon.com/PRAY-DEATH-Bestseller-Cash-Pawley/dp/B002HREKZA)

i've been using this too. i'll recommend this one.

EricDP
06-25-2010, 12:33 AM
What happens if you use Calibre?

Poppa1956
06-25-2010, 01:02 AM
What happens if you use Calibre?

It ain't pretty. Unless there's some trick to it I could learn by reading the manual.