View Single Post
Old 01-06-2009, 08:11 AM   #49
AprilHare
Wizard
AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.AprilHare ought to be getting tired of karma fortunes by now.
 
AprilHare's Avatar
 
Posts: 2,979
Karma: 11862367
Join Date: Apr 2008
Device: Sony Reader PRS-T2
Very good, finally got the script going:
Of course, it didn't have to be all fair sailing and here is the output from my first conversion:
Quote:
:~/Desktop/untitled folder/CleanMe!!!/gutlrf$ ./gutlrf.pl http://www.gutenberg.org/files/17297/17297-h.zip
... 0KBytes

Extracting files...

Book Title: British Highways And Byways From A Motor Car
Author : Thomas D Murphy

Cleaning HTML...
Wrote cleaned HTML "/tmp/17297-h/new.htm"
Converting to BBeB...
Processing u'new.htm'
Parsing HTML...
Converting to BBeB...
An error occurred while processing a table: AttributeError("'module' object has no attribute 'tt0011m_'",). Ignoring table markup.
An error occurred while processing a table: AttributeError("'module' object has no attribute 'tt0011m_'",). Ignoring table markup.
Rationalizing font sizes...
Output written to /tmp/17297-h/British Highways And Byways From A Motor Car.lrf
Segmentation fault
Died at ./gutlrf.pl line 261.
Attached Files
File Type: lrf British Highways And Byways From A Motor Car.lrf (2.20 MB, 286 views)

Last edited by AprilHare; 01-06-2009 at 08:14 AM. Reason: Attaching output LRF
AprilHare is offline   Reply With Quote