Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Readers > Sony Reader

Notices

Reply
 
Thread Tools Search this Thread
Old 10-23-2006, 03:22 PM   #1
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
Using Finereader to batch convert PDF files to RTF

I'm using Finereader 8.0 professional to batch convert PDF or Multi-page TIF files to RTF files for reading on Sony Reader and the result is basically satisfactory. It will automatically convert up to 10,000 pages at a time, unattended. I find Finereader 8.0 has much better OCR accuracy than Omnipage 15, Readiris 11, Paperport 11, or Adobe Acrobat 7.0. The page size I use for rtf files is 5.24'x6.69'. Since Sony Reader does not support pictures within RTF, I set the format to "not keeping pictures" in conversion, and this way it reduces the file size dramatically.

One problem that bugs me is that although the converted RTF file has the same FILENAME as the original PDF file, its DOCUMENT TITLE shown on Sony Reader is not the same. I know I can always open each rtf file and change the title dada in file properties one by one, but that would be too much work if you have hundreds of files to modify. Do you have a better solution to batch convert the RTF title to be the same as its file name? Thanks in advance for your suggestions.
gdxf is offline   Reply With Quote
Old 10-23-2006, 03:48 PM   #2
Bob Russell
Recovering Gadget Addict
Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.Bob Russell ought to be getting tired of karma fortunes by now.
 
Bob Russell's Avatar
 
Posts: 5,324
Karma: 590871
Join Date: May 2004
Location: Pittsburgh, PA
Device: Note3, MacBook Air
Sorry, I have no answers about the file names. But another question about what you said... this is probably a silly question, but why does it need a page size for RTF files?
Bob Russell is offline   Reply With Quote
Old 10-23-2006, 04:08 PM   #3
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
Thanks, Bob. Your question sort of solved my puzzle, for it actually makes me realize that probably rtf does not need page size at all. I'm very new to this page size thing and have experimented with other page sizes such as A4, letter, or Sony reader screen size, but didn't find much difference in the converted file. So I guess probably you are right that rtf files does not need a page size to be read on the reader.
gdxf is offline   Reply With Quote
Old 10-24-2006, 03:12 AM   #4
ath
Addict
ath doesn't litterath doesn't litter
 
Posts: 222
Karma: 110
Join Date: Jun 2006
Location: Malmo, Sweden
Device: iLiad, Sony PRS-505, Kindle
Quote:
Originally Posted by gdxf
One problem that bugs me is that although the converted RTF file has the same FILENAME as the original PDF file, its DOCUMENT TITLE shown on Sony Reader is not the same.
I'm not entirely sure about this, but document title is probably one of the PDF document properties, and so something that FR doesn't look at -- as far as I know, it 'prints' PDF pages, OCRs the result, and converts that to whatever output format you have. And document properties don't get printed.

I can imagine two solutions:

A. A program that extracts the document title (and other metadata) from the PDF file, and inserts it in the proper place in the RTF file.

B. Suggest to ABBYY that PDF (and perhaps other) metadata should, if possible, survive this conversion, and hope they think it's a good idea.

Of course, I'm assuming that there *is* a proper place for document title to be placed in the RTF file, and not just something that is Sony Reader specific.
ath is offline   Reply With Quote
Old 10-24-2006, 11:31 AM   #5
NatCh
Gizmologist
NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.NatCh ought to be getting tired of karma fortunes by now.
 
NatCh's Avatar
 
Posts: 11,605
Karma: 926222
Join Date: Jan 2006
Location: Republic of Texas Embassy at Jackson, TN
Device: Nook STGR
Quote:
Originally Posted by ath
I'm assuming that there *is* a proper place for document title to be placed in the RTF file, and not just something that is Sony Reader specific.
There is, indeed: http://www.mobileread.com/forums/sho...1650#post41650
NatCh is offline   Reply With Quote
Old 10-24-2006, 02:43 PM   #6
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
Quote:
Originally Posted by ath
I'm not entirely sure about this, but document title is probably one of the PDF document properties, and so something that FR doesn't look at -- as far as I know, it 'prints' PDF pages, OCRs the result, and converts that to whatever output format you have. And document properties don't get printed.

I can imagine two solutions:

A. A program that extracts the document title (and other metadata) from the PDF file, and inserts it in the proper place in the RTF file.

B. Suggest to ABBYY that PDF (and perhaps other) metadata should, if possible, survive this conversion, and hope they think it's a good idea.

Of course, I'm assuming that there *is* a proper place for document title to be placed in the RTF file, and not just something that is Sony Reader specific.
Since finereader does keep the original pdf file name in the converted rtf file. Another solution I'm thinking of is to get such a software that can make the rtf "document title" the same as its "file name," i.e., synchronize the two. I know PDF EXplorer can batch synchronize the pdf document title to be the same as the pdf file name and do hundreds of files within seconds. I'm wondering whether there is an equivalent program that can do the same for rtf files. Any suggestions?
gdxf is offline   Reply With Quote
Old 10-25-2006, 03:01 PM   #7
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
I start to find that the fonts of Finereader converted rtf files look too small on the Sony reader, even after I use "L" size on the reader. Does anybody know how to batch increase the font size either in Finereader or in MS Word?
gdxf is offline   Reply With Quote
Old 10-28-2006, 05:32 AM   #8
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
Quote:
Originally Posted by gdxf
I start to find that the fonts of Finereader converted rtf files look too small on the Sony reader, even after I use "L" size on the reader. Does anybody know how to batch increase the font size either in Finereader or in MS Word?
After two days studying on how the Word Macro works, I kind of solved the problem with writing a macro for word to run. It completely fulfills my goal: batch make the rtf document title the same as its file name, batch increase font size to 16 points, batch line space to 1.5.
gdxf is offline   Reply With Quote
Old 10-28-2006, 12:38 PM   #9
slayda
Retired & reading more!
slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.slayda ought to be getting tired of karma fortunes by now.
 
slayda's Avatar
 
Posts: 2,741
Karma: 884247
Join Date: Sep 2006
Location: North Alabama, USA
Device: Kindle 1, iPad 4, iPhone 5
Quote:
Originally Posted by gdxf
After two days studying on how the Word Macro works, I kind of solved the problem with writing a macro for word to run. It completely fulfills my goal: batch make the rtf document title the same as its file name, batch increase font size to 16 points, batch line space to 1.5.

Hi gdfx, mind sharing how to create that macro?
Thanks,
slayda is offline   Reply With Quote
Old 10-28-2006, 04:14 PM   #10
gdxf
Enthusiast
gdxf began at the beginning.
 
Posts: 48
Karma: 27
Join Date: Oct 2006
Device: Sony Reader PRS-500
No problem, slayda. If anyone else is also interested, please come up with suggestions and improvements for converting rtf files to be read on the reader. Here is what I did, I go to MS Word 2003, in Tools section find "Macro," and create a new Macro from there: create a button on the toolbar, such as "SET16" the one I use, then paste the following codes in the attached file into your macro editing field, it should do the job automatically and tirelessly for you once you click the button. Of course, you can choose your own source folder and set font size to something other than 16 by slightly modifying the macro.
Attached Files
File Type: txt rtf conversion macro.txt (1.5 KB, 509 views)
gdxf is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Can't convert RTF files sglinert Calibre 10 06-08-2010 11:03 AM
Can't convert RTF files sglinert Calibre 0 06-06-2010 10:14 PM
RTF vs HTML---best way to convert my files? ficbot Workshop 16 05-06-2010 06:05 PM
Batch convert .lit or .rb files? mr.sam Sony Reader 11 12-10-2009 05:16 PM
PRS-500 RTF Tools - a simple tool for batch edtiting your rtf ebooks boradicus Sony Reader Dev Corner 11 05-06-2008 11:41 AM


All times are GMT -4. The time now is 04:42 PM.


MobileRead.com is a privately owned, operated and funded community.