Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Formats > Workshop

Notices

Reply
 
Thread Tools Search this Thread
Old 03-05-2009, 12:58 PM   #211
Thorkin
Junior Member
Thorkin began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Mar 2009
Device: kindle
Quote:
Originally Posted by nrapallo View Post
That signifies to me that there is an error encountered during the creation of the .prc due to a number of reasons, but most likely linked to memory issues or .pdf encoding. See if you can create a .prc from the resulting .html and images using Mobipocket Creator or Calibre.

Try a range of pages, like 1 to 10, and see if that works using PDFRead's internal .prc creation.

Also, try converting the .djvu file instead. I tested it with both and there were some missing pages when I used the .pdf file (some of the first 10 pages didn't come through).
Hrm, I can get it to work with some other pyle ebooks, and I can get Creator to make ebooks out of the .pngs, but otherwise the error remains. It seems to be something related to filenames as changing the creation filename will help sometimes.

Thanks for your help though!
Thorkin is offline   Reply With Quote
Old 03-06-2009, 02:07 PM   #212
malife
Junior Member
malife doesn't littermalife doesn't litter
 
Posts: 5
Karma: 136
Join Date: Mar 2009
Device: Kindle 2
Hello All,
Nick, awesome job with PDFRead absolutley loved it. Great tool. I too can confirm the "blank page" bug in Kindle 2 mentioned by AnthonyPaulO. If I generate the html and then produce the mobi either with mobipocket (in Windows) or Calibre in Mac the generated book renders correctly and no blank pages are inserted.

I have a question. Say I want to read a PDF that is mostly two columns, i.e. a scientific paper. Is there a way for me to split that document column wise even if I loose some content (like the title) that is not dobule column? For instance say I have a PDF document like the one shown in the attached sample.png, and want to split it in columns like the attached col1.png and col2.png. Can PDFRead do that? Anyone knows the unpaper setting that would make the trick?

Thanks!
Attached Thumbnails
Click image for larger version

Name:	sample.png
Views:	289
Size:	140.7 KB
ID:	25110   Click image for larger version

Name:	col1.png
Views:	291
Size:	68.9 KB
ID:	25111   Click image for larger version

Name:	col2.png
Views:	273
Size:	70.0 KB
ID:	25112  
malife is offline   Reply With Quote
 
Enthusiast
Old 03-06-2009, 02:15 PM   #213
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by malife View Post
Hello All,
Nick, awesome job with PDFRead absolutley loved it. Great tool. I too can confirm the "blank page" bug in Kindle 2 mentioned by AnthonyPaulO. If I generate the html and then produce the mobi either with mobipocket (in Windows) or Calibre in Mac the generated book renders correctly and no blank pages are inserted.
That's very interesting. Could you upload a sample .prc (and .html) of the 'with blank pages (PDFRead created)' and 'without (Mobipocket or Calibre created)' ebooks. I'll investigate it further to determine what can be done to stop that!

Quote:
I have a question. Say I want to read a PDF that is mostly two columns, i.e. a scientific paper. Is there a way for me to split that document column wise even if I loose some content (like the title) that is not dobule column? For instance say I have a PDF document like the one shown in the attached sample.png, and want to split it in columns like the attached col1.png and col2.png. Can PDFRead do that? Anyone knows the unpaper setting that would make the trick?

Thanks!
PDFRead can split the page like that with the Layout mode: 'portrait-2col' (with four quadrants/pages). However, it won't detect the two columns boundaries, just split down the middle. Not too helpful, most of the time. See this post for more info.

However, for that functionality try PaperCrop - Multi-column PDF files on 6 inch display. It does this better!

Last edited by nrapallo; 03-06-2009 at 02:18 PM.
nrapallo is offline   Reply With Quote
Old 03-07-2009, 03:32 PM   #214
malife
Junior Member
malife doesn't littermalife doesn't litter
 
Posts: 5
Karma: 136
Join Date: Mar 2009
Device: Kindle 2
Blank Pages Woes

Here they are Nick. The three formats from PDFRead, Mobipocket Creator and Calibre. I confirm again the PDFRead version has blank pages every other page, at least it displays so in the Kindle 2. Interesting how the Calibre file is noticeably larger (almost 3 times) than the other two.

Thank you so much for the tip on the PaperCrop app. Really useful specially for papers. Thanks!
Attached Files
File Type: mobi NokiaPaperFromCalibreMac.mobi (673.2 KB, 235 views)
File Type: prc NokiaPaperFromMobipocket.prc (271.3 KB, 237 views)
File Type: prc NokiaPaperFromPDFRead.prc (277.3 KB, 251 views)
malife is offline   Reply With Quote
Old 03-07-2009, 04:53 PM   #215
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by malife View Post
Here they are Nick. The three formats from PDFRead, Mobipocket Creator and Calibre. I confirm again the PDFRead version has blank pages every other page, at least it displays so in the Kindle 2. Interesting how the Calibre file is noticeably larger (almost 3 times) than the other two.

Thank you so much for the tip on the PaperCrop app. Really useful specially for papers. Thanks!
Thanks for these!

Nothing apparent that would cause that blank page, other than the NRhtml2mobi.exe I use.

Attached is the explosion of those .prc using my primitive .prc to .html utility called makedocN. I can't find anything that would trigger those blank pages.

More investigation needed...
Attached Files
File Type: zip Blank pages testing-makedocN.zip (2.80 MB, 235 views)
nrapallo is offline   Reply With Quote
Old 03-07-2009, 10:29 PM   #216
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by malife View Post
Here they are Nick. The three formats from PDFRead, Mobipocket Creator and Calibre. I confirm again the PDFRead version has blank pages every other page, at least it displays so in the Kindle 2. Interesting how the Calibre file is noticeably larger (almost 3 times) than the other two.

Thank you so much for the tip on the PaperCrop app. Really useful specially for papers. Thanks!
malife:

OK, just to rule out the old v0.0.37 Mobiperl I previously used to create NRhtml2mobi.exe, I just updated it to the latest Mobiperl v0.0.41.

Please try again using the attached updated NRhtml2mobi.exe (up to v0.0.41) by replacing the one in the 'bin' folder of the PDFRead install directory.

Thanks!
Attached Files
File Type: zip NRhtml2mobi-v41.zip (1.61 MB, 4016 views)

Last edited by nrapallo; 03-07-2009 at 11:36 PM. Reason: typo
nrapallo is offline   Reply With Quote
Old 03-08-2009, 12:12 PM   #217
malife
Junior Member
malife doesn't littermalife doesn't litter
 
Posts: 5
Karma: 136
Join Date: Mar 2009
Device: Kindle 2
Sorry but no Joy. Still blank page every other page

Unfortunately I won't be able to continue reporting the Kindle 2 issues. I've decided I will return it and get a Sony Reader. There are many issues I dislike in the Kindle, being lack of PDF support the most important of them. I will still be using PDFRead though for papers and complicated PDFs.

As soon as I get my Sony Reader I will report back if the "blank page" bug is also present there, though I doubt it.

Thanks for a superb free tool!
malife is offline   Reply With Quote
Old 03-08-2009, 12:33 PM   #218
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by malife View Post
Sorry but no Joy. Still blank page every other page

Unfortunately I won't be able to continue reporting the Kindle 2 issues. I've decided I will return it and get a Sony Reader. There are many issues I dislike in the Kindle, being lack of PDF support the most important of them. I will still be using PDFRead though for papers and complicated PDFs.

As soon as I get my Sony Reader I will report back if the "blank page" bug is also present there, though I doubt it.

Thanks for a superb free tool!
Thanks for all your effort in helping PDFRead get better! Much appreciated!!
nrapallo is offline   Reply With Quote
Old 03-10-2009, 11:26 AM   #219
DaleDe
Grand Sorcerer
DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.DaleDe ought to be getting tired of karma fortunes by now.
 
DaleDe's Avatar
 
Posts: 9,388
Karma: 4531756
Join Date: Aug 2007
Location: Grass Valley, CA
Device: EB 1150, EZ Reader, Literati, iPad 2
Typically blank pages are caused by the image being slightly longer than the page itself. The page view gets cropped to the length of the page and then rest of the page shows up on the next page (which appears blank) but is actually the bottom area of the previous page. This is likely due to some extra info on the bottom of the page that causes the area of the page to be smaller than you thought.

Dale
DaleDe is online now   Reply With Quote
Old 03-21-2009, 08:36 PM   #220
Student1
Groupie
Student1 doesn't litterStudent1 doesn't litter
 
Posts: 159
Karma: 170
Join Date: Feb 2009
Device: PRS-505
Hi guys,

starting some massive conversion for pdf to lrf and wondering if these settings are the best that can be done to get good quality and more important as close as possible to full page on my prs 505. Here are the settings i ve been using (using command line as i batch it all for 300+ books so this takes weeks on 2 computers)

set LOC=C:\Program Files (x86)\PDFRead
set OPT=-p prs505-p -i pdf -c "Mythology" -f "lrf" -m "portrait-full" -r "none" --no-dilate --dpi "600" --colorspace "rgb" --colors "256" --optimize

let me know what you guys think and if i should change something, decided to keep color as new reader will be coming with color.

thanks!
Student1 is offline   Reply With Quote
Old 03-21-2009, 11:21 PM   #221
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by Student1 View Post
Hi guys,

starting some massive conversion for pdf to lrf and wondering if these settings are the best that can be done to get good quality and more important as close as possible to full page on my prs 505. Here are the settings i ve been using (using command line as i batch it all for 300+ books so this takes weeks on 2 computers)

set LOC=C:\Program Files (x86)\PDFRead
set OPT=-p prs505-p -i pdf -c "Mythology" -f "lrf" -m "portrait-full" -r "none" --no-dilate --dpi "600" --colorspace "rgb" --colors "256" --optimize

let me know what you guys think and if i should change something, decided to keep color as new reader will be coming with color.

thanks!

I think you would benefit from the dilation (i.e. don't use --no-dilate) that PDFRead does especially if you get the update in post #1 of this thread in the file 'pdfread-MinFilter5-mod-bin.zip' and use 600 DPI.

The dilation bolds the text so that any reduction in size from it's original dimensions, say, 3000x4000 down to 520x640 can still retain some thickness of the original text.

Just experiment on a small page range like --first-page "31" --last-page "31" to see which setup you prefer.

The 'optimize .PNGs' will help with the resulting filesize without too much of a processing hit.

See these sample conversions of page 31 of a Archive.org (merryadventureso00pyle2.pdf) scanned book.
Attached Thumbnails
Click image for larger version

Name:	merryadventureso00pyle2-no-dilation.png
Views:	229
Size:	200.5 KB
ID:	26117   Click image for larger version

Name:	merryadventureso00pyle2-with-dilation.png
Views:	202
Size:	181.3 KB
ID:	26119   Click image for larger version

Name:	merryadventureso00pyle2-with-dilation-optimized png.png
Views:	208
Size:	177.8 KB
ID:	26121  
Attached Files
File Type: lrf merryadventureso00pyle2-no-dilation.lrf (201.5 KB, 173 views)
File Type: lrf merryadventureso00pyle2-with-dilation.lrf (182.2 KB, 185 views)
File Type: lrf merryadventureso00pyle2-with-dilation-optimized png.lrf (178.7 KB, 245 views)
nrapallo is offline   Reply With Quote
Old 03-21-2009, 11:42 PM   #222
Student1
Groupie
Student1 doesn't litterStudent1 doesn't litter
 
Posts: 159
Karma: 170
Join Date: Feb 2009
Device: PRS-505
Quote:
Originally Posted by nrapallo View Post
I think you would benefit from the dilation (i.e. don't use --no-dilate) that PDFRead does especially if you get the update in post #1 of this thread in the file 'pdfread-MinFilter5-mod-bin.zip' and use 600 DPI.

The dilation bolds the text so that any reduction in size from it's original dimensions, say, 3000x4000 down to 520x640 can still retain some thickness of the original text.

Just experiment on a small page range like --first-page "31" --last-page "31" to see which setup you prefer.

The 'optimize .PNGs' will help with the resulting filesize without too much of a processing hit.

See these sample conversions of page 31 of a Archive.org (merryadventureso00pyle2.pdf) scanned book.
Wow huge difference! You just saved me a week in processing power! Well i have to restart but atleast the result will be a lot better! Also i noticed the images did not take the whole page and most of the time i had to zoom once or twice on my device, is there an option so the images will fit the screen without loosing quality (ie stretching)? Again while using the same options without the no dilate this time, also picked up the update, missed the the first time!!
Student1 is offline   Reply With Quote
Old 03-22-2009, 12:16 AM   #223
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by Student1 View Post
Wow huge difference! You just saved me a week in processing power! Well i have to restart but atleast the result will be a lot better! Also i noticed the images did not take the whole page and most of the time i had to zoom once or twice on my device, is there an option so the images will fit the screen without loosing quality (ie stretching)? Again while using the same options without the no dilate this time, also picked up the update, missed the the first time!!
If the border is truly 'white', then it can be stripped off, but in my sample above (see attahced original page as a .jpg) , there is a black outlined box around the page so, in essence, there is 'no white border'.

A smug, tear, or dark spot may also thwart the stripping of the border as most definitely will a page number, header text or watermark. In these cases, the 'unpaper' option may serve you well with the 'pre-border' option mentioned a few pages back in this thread.
Attached Thumbnails
Click image for larger version

Name:	merryadventureso00pyle2_Page_031.jpg
Views:	181
Size:	164.3 KB
ID:	26122  

Last edited by nrapallo; 03-22-2009 at 12:30 AM. Reason: added actual page 31 used in my sample .png above
nrapallo is offline   Reply With Quote
Old 03-22-2009, 12:24 AM   #224
Student1
Groupie
Student1 doesn't litterStudent1 doesn't litter
 
Posts: 159
Karma: 170
Join Date: Feb 2009
Device: PRS-505
Yes thanks, i read about unpape, probably wont need it as my scans are very clean, well most of them anyway! I just started another batch on 2 computers... about 400 books now in all to process! Guess it will be a while !

Thanks again for your help! Great programm, thanks for the dedication!!!

I wonder are there any plans to output to epub? Seems more and more the format that might come on top!
Student1 is offline   Reply With Quote
Old 03-22-2009, 12:29 AM   #225
nrapallo
GuteBook/Mobi2IMP Creator
nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.nrapallo ought to be getting tired of karma fortunes by now.
 
nrapallo's Avatar
 
Posts: 2,956
Karma: 2530531
Join Date: Dec 2007
Location: Toronto, Canada
Device: REB1200 EBW1150 Device: T1 NSTG iLiad_v2 NC Device: Asus_TF Next1 WPDN
Quote:
Originally Posted by Student1 View Post
Yes thanks, i read about unpape, probably wont need it as my scans are very clean, well most of them anyway! I just started another batch on 2 computers... about 400 books now in all to process! Guess it will be a while !

Thanks again for your help! Great programm, thanks for the dedication!!!

I wonder are there any plans to output to epub? Seems more and more the format that might come on top!
Epub output is getting closer to a reality, but since the original author of PDFRead was going to add it in his v2.0 release of the program, I was holding off.

I may just go ahead and beat him to it, though. However, I will need to brush up on my python programming skills first.
nrapallo is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Need help using PDFRead daithi81 Workshop 8 10-16-2009 09:33 AM
Need help with PDFRead pfisterfarm PDF 8 03-23-2009 09:19 AM
pdfread cybook x3oo Cybook 2 03-09-2009 11:49 AM
PDFRead 1.7 released ashkulz PDF 87 03-12-2008 10:29 AM
PDFRead v5 available on Sourceforge Alexander Turcic PDF 3 04-08-2007 06:31 AM


All times are GMT -4. The time now is 12:00 PM.


MobileRead.com is a privately owned, operated and funded community.