MobileRead Forums > E-Book Software > Calibre

Old 09-30-2010, 01:22 PM   #811
arijon
Junior Member
arijon began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Sep 2010
Device: iPad
I have several textbooks scanned into relatively high-quality PDFs with OCR. They look good on my computer, but the files are too big to read on the iPad, so I've been trying to convert the PDFs to ePub while keeping the OCR text. I have read through the Calibre settings and can't figure out how to convert to ePub and keep the OCR. Any suggestions?
Old 09-30-2010, 01:32 PM   #812
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by arijon View Post
I have several textbooks scanned into relatively high-quality PDFs with OCR. They look good on my computer, but the files are too big to read on the iPad, so I've been trying to convert the PDFs to ePub while keeping the OCR text. I have read through the Calibre settings and can't figure out how to convert to ePub and keep the OCR. Any suggestions?
Use your PDF reader to save the PDF as text. That should save the OCR text from the original PDF and drop the page images. Then convert the text document. You say you have "high-quality PDF with OCR" and that "They look good on my computer." If you are looking at the scanned images of text, they will look good, but the OCR text, which normally is not displayed, may look awful, so this process may not work well.
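For anyone who prefers the command line, here is a minimal sketch of the two steps above. It assumes Poppler's `pdftotext` and calibre's `ebook-convert` are both installed and on the PATH (neither tool is named in the post above), and the helper names are made up for illustration. The result can only be as good as the hidden OCR layer.

```python
import subprocess
from pathlib import Path


def build_commands(pdf_path):
    """Return the two command lines: extract the text layer, then convert it."""
    pdf = Path(pdf_path)
    txt = pdf.with_suffix(".txt")
    epub = pdf.with_suffix(".epub")
    return [
        # Assumption: pdftotext (Poppler) is installed; it writes the PDF's text layer to txt.
        ["pdftotext", str(pdf), str(txt)],
        # Assumption: calibre's ebook-convert is on the PATH.
        ["ebook-convert", str(txt), str(epub)],
    ]


def pdf_text_to_epub(pdf_path):
    """Run both steps in order, raising an error if either tool fails."""
    for command in build_commands(pdf_path):
        subprocess.run(command, check=True)
```

Calling `pdf_text_to_epub("textbook.pdf")` would leave `textbook.txt` and `textbook.epub` next to the PDF. Check the intermediate text file before converting; if it is full of OCR errors, the EPUB will be too.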
Old 09-30-2010, 01:57 PM   #813
arijon
Junior Member
arijon began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Sep 2010
Device: iPad
Quote:
Originally Posted by Starson17 View Post
Use your PDF reader to save the PDF as text. That should save the OCR text from the original PDF and drop the page images. Then convert the text document. You say you have "high-quality PDF with OCR" and that "They look good on my computer." If you are looking at the scanned images of text, they will look good, but the OCR text, which normally is not displayed, may look awful, so this process may not work well.

So then what is the best way to make a clean, searchable eBook for my computer and iPad? I have a Fujitsu ScanSnap S1500 scanner.
Old 09-30-2010, 02:05 PM   #814
Perkin
Guru
Perkin calls his or her ebook reader Vera.
 
Posts: 657
Karma: 64171
Join Date: Sep 2010
Location: Kent, England, Sol 3, ZZ9 plural Z Alpha
Device: Sony PRS-300, Kobo Aura HD, iPad (Marvin)
If you're scanning a book, use OCR software, save to RTF with formatting preserved, add the RTF to calibre, and convert to your preferred format.
Old 09-30-2010, 02:25 PM   #815
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by arijon View Post
So then what is the best way to make a clean, searchable eBook for my computer and iPad? I have a Fujitsu ScanSnap S1500 scanner.
I thought it already was OCR'd? (Often when someone says they have OCR'd PDF, what they really mean is they have images of pages with text, and hidden OCR searchable text. The OCR searchable text is usually awful quality.)

Either way, the answer is simple - get OCR text and use that. The problem is to get good quality OCR text. Once you have that, it's simple. ABBYY FineReader is one of the best. Adobe Acrobat is commonly used. Take your pick, scan, correct any OCR errors, add any formatting the OCR missed and convert.
Old 10-01-2010, 07:55 PM   #816
Metamorphosis
Junior Member
Metamorphosis began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Apr 2009
Device: Sony
Quote:
Originally Posted by Perkin View Post
There's a problem with 7.20; either go back to 7.19 or wait for the updated 7.21.


I downloaded the new version tonight and it's working now.
Old 10-09-2010, 08:49 PM   #817
wn0x
Member
wn0x has a complete set of Star Wars action figures.
 
Posts: 13
Karma: 378
Join Date: Mar 2010
Device: Sony PRS-505
When is the best time to correct spelling errors in an EPUB file: before importing it into Calibre, or afterwards, by correcting the file where it resides in my user profile's Calibre library?

The goal is to eliminate re-importing the epub file into Calibre every time I find a spelling/formatting error I missed.

Thanks,

Rich
Old 10-09-2010, 08:56 PM   #818
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.
 
Posts: 31,109
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by wn0x View Post
When is the best time to correct spelling errors in an EPUB file: before importing it into Calibre, or afterwards, by correcting the file where it resides in my user profile's Calibre library?

The goal is to eliminate re-importing the epub file into Calibre every time I find a spelling/formatting error I missed.

Thanks,

Rich
You could edit in place (backups highly recommended).
Type T (for Tweak epub), explode, spell-check the pieces, and repackage.
or
Type O (for Open), and when the file manager opens, right-click and select Edit With Sigil (assumes you have Sigil installed).

BTW, you can simply replace an existing format. Open the metadata editor and drop the new version of a format onto the formats window to replace the current one.
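
For the curious, the explode and repackage steps can be sketched in a few lines of Python, since an EPUB is a ZIP archive. This is only an illustration of the idea, not how calibre's Tweak epub is implemented; the function names are made up. The one detail that matters is that the `mimetype` entry goes first in the archive and is stored uncompressed.

```python
import zipfile
from pathlib import Path


def explode_epub(epub_path, work_dir):
    """Unpack the EPUB (a ZIP archive) into work_dir for editing."""
    with zipfile.ZipFile(epub_path) as archive:
        archive.extractall(work_dir)


def repackage_epub(work_dir, epub_path):
    """Zip work_dir back into an EPUB, writing 'mimetype' first and uncompressed."""
    work_dir = Path(work_dir)
    with zipfile.ZipFile(epub_path, "w") as archive:
        # The EPUB container expects the mimetype entry first, stored without compression.
        archive.write(work_dir / "mimetype", "mimetype",
                      compress_type=zipfile.ZIP_STORED)
        for path in sorted(work_dir.rglob("*")):
            name = path.relative_to(work_dir).as_posix()
            if path.is_file() and name != "mimetype":
                archive.write(path, name, compress_type=zipfile.ZIP_DEFLATED)
```

Work on a copy: explode into a scratch folder, fix the XHTML files there, and repackage to a new file name before replacing the format in the library.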

Last edited by theducks; 10-09-2010 at 08:58 PM.
Old 10-10-2010, 01:59 AM   #819
wn0x
Member
wn0x has a complete set of Star Wars action figures.
 
Posts: 13
Karma: 378
Join Date: Mar 2010
Device: Sony PRS-505
Quote:
Originally Posted by theducks View Post
You could edit in place (backups highly recommended).
Type T (for Tweak epub), explode, spell-check the pieces, and repackage.
or
Type O (for Open), and when the file manager opens, right-click and select Edit With Sigil (assumes you have Sigil installed).

BTW, you can simply replace an existing format. Open the metadata editor and drop the new version of a format onto the formats window to replace the current one.
Thanks, I'll give that a try. I didn't think that Sigil had spell check; did I miss something? I have been extracting the EPUB files and using KompoZer (which has spell check), which definitely adds a few steps.

Rich
Old 10-10-2010, 09:55 AM   #820
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.
 
Posts: 31,109
Karma: 60406498
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by wn0x View Post
Thanks, I'll give that a try. I didn't think that Sigil had spell check; did I miss something? I have been extracting the EPUB files and using KompoZer (which has spell check), which definitely adds a few steps.

Rich
No, you are correct, Sigil does not have a spell check.
I would suggest going the "T" route to handle the file (packaging) operations, and using KompoZer from there.
Old 10-18-2010, 07:36 AM   #821
simonbcn
Simón
simonbcn began at the beginning.
 
Posts: 19
Karma: 10
Join Date: Aug 2009
Location: Barcelona, Spain
Device: Sony PRS-650 / Papyre 6.1
Styles (from FB2 to EPUB)?

Hi,
I have converted one test FB2 (with several styles) to EPUB with Calibre.
FB2 screenshot:


The problem is that when I convert it to EPUB with Calibre, the output EPUB is plain text only. Does the FB2 to EPUB conversion not carry over any styles?
Regards.

Last edited by simonbcn; 10-18-2010 at 07:39 AM.
Old 10-18-2010, 10:59 AM   #822
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 45,424
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
It does support styles. You should open a ticket and attach your FB2.
Old 10-25-2010, 04:01 AM   #823
Andrew Brooks
Junior Member
Andrew Brooks began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Oct 2010
Device: iPad and iPhone
Hi,
I'm a photographer who's pretty new to ePubs/iBooks. I've been working on an ePub of my work for the last few weeks and have it looking good on my iPhone and iPad, and now I'm really interested in making it available to my network through the iBooks store. When I ran it through http://threepress.org/document/epub-validate/, it reported this error for every sound and video file in the book:

ERROR: New Worlds.epub/OPS/chapter-1.xhtml(5): unknown element "audio" from namespace "http://www.w3.org/1999/xhtml"

and

ERROR: New Worlds.epub/OPS/chapter-30.xhtml(5): unknown element "video" from namespace "http://www.w3.org/1999/xhtml"

Also, is there a size limit on files that can be uploaded to iBooks? Mine features embedded video and sound, so it works out at just over 80 MB.

I put the iBook together using iWork Pages.

Any help would be very appreciated,

Thanks for your time,

www.andrewbrooksphotography.com
Old 10-25-2010, 10:50 AM   #824
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.
 
Posts: 45,424
Karma: 27757236
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Neither audio nor video is officially supported in EPUB at this time. So if you want your EPUB to validate, you can't include them in the HTML.
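To see which chapters would trip the validator, a rough sketch like the following lists the `audio` and `video` elements in one XHTML file. It is not the validator's own logic, and the function name is made up; it simply looks for those two tags in the XHTML namespace quoted in the error messages, and assumes the file parses as well-formed XML.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"
UNSUPPORTED = ("audio", "video")


def find_media_elements(xhtml_data):
    """Return the names of audio/video elements found, in document order."""
    root = ET.fromstring(xhtml_data)
    found = []
    for element in root.iter():
        for name in UNSUPPORTED:
            # ElementTree spells namespaced tags as "{namespace}localname".
            if element.tag == "{%s}%s" % (XHTML_NS, name):
                found.append(name)
    return found
```

Pass it the file's raw bytes (for example `Path("OPS/chapter-1.xhtml").read_bytes()`), since ElementTree rejects text strings that carry an XML encoding declaration.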
Old 10-29-2010, 12:42 AM   #825
ramjet1953
Member
ramjet1953 began at the beginning.
 
Posts: 13
Karma: 10
Join Date: Dec 2009
Location: Thailand
Device: Hanlin V3
Two problems when converting from PDF to ePub

Hi, Guys!

I am experiencing two problems when converting from a PDF file to an ePub.

1. After conversion, I have noticed that in any word containing 'll', the 'll' is converted to 'l '.
For example, the word 'Hellenistic' is converted to 'Hel enistic'.

2. When I load the epub file onto my Hanlin V3 the text is always Centre Justified, regardless of how I specify it in the menus. I have checked the .css and it is specified as left justified.
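
To make problem 1 concrete, here is a rough sketch of the kind of clean-up pass that could be run over the converted text. It is a workaround only and says nothing about the cause; the function name and the word list are assumptions, and a real run would need a proper dictionary. It rejoins a "l " split only when the repaired word appears in the supplied vocabulary, and it does not handle words that end in 'll'.

```python
import re

# A word fragment ending in "l", one space, then (looked ahead) the rest of the word.
SPLIT_LL = re.compile(r"\b(\w*l) (?=(\w+)\b)")


def repair_split_ll(text, vocabulary):
    """Rejoin 'Hel enistic' as 'Hellenistic' when the result is a known word."""
    def fix(match):
        candidate = match.group(1) + "l" + match.group(2)
        if candidate.lower() in vocabulary:
            # Restore the missing "l" and drop the stray space.
            return match.group(1) + "l"
        return match.group(0)
    return SPLIT_LL.sub(fix, text)
```

With a vocabulary of `{"hellenistic"}`, the phrase "The Hel enistic period" comes back as "The Hellenistic period", while ordinary pairs such as "all good" are left alone.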

Any suggestions?

Regards,
Roger

Last edited by ramjet1953; 10-29-2010 at 12:43 AM. Reason: typo