|
|
Thread Tools | Search this Thread |
10-14-2010, 01:21 AM | #1 |
Enthusiast
Posts: 31
Karma: 10
Join Date: Oct 2010
Location: Arizona, USA
Device: None
|
How to allow our books to be converted to all ereader formats with calibre?
On our website http://www.thefolkofyore.com/ there are several books available for free reading and downloading as PDF files. Naively, as soon as we discovered calibre, we placed an invitation to our website visitors to use it to convert these files to the ebook reader of choice, including a link to the calibre website. For example: http://www.thefolkofyore.com/2-Sacre...adsacmiss.html
Of course, it's not that simple. The present PDF files don't convert well, exhibiting many of the symptoms described in the calibre manual, such as headers appearing within the text. In addition to a PDF file suitable for reading and printing from a computer, it looks like we need to offer an additional file dedicated for ebooks. To keep things simple, we'd like to offer one file in an optimal DRM-free format*that people can download and convert to their particular ebooks as quickly and easily as possible. It would be greatly appreciated if someone would recommend a general course of action, starting with what type of file to make available for conversion, and also address a few issues that have come up so far as I've been trying to do this on my own. The original source files are in Word 2008 for Mac. I'm using a MacBook Pro with OS 10.5.8, Adobe Acrobat Professional 8.2.5, calibre 0.7.23 and have Sigil 0.2.4, though I know little about Sigil and haven't used it yet. Since I don't own an ereader, to check results I'm using Kindle for Mac 1.0.0 Beta 1 (27214) and Barnes and Noble eReader Version 1.1. Eventually I'd like to check results in other formats, but right now it's complicated enough with these 2 alone. So far, I've gone back into the*Word*source files and replaced direct formatting with styles, removed headers, changed section breaks from "next page" to "continuous" and removed the table of contents (since it didn't seem to be doing anything once the files were converted to ereader formats). I've tried quite a few test files, some in HTML, some in PDF which I placed in calibre to convert.* I've tried various iterations. Converting ZIP and PDF to EPUB, MOBI, & PDB was completed in under 3 minutes. Many times*EPUB to MOBI & PDB was still working after 10 and up to 20 minutes so I stopped the job in these cases. Why is this happening? I had hoped to use EPUB*as the source format since that's*recommended in the manual and also the format Sigil uses. The best result I've achieved so far has come from converting Word to HTML, placing that in Calibre, and converting the ZIP file to MOBI. The appearance on the Kindle reader is close to my original, though the text is justified, whereas the original is left adjusted. All the chapters show up in the Kindle Table of contents, however the foreword and preface don't. All conversions to PDB viewed on the B&N reader are unsatisfactory because all italicization is lost. Also the text is justified in block form, double spaced between paragraphs, whereas in the original the first line of each paragraph is indented with no additional space between paragraphs. This is a just a sample of what I've encountered. Does anyone have any suggestions?*Thanks a lot for any help you can offer. |
10-14-2010, 02:31 AM | #2 |
Wizard
Posts: 4,552
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
|
HTML will almost always be the best source format if you can get a good version. Resulting conversions should look just like they would in a browser as long as you have not used a feature not supported by the particular format. Calibre always converts any other format to HTML as an intermediate step in conversion anyway.
If using Word for the source files, then saving as Web Page (filtered) produces the best results. As to specifics of particular formats hopefully someone else will have a view. |
Advert | |
|
10-14-2010, 02:31 AM | #3 |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
If you were going to choose a single format it should be either epub or mobi. I would suggest epub because it's an industry/community standard, and you have good control over the markup to make it look quite close to the original book.
If you went that route you could convert it to from html to epub using Calibre and then finalize your book using Sigil. If you're not comfortable with css the justification is easiest to change with Sigil, and you can tweak the Table of Contents further there. As to why the conversion process seemed to be hanging while converting from epub to mobi/pdb, we'd need more information. You could either open a bug directly at bugs.calibre-ebook.com, with an attached file, or you could just post the output of conversion log here. You can get that by clicking on jobs in the lower left hand corner of the GUI, selecting the job that's hanging, and copying all the conversion details. The main reasons I'd recommend mobi is because the Kindle probably makes Mobi the most widely used format, and mobi to epub conversions are generally flawless, while epub to mobi conversions depend on whether or not newer css features were used. For either format you could get the table of contents to include the foreword and preface by changing the chapter detection xpath in the Calibre Structure detection options under conversion: This is the default: Code:
//*[((name()='h1' or name()='h2') and re:test(., 'chapter|book|section|part\s+', 'i')) or @class = 'chapter'] Code:
//*[((name()='h1' or name()='h2') and re:test(., 'chapter|foreword|preface|book|section|part\s+', 'i')) or @class = 'chapter'] |
10-14-2010, 04:54 AM | #4 | |
Wizard
Posts: 3,130
Karma: 91256
Join Date: Feb 2008
Location: Germany
Device: Cybook Gen3
|
Quote:
That being said, I'd suggest offering PDF, ePub and Mobi for download- that way, you'd have the most relevant formats all available. |
|
10-14-2010, 05:24 PM | #5 |
Jr. - Junior Member
Posts: 586
Karma: 2000358
Join Date: Aug 2010
Location: Alabama
Device: Archos, Asus, HP, Lenovo, Nexus and Samsung tablets in 7,8 and 10"
|
[QUOTE=ldolse;11play a mobi file61634]
As to why the conversion process seemed to be hanging while converting from epub to mobi/pdb, we'd need more information. I haven't had a epub to mobi conversion hang up and quit but any conversion to mobi runs at a snails pace. Sometimes taking 6-7 minutes to complete. However, I have had the viewer hang up trying to display a mobi file. I converted a 2.4MB epub to a 3.8MB mobi file that took at least 10 minutes (I didn;t time it). When I displayed them with the viewer the epub file was ready to read in a second or two. The mobi file took 6.5 minutes before it could be read (FBReader was ready to read the same file almost immediately). The viewers reflow is painfully slow on the file when I resize the window or when I display or hide the TOC (not so on the epub version). I am running Calibre 7.23 and get the same result on my Windows7 or Vista machines. Regards - John |
Advert | |
|
10-14-2010, 05:49 PM | #6 |
Enjoying the show....
Posts: 14,270
Karma: 10462841
Join Date: Jun 2008
Location: Arizona
Device: A K1, Kindle Paperwhite, an Ipod, IPad2, Iphone, an Ipad Mini & macAir
|
Welcome to MobileRead, webfolk.
You're sure to get some great suggestions here. |
10-15-2010, 12:34 AM | #7 |
US Navy, Retired
Posts: 9,864
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Nexus 7
|
I have had the rare epub conversion take 6 hours to complete. Every time this happens it is because the epub was created from a source originally created with MS Word. When you save Word documents as html the amount of crap that is created in the css file is astronomical. Saving the Word document as web page filtered is better but still has a lot of baggage.
|
10-15-2010, 12:46 AM | #8 | |
Enthusiast
Posts: 44
Karma: 1590
Join Date: Oct 2010
Device: nook
|
[QUOTE=Jabby;1162645]
Quote:
|
|
10-15-2010, 11:13 AM | #9 | |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Quote:
Dwanthny hit the nail on the head with the comments about formats that stemmed from word, word creates some hideous css/embedded/nested styles that takes forever to convert. Books with really clean formatting convert quite quickly. Filtered html should help, a number of users have said it's better to use OpenOffice to do the Word doc to HTML conversion instead. YMMV |
|
10-15-2010, 02:11 PM | #10 | |
ZCD BombShel
Posts: 4,793
Karma: 8293322
Join Date: Jan 2009
Location: The Frozen North (aka Illinois, USA)
Device: iPad, STB Kindle Oasis
|
Quote:
|
|
10-15-2010, 07:06 PM | #11 | |
Wizard
Posts: 1,613
Karma: 6718479
Join Date: Dec 2004
Location: Paradise (Key West, FL)
Device: Current:Surface Go & Kindle 3 - Retired: DellV8p, Clie UX50, ...
|
Quote:
Personally, I would recommend that you look to posting both MOBI and ePub. These will cover the vast majority of users. If your market is dominantly Europe, then it might be adequate to post only ePub. One thing to consider is that Amazon promotes their reader toward a non-geek market, likely resulting in a larger percentage of users that aren't skilled in conversion compared to the ePub community. |
|
10-15-2010, 07:52 PM | #12 |
Resident Curmudgeon
Posts: 73,887
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Forget PDF. That's step 1.
Step 2 is to use ePub as a source format. Then when Calibre converts to something else, it does a good job as long as the ePub is well made. |
10-15-2010, 11:16 PM | #13 | |
Wizard
Posts: 1,337
Karma: 123455
Join Date: Apr 2009
Location: Malaysia
Device: PRS-650, iPhone
|
Quote:
If the epub doesn't do anything complex then it's true the conversions are very clean. However if you use any advanced layout features you can have problems. Dropcaps get screwed up. Floating elements get stuck inline, which may cause real problems in the text layout. Variable margins in Mobi are a mess, which is why there are endless threads with users asking how to fix the margins of their epub sourced files. Even if the majority of the book uses no margins if there is some content in the book with variable margins this could be a problem. Epub has left and right margins support, mobi only has left nested blockquotes/indenting... There are probably a good list of other gotchas, those are just the ones off the top of my head. Because mobi layout features are a subset of epub if you're going to choose a single format to support mobi is probably better if you don't want to support epub conversions that may have the types of problems listed above. I also agree that two formats would be nicer (I'm a sony/epub user), but the OP did ask what is the best 'single' format to support. |
|
10-17-2010, 03:23 PM | #14 | |
Enthusiast
Posts: 31
Karma: 10
Join Date: Oct 2010
Location: Arizona, USA
Device: None
|
Thank you all for taking the time and sharing your knowledge. Each one of you has contributed to my growing understanding and knowledge and bringing this closer to a successful resolution. I've been studying your comments and, given my newness to all involved here, it's taking me some time to assimilate and act on your suggestions. As I implement them I'll post the results here.
First: Quote:
More to come as it does to me. Thanks again, all! |
|
10-19-2010, 03:47 AM | #15 | ||||
Enthusiast
Posts: 31
Karma: 10
Join Date: Oct 2010
Location: Arizona, USA
Device: None
|
After working with your suggestions, I've come to the conclusion we need to offer more than one format, which preliminarily would be EPUB because it's supported in all formats except Kindle and MOBI for the Kindle. Also because of what dwig said:
Quote:
Some questions: Opening the EPUB file in the Barnes and Noble viewer, the lines of text are shifted and indented strangely: many lines look like they're centered. How can this be corrected? A PDB file converted from either HTML or EPUB viewed in the Calibre viewer or B&N reader loses all font formatting (no bold, italics, font size variations). There's no book cover or publisher logo GIF. Why is this happening? How accurately does the Calibre viewer depict the actual book in each format? The MOBI book looks different in Calibre than it does in Kindle for Mac: for example, Calibre changes the font from serif to non-serif. Just how poorly or how well can I expect an EPUB file to be displayed on the many different devices? Quote:
Quote:
@font-face { for a gazillion different fonts. Does it make any sense to delete these in Sigil, keeping references only to fonts actually used?font-family: "Courier New"; panose-1: 2 7 3 9 2 2 5 2 4 4 } Quote:
Thanks again for all your help! |
||||
Tags |
calibre conversion, epub conversion, html2epub, pdf conversion, word conversion |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Managing Converted Formats | objectcentral | Calibre | 5 | 08-03-2010 09:42 AM |
EReader Pro Not Recognizing Converted Books? | Balerion300 | Calibre | 1 | 07-18-2010 02:03 PM |
Calibre converted books won't open in BookDesigner | Ticallion | Workshop | 1 | 07-13-2010 03:29 AM |
HTML converted books causing crashes - Calibre or PRS-505 issue? | Katelyn | Calibre | 15 | 10-20-2009 06:08 PM |
Why do some Calibre converted books open in Word? | Matth3w | Sony Reader | 2 | 01-15-2009 07:20 AM |