01-12-2010, 05:32 PM | #1 |
Zealot
Posts: 112
Karma: 105
Join Date: Jan 2010
Device: Kindle 3 WiFi
|
PDF to ePub problem
I converted something in PDF to ePub (This only happened with 1 file)
This is how is came out. Hello this is a test. Anyone know how to fix this? |
01-12-2010, 06:08 PM | #2 |
Connoisseur
Posts: 61
Karma: 36
Join Date: Jan 2010
Location: Reston, Virginia, US
Device: ipad
|
I suspect you can fix this by going to the "pdf input" section on the conversion dialog and changing the line unwrapping value to 0.5. That worked for me.
|
Advert | |
|
01-12-2010, 11:05 PM | #3 |
Zealot
Posts: 112
Karma: 105
Join Date: Jan 2010
Device: Kindle 3 WiFi
|
Thank you ac4lt. That worked.
|
01-13-2010, 09:55 AM | #4 |
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2010
Device: HTC PDA-phone
|
Broken lines in the epub output
Hi,
I am new user of Calibre, and new member in forum. Regret my english isn't very good, my original language is hungarian. Sorry! My deal is to convert Adobe Indesign CS2 origin books to epub format. CS3 and CS4 contains epub export, but CS2 not. Avoid of problem I exported text from InDesign in PDF format, and converted the PDF by Calibre. In the Calibre converted epub file are broken lines, the last word of paragraph stays in a stand alone line, after this line begins in a newer line the next paragraph. The broken line isn't in all paragraph, only in the few, and I can not found some sytem in that. If I change something in the layout (e.g. shorter lines) and tha structure of paragraph, the error on the earlier place go out, but on newer place coming in. Do You found already this effect? What can I do? |
01-13-2010, 10:01 AM | #5 |
Resident Curmudgeon
Posts: 73,983
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
The problem is there is NO program that can convert PDF to any other format without errors. So once you convert the PDF, the only choice you have is to a/b compare the PDF to the output and fix all the errors in the conversion. I know it's tedious. But, there's nothing else you can do.
|
Advert | |
|
01-20-2010, 08:51 PM | #6 |
Evangelist
Posts: 475
Karma: 590
Join Date: Aug 2009
Location: Bangkok, Thailand
Device: Kindle Paperwhite
|
|
01-20-2010, 09:49 PM | #7 |
Connoisseur
Posts: 61
Karma: 36
Join Date: Jan 2010
Location: Reston, Virginia, US
Device: ipad
|
I *think* it's deciding at what percentage of a line length it needs to pull in the next line, but I'm not exactly sure how it's implemented.
|
01-20-2010, 10:09 PM | #8 |
Bookaholic
Posts: 14,391
Karma: 54969924
Join Date: Oct 2007
Location: Minnesota
Device: iPad Mini 4, AuraHD, iPhone XR +
|
Does CS2 have export to XHTML? I can't remember if that was added in CS2 or CS3. If so you could do that and use Sigil to format your ePub. Or perhaps exporting to XML and converting that to HTML and going from there or something? Not sure if there's a converter that will go from XML straight to ePub.
|
01-21-2010, 11:33 AM | #9 |
Evangelist
Posts: 412
Karma: 546196
Join Date: Mar 2009
Location: UK canal boat
Device: sony prs505, prs650, kobo Glo HD liseuses
|
Fwiw, I've converted 20+ pdf titles to epub. I haven't found any foolproof, quick way of doing this. My rather laborious procedure now is:
1. If necessary, remove the pdf restrictions which prevent exporting as text; 2. Use the File | Save As Text menu option to create a .txt version; 3. Using the text editor of your choice, remove page headers, page numbers, extraneous front matter. Then find and change quotes, apostrophes and accents etc., to html named entities (e.g., " --> “ 4. Create an epub file using Sigil (thank you so, so much Valloric) 5. Load into Calibre (thank you Kovid); 6. Transfer to Sony reader; 7. Read, enjoy, bookmark the typos; 8. Back to Sigil, correct all the typos etc., repeat steps 4 to 7. This works, it's labour intensive, and during the process you may lose the will to live. |
01-21-2010, 12:39 PM | #10 | |
.
Posts: 3,408
Karma: 5647231
Join Date: Oct 2008
Device: never enough
|
Quote:
I am curious though, why do you go all the way back to text? Don't you lose italics, bold, etc.? Or do you put that back in during editing? |
|
01-21-2010, 01:30 PM | #11 |
Grand Sorcerer
Posts: 11,742
Karma: 6997045
Join Date: Jan 2010
Location: Notts, England
Device: Kobo Libra 2
|
I have had good luck converting single-column PDFs by:
1. Crop margins of the PDF to the text area. 2. Save as RTF 3. Use Calibre to convert RTF to EPUB 4. Use Sigil to fix line breaks across page breaks (and a few others) This method conserves all the character formatting, and (with one exception) the 10 or so files I have converted hasn't resulted in a large amount of repair work. The one case involved fixing up chapter headings. The original document had chapters in the form ROMAN_NUMERAL. CHAPTER TITLE followed by a few empty lines. The easiest way for me to fix this was to use VIM's global search & replace on the .html that came out of the epub conversion. I used a regexp that matched the two lines and replaced them with a single line wrapped in <h1></h1> tags. I tried using sigil, but couldn't figure out how to make a multi-line match expression (I admit I didn't look for a long time). |
01-22-2010, 02:49 AM | #12 | |
Junior Member
Posts: 2
Karma: 10
Join Date: Jan 2010
Device: HTC PDA-phone
|
HTML is my friend
Quote:
|
|
01-22-2010, 03:11 AM | #13 | |
MR Drone
Posts: 1,613
Karma: 15612282
Join Date: Oct 2007
Location: DRONEZONE
Device: PB360+, Huawei MP5, Libra H20
|
Quote:
Just in Case you want to do it a more simple way: I myself have converted quite a few books from PDF to epub. like alicE said convert it to a text format. Then, I pop it into Calibre. Conver it to epub and it's ready to read. Not sure about you but I care not if there are mistakes in quotes/indentations etc...... Yes I am a heathen I drink instant coffee. BUT for me content is more important than the bits and bobs... sum up: 1. convert PDF to text 2. Convert .txt format to epub with Calibre 3. Read.......... |
|
01-24-2010, 06:59 AM | #14 | |
Evangelist
Posts: 412
Karma: 546196
Join Date: Mar 2009
Location: UK canal boat
Device: sony prs505, prs650, kobo Glo HD liseuses
|
Quote:
With respect to the pain, whilst asprin helps, a single malt whisky is even better. |
|
01-25-2010, 03:22 PM | #15 |
Enthusiast
Posts: 41
Karma: 10
Join Date: Jan 2010
Device: Kobo Glo HD / Kobo Libra H2O
|
Does calibre recognize *.doc?
Why replace special caracters when using *.txt? The original is correct. Thank you for your help. Last edited by avresbo; 01-25-2010 at 03:28 PM. |
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
PDF 2 EPUB - font problem | sulka | Calibre | 18 | 09-16-2010 06:20 AM |
PDF to Epub (problem with pages) | violentlyserene | Calibre | 1 | 08-22-2010 10:38 AM |
Problem with accents converting PDF to EPUB | madeira | Calibre | 0 | 07-09-2010 05:15 PM |
Problem converting pdf to epub | smartin | Calibre | 3 | 05-02-2010 06:55 AM |
PDF to ePub (New line problem) | Dark123 | Calibre | 3 | 02-13-2010 08:41 PM |