Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 01-28-2010, 09:53 AM   #1
wildbilly
Junior Member
wildbilly began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: PRS600
EPUB: End of paragraph after letters with accent?

Just started using Calibre (Great Stuff!) with my PRS600.

Only minor issue i found converting .pdf italian books in .EPUB is that seems Calibre automatically add an "end of paragraph" after any letter with accent (backward or forward i.e. à è é ..).

I did try the conversion to Ascii in Look&Fell Menu but it only replace the letters with accent with same letter without accent and leave the paragraph break ...

I am now fixing this manually using Sigil after conversion, but i was wondering if i am missing any smarter way to use Calibre Conversion options.

Thank you in advance for the support!
wildbilly is offline   Reply With Quote
Old 01-29-2010, 08:27 AM   #2
wildbilly
Junior Member
wildbilly began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: PRS600
Further details.

Debugging the conversion, if in the input files there is a sentence like:

"Quello che sto cercando di dire è soltanto che
tutto quello che è stato non sara più perchè
ormai tutto è diverso"


The "parsed" file will be formatted as:

" Quello che sto cercando di dire è soltanto che tutto quello che è stato non sara più perchè

ormai tutto é diverso".


It means that not all the letters with accent are considered as end of paragraph but only letters with accent at the end of each input row.

Anybody with hints ?!?
Any possibility to customize output plugin?

Thank you again,
Andrea

Last edited by wildbilly; 01-29-2010 at 08:30 AM.
wildbilly is offline   Reply With Quote
Advert
Old 01-29-2010, 08:43 AM   #3
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 73,982
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
Quote:
Originally Posted by wildbilly View Post
Further details.

Debugging the conversion, if in the input files there is a sentence like:

"Quello che sto cercando di dire è soltanto che
tutto quello che è stato non sara più perchè
ormai tutto è diverso"


The "parsed" file will be formatted as:

" Quello che sto cercando di dire è soltanto che tutto quello che è stato non sara più perchè

ormai tutto é diverso".


It means that not all the letters with accent are considered as end of paragraph but only letters with accent at the end of each input row.

Anybody with hints ?!?
Any possibility to customize output plugin?

Thank you again,
Andrea
Are you sure the line is really one line? In your example, it looks like 3 lines. Can you attach a snippit of your book so we can also try it out?
JSWolf is online now   Reply With Quote
Old 01-29-2010, 09:15 AM   #4
wildbilly
Junior Member
wildbilly began at the beginning.
 
Posts: 3
Karma: 10
Join Date: Jan 2010
Device: PRS600
Attached is a page excerpted by the html file created in the input directory during debugging.

On my parsed file i found wrong "end of paragraph" on the second line (this is understandable as is an esclamation mark within a spoken sentence), on line 4 (ending with word "necessità") and line 12 (ending with word "inchinò").

This is just an example: please note that i tried with several .pdf files from different sources and behaviour is always the same.
Attached Files
File Type: zip excerpted from Input html.zip (3.0 KB, 265 views)

Last edited by wildbilly; 01-29-2010 at 09:18 AM.
wildbilly is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
[Old Thread] accent letters in spanish elgabo Calibre 7 01-16-2012 08:39 AM
PDF to EPUB - spurious paragraph breaks RichieTheK Calibre 2 09-08-2010 11:27 AM
LRF to EPUB: Each line is a paragraph tag wudaben Calibre 5 07-14-2010 07:04 PM
Stop line wrapping at quotes at the end of a paragraph sherman ePub 6 05-13-2010 02:52 PM
Changing paragraph spacing in DRMed epub? salty-horse ePub 4 09-15-2009 06:48 AM


All times are GMT -4. The time now is 03:23 PM.


MobileRead.com is a privately owned, operated and funded community.