Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 10-13-2009, 09:14 PM   #1
gt_undergrad
Member
gt_undergrad began at the beginning.
 
Posts: 10
Karma: 10
Join Date: Sep 2009
Device: Kindle II
Weird Calibre Bugs

I have discovered some weird mistakes Calibre makes when it converts an HTML file to a MOBI file (Kindle). I don't know whether it is just my machine or you all have the same problem. Perhaps you can try.

(BUG 1) When you have something like Xxxx Street or Xxxx Boulevard or Xxxx Drive or Xxxx Highway. It creates a line break so the street name becomes a separate line in italic. See the example I have below.

Interestingly, if you have "... the Xxxx Highway ..." this does not occur.

(BUG 2) When you have two Capitalized words like Lake Huron or Dade County, the conversion makes it into a single word (LakeHuron, DadeCounty).

Please try it yourself. Here is a paragraph to try:

--------------------------------------------------

I’m living at 1924 Wilkenson Street. This is a good place, which is next to Highland Boulevard. Yes it may not be so convenient to go to Johnson Drive. But it is very close to Buford Highway.

I live very close to Lake Jesup. It is a great fishing lake in the Harrison County.

---------------------------------------------------

Paste it in Word, and save it as a webpage. Send it to Calibre to convert to mobi (assuming you have a Kindle). You'll see what I mean. Here is what the converted text looks like:

====================================
I’m living at
1924 Wilkenson Street
. This is a good place, which is next to
Highland Boulevard
. Yes it may not be so convenient to go to
Johnson Drive
. But it is very close to
Buford Highway
.

I live very close to LakeJesup. It is a great fishing lake in the HarrisonCounty. This is a very middle class county.
===================================

Thanks,
GT
gt_undergrad is offline   Reply With Quote
Old 10-13-2009, 09:35 PM   #2
JMikeD
Evangelist
JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.
 
JMikeD's Avatar
 
Posts: 473
Karma: 15000
Join Date: Jul 2008
Device: Various and sundry
I just tried it and it converted from HTML to MOBI with no problems.
JMikeD is offline   Reply With Quote
Advert
Old 10-13-2009, 11:25 PM   #3
gt_undergrad
Member
gt_undergrad began at the beginning.
 
Posts: 10
Karma: 10
Join Date: Sep 2009
Device: Kindle II
I just saw that I'm using version 0.6.12 of Calibre. Do you use the latest version or an older version? Maybe I'll try on 6.17.

Also, it is possible that I have to click some box to get the correct conversion. So far I have not clicked any box in the conversion process. On the other hand, I don't get any problem unless what I describe occur in the Word converted webpage file.

I am attaching the two files: word file with the desired text (I couldn't upload the webpage file. This has to be converted to webpage in Word first), and the converted mobi file.

Could it be my machine?

Help!

GT

calibre test - Unknown.mobi

calibre test.doc
gt_undergrad is offline   Reply With Quote
Old 10-13-2009, 11:49 PM   #4
JMikeD
Evangelist
JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.JMikeD is as sexy as a twisted cruller doughtnut.
 
JMikeD's Avatar
 
Posts: 473
Karma: 15000
Join Date: Jul 2008
Device: Various and sundry
I'm using the current version, whatever that is (0.6.17?).
JMikeD is offline   Reply With Quote
Old 10-14-2009, 08:21 PM   #5
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,216
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
Hi GT,
There may be more than one solution to this, but have you tried saving your Word document as type "WebPage filtered" rather than type "WebPage"? This may get rid of the problem.

Jackie
jackie_w is offline   Reply With Quote
Advert
Old 10-14-2009, 08:38 PM   #6
gt_undergrad
Member
gt_undergrad began at the beginning.
 
Posts: 10
Karma: 10
Join Date: Sep 2009
Device: Kindle II
Dear Jackie:

Many thanks for the suggestion. Indeed I played around just a moment ago, and you beat me to it! So have you had the same problem?

Here is what I had found:

(1) Saving it as a web page filtered instead of just a webpage completely eliminates the problem! Great call, Jackie!

(2) Saving it as RTF is also good.

Now I did upgrade to Calibre 0.6.17, and the problem persisted.

So what could the problem with saving it as a "webpage"? Is it because of some of the tags in it that scew things up?

GT
gt_undergrad is offline   Reply With Quote
Old 10-15-2009, 03:54 AM   #7
itimpi
Wizard
itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.itimpi ought to be getting tired of karma fortunes by now.
 
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Quote:
Originally Posted by gt_undergrad View Post
So what could the problem with saving it as a "webpage"? Is it because of some of the tags in it that scew things up?
If you save from Word as Web Page rather than Web Page (filtered) you get a vast number of additional HTML items added in the file that are really Microsoft specific.

If you try loading the results of the two different types of save into a Text editor you can easily see the vast difference in the generated HTML.
itimpi is offline   Reply With Quote
Old 10-15-2009, 07:14 AM   #8
jackie_w
Grand Sorcerer
jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.jackie_w ought to be getting tired of karma fortunes by now.
 
Posts: 6,216
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
Glad to be of help, GT. I am fairly new to ebooks, HTML and Calibre myself and have spent many a long hour trying to find ways to get nice, neat HTML out of Word so I could store it in Calibre. The solution I suggested was the biggest help. I'll also pass on a few more findings in case they are of use to you or anyone else.

1. If you style your main text paragraphs with the Word Style "Normal(Web)" rather than "Normal" then the HTML output will look like

<p>This is my paragraph</p>

instead of

<p class=MsoNormal>This is my paragraph</p>

which is easier to read if you have to edit the HTML afterwards.

2. Make sure you apply Word's built-in Styles "Heading 1", "Heading 2" etc to style your Titles and Chapter headings, as these will result in neat HTML like

<h1>My Book Title</h1>
<h2>Chapter 1</h2>

You can then use these h1, h2 etc tags to tell Calibre how to detect chapters and page breaks.
If you don't like the existing Word "Heading n" styles then modify the Word style, don't be tempted to modify your text directly.

3. If you want to go a step further, you can create your own standard CSS file containing all your styling info. Once you've got it just right for your needs you don't need to touch it again. Just link to it in each Word doc before saving as type Web-filtered. Then you can strip out all the generated HTML that Word produces between (and including) the
<style> and </style> tags
(and that can be an awful lot of code!) which makes for a smaller HTML file.

4. I didn't find RTF very satisfactory as a format for storing in Calibre. The main reasons being that some formatting was lost when converted to LRF/EPUB, namely, centre- and right-alignment, graphics, line-breaks. I don't know whether this is still the case.

Jackie

P.S. I think the problems you were having with the Street names etc were to do with Word's smart-tags "features". You could investigate switching these off.

Last edited by jackie_w; 10-15-2009 at 07:20 AM.
jackie_w is offline   Reply With Quote
Old 10-15-2009, 10:42 PM   #9
gt_undergrad
Member
gt_undergrad began at the beginning.
 
Posts: 10
Karma: 10
Join Date: Sep 2009
Device: Kindle II
Hi Jackie:

Thanks a lot for those tips! You don't look like someone who is new to ebooks, as you are far more advanced than I'm

GT
gt_undergrad is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Weird formatting issues - Sigil .epub in Calibre viewer december Sigil 9 06-18-2010 04:04 PM
Weird 'Select All' bahaviour in Calibre Stinger Calibre 6 05-14-2010 06:21 PM
Calibre newbie -- bugs, comments, and feature requests :) partnerinflight Calibre 6 04-19-2010 11:01 PM
Weird characters on PRS-600 after transfer from calibre MarcWinter Calibre 2 02-08-2010 09:18 AM
Calibre 0.5.14 on Mac - Weird Send to PRS-505 Result? danviento Calibre 0 07-16-2009 06:28 PM


All times are GMT -4. The time now is 08:29 PM.


MobileRead.com is a privately owned, operated and funded community.