Old 07-26-2009, 10:16 AM   #16
itimpi
Wizard
Posts: 4,553
Karma: 950151
Join Date: Nov 2008
Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader)
Quote:
Originally Posted by Statch View Post
Now I have what is probably a stupid question. One of the many things I love about Calibre is being able to have all formats of the book show up in one place. (Meaning, when I click on the title in Calibre, I can see which formats of it I have and open any of them.) When I use the command line to produce the epub format from the html format, and then use the Gui to add the book to the library, it sees it as another title with the same name, rather than as an alternate format of the same title. (Am I making sense?) How can I make that right?
If you already have the book in the library, do not add it via "Add Book". Instead, find the book in the GUI and use the "Edit Metadata" dialog, which gives you the option to add a new format to an existing book entry.
Old 07-26-2009, 10:29 AM   #17
carlobee
Fanatic
Posts: 535
Karma: 972
Join Date: Jul 2009
Device: The new iPad
Quote:
Originally Posted by Statch View Post
I should also specify that I've been having this problem with all books I've used ereader2html on, and they are just standard run-of-the-mill books published by large publishing houses.
Exactly. I've been having problems with some of these too, but some work perfectly.
Old 07-26-2009, 10:32 AM   #18
user_none
Sigil & calibre developer
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
Quote:
Originally Posted by Statch View Post
Using the GUI, I had tried specifying cp1252, which produced funny characters in front of the quotes and em-dashes, and I tried not specifying anything, which removed the quotes and em-dashes altogether. When I did it from the command line, using the syntax the previous poster mentioned, I got perfect output.
Seems to be a problem that multiple people are running into... What OS are you using?

Quote:
Originally Posted by Statch View Post
Now I have what is probably a stupid question. One of the many things I love about Calibre is being able to have all formats of the book show up in one place. (Meaning, when I click on the title in Calibre, I can see which formats of it I have and open any of them.) When I use the command line to produce the epub format from the html format, and then use the Gui to add the book to the library, it sees it as another title with the same name, rather than as an alternate format of the same title. (Am I making sense?) How can I make that right?
Click the title in the library, click edit metadata, on the top right there is a list of all formats, use the add button to add a format to that book.
Old 07-26-2009, 02:12 PM   #19
Statch
Connoisseur
Posts: 96
Karma: 2084
Join Date: Aug 2008
Location: Georgia, USA
Device: Kindle PW2, Samsung Galaxy 3, Kindle Fire HD
Thanks to both of you for the answer on how to make one record with all the formats.

I'm using Windows XP.
Old 07-26-2009, 03:00 PM   #20
tfarrell
Member
Posts: 12
Karma: 10
Join Date: Jul 2009
Device: kindle 2
My experience with ereader books is that some of them apparently "just work" because they don't use any of the offending characters: they use three periods instead of an ellipsis character, a hyphen instead of an em-dash, etc. I think it has more to do with the way the individual book is formatted than with the fact that it comes from ereader.

I'm not brilliant with character encodings, as it's not something I deal with often in my programming, but I would think that if we put in a function that detects unusual characters and replaces them with their character entity encodings (for example, &rsquo; for a right single quote), it might work a lot better. (I had been doing a little of this manually to make my books work better and was going to write a short program to do it for me, but am not going to bother now that I see that I can do it from the command line and it works fine.)
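
A minimal sketch of the kind of function described above, in Python. It assumes the text has already been decoded correctly (for example from cp1252), and the function name to_entities is made up for illustration. It is not part of Calibre or ereader2html.

```python
from html.entities import codepoint2name


def to_entities(text):
    """Replace every non-ASCII character with an HTML character reference."""
    out = []
    for ch in text:
        cp = ord(ch)
        if cp < 128:
            out.append(ch)  # plain ASCII passes through unchanged
        elif cp in codepoint2name:
            out.append("&%s;" % codepoint2name[cp])  # named entity, e.g. &rsquo;
        else:
            out.append("&#%d;" % cp)  # numeric reference as a fallback
    return "".join(out)


print(to_entities("It\u2019s here \u2014 wait\u2026"))
```

Because the result is pure ASCII, it should read the same whichever single-byte encoding the importing program guesses.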
Old 07-26-2009, 03:58 PM   #21
kovidgoyal
creator of calibre
Posts: 45,359
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
See http://calibre.kovidgoyal.net/ticket/2923
Old 07-26-2009, 08:40 PM   #22
FizzyWater
You kids get off my lawn!
Posts: 4,220
Karma: 73492664
Join Date: Aug 2007
Location: Columbus, Ohio
Device: Oasis 2 and Libra H2O and half a dozen older models I can't let go of
I convert almost exclusively from eReader and the GUI works for me...but I'm still zipping my HTML and image files before importing them to Calibre. Would that make the difference? (I read your ticket notes and haven't had to do this...yet).
Old 07-26-2009, 08:49 PM   #23
kovidgoyal
creator of calibre
Posts: 45,359
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
If you zip them up beforehand, specifying the input encoding in the GUI will work, as the HTML is not processed when adding the book.
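
For anyone who wants to script the zipping step, here is a rough Python sketch. The function name zip_book and the folder and archive names in the usage note are made up for illustration; the only assumption is that the HTML file and its images sit together under one folder.

```python
import zipfile
from pathlib import Path


def zip_book(folder, archive):
    """Pack every file under `folder` into `archive`, keeping relative paths."""
    folder = Path(folder)
    archive = Path(archive)
    with zipfile.ZipFile(archive, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(folder.rglob("*")):
            # Skip directories, and skip the archive itself if it lives in `folder`.
            if path.is_file() and path.resolve() != archive.resolve():
                zf.write(path, path.relative_to(folder).as_posix())
    return archive
```

Calling zip_book("mybook_html", "mybook.zip") would produce an archive to add to the library in place of the bare HTML file.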
Old 07-26-2009, 08:53 PM   #24
FizzyWater
You kids get off my lawn!
Posts: 4,220
Karma: 73492664
Join Date: Aug 2007
Location: Columbus, Ohio
Device: Oasis 2 and Libra H2O and half a dozen older models I can't let go of
Is there a benefit to not zipping the files? What kind of pre-processing does Calibre do to the HTML that would make it beneficial to allow it to happen?
Old 07-26-2009, 09:03 PM   #25
kovidgoyal
creator of calibre
Posts: 45,359
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Basically, it collects all the linked-to files for you.
Old 07-27-2009, 01:05 PM   #26
Mordak
Junior Member
Posts: 2
Karma: 10
Join Date: Jul 2009
Device: PRS-505
Quote:
Originally Posted by tfarrell View Post
My experience with ereader books is that some of them apparently "just work" because they don't use any of the offending characters: they use three periods instead of an ellipsis character, a hyphen instead of an em-dash, etc. I think it has more to do with the way the individual book is formatted than with the fact that it comes from ereader.
I've had exactly the same experience - some books just work, and some books have messed up characters. It depends on the publisher, and a single bookstore sometimes has books that use the odd characters and sometimes does not.

For me, I work around the issue with just a text editor. I use SubEthaEdit to open the document, use the select menu at the bottom of the window to Reinterpret as Windows Latin 1, and then use the same menu to Convert to UTF-8. It only takes a few seconds, and afterwards they import into Calibre and convert to ePub without difficulty.
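
The same reinterpret-then-convert step can be sketched in Python for people without SubEthaEdit. This assumes the file really is Windows Latin 1 (cp1252); the function name and the file names in the usage note are made up for illustration.

```python
from pathlib import Path


def reencode_cp1252_to_utf8(src, dst):
    """Read `src` as cp1252 bytes and write the same text to `dst` as UTF-8."""
    raw = Path(src).read_bytes()
    # Raises UnicodeDecodeError on the few byte values cp1252 leaves undefined,
    # which is a hint that the file is not actually cp1252.
    text = raw.decode("cp1252")
    Path(dst).write_bytes(text.encode("utf-8"))
    return text
```

For example, reencode_cp1252_to_utf8("book.html", "book-utf8.html"). If the HTML already carries a charset declaration, that declaration would need to be updated to match.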

Sometimes I have found books that use weird &#XXX codes to specify quotes and dashes. These display fine in a web browser but do not display correctly on my PRS-505, in which case I just use SubEthaEdit to find/replace the offending characters, then re-import and convert. A little laborious, but it gets the job done.
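
One possible way to automate that find/replace is to expand the numeric codes into the characters they stand for. This is only a sketch: the thread does not say what the codes were replaced with, and whether literal characters display correctly on the PRS-505 is an assumption. The function name is made up. References for <, > and & are left alone so the markup is not broken.

```python
import html
import re

# Matches decimal (&#147;) and hexadecimal (&#x2014;) character references.
NUMERIC_REF = re.compile(r"&#(?:\d+|[xX][0-9a-fA-F]+);")
MARKUP_CHARS = {"<", ">", "&"}


def expand_numeric_refs(text):
    """Replace numeric character references with the characters they denote."""
    def repl(match):
        ch = html.unescape(match.group(0))
        # Keep references that would turn into HTML syntax characters.
        return match.group(0) if ch in MARKUP_CHARS else ch
    return NUMERIC_REF.sub(repl, text)
```

Python's html.unescape follows the HTML5 rules, so Windows-style codes such as &#147; and &#148; come out as proper curly quotes. The result should then be saved as UTF-8.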
Old 07-27-2009, 04:06 PM   #27
ficbot
Wizard
Posts: 2,409
Karma: 4132096
Join Date: Sep 2008
Device: Kindle Paperwhite/iOS Kindle App
All right, I have a new system. These books have been giving me more trouble lately, not sure why. And using NeoOffice to generate an HTML file has been resulting in extra line breaks. So, on a Mac, here is what I have been doing:

1) Run the macro to extract the html from the secure file
2) Open in Firefox, select all text, paste into NeoOffice as unformatted text
3) Save as RTF
4) Open RTF file in Pages, turn on search for invisibles
5) Search for 3 paragraph marks, replace with 2
6) Save again as RTF, select all and copy
7) Open KompoZer (web editing program)
8) Paste in the RTF text
9) Give a final once-over, save as HTML
10) Import into Calibre, edit metadata, convert to LRF
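
Step 5 could also be done with a few lines of Python on a plain-text copy of the book. This is a sketch, not the exact Pages operation: it treats a paragraph mark as a newline and collapses any run of three or more into two, rather than replacing exactly three with two. The function name is made up.

```python
import re


def collapse_blank_runs(text):
    """Collapse runs of three or more line breaks down to two."""
    # Normalize Windows (CRLF) and old Mac (CR) line endings first.
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    return re.sub(r"\n{3,}", "\n\n", text)
```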

Kovid says I could probably convert the exploded file straight to epub and not get the line spacing issues and centering issues I get with LRF. Have not tried that yet.
Old 07-27-2009, 04:29 PM   #28
Kostas
Still wondering why
Posts: 253
Karma: 800
Join Date: Jun 2009
Location: Athens, Greece
Device: PRS 505, (BlackBerry Bold ?)
Quote:
Originally Posted by ficbot View Post
All right, I have a new system. These books have been giving me more trouble lately, not sure why. And using NeoOffice to generate an HTML file has been resulting in extra line breaks.
If you can use MS Word, I think that saving the HTML file with the "save as html, filtered" option resolves the extra line breaks problem and can spare you this whole workaround.
Of course, you would still have to replace each occurrence of 3 line breaks with 2 before saving the file.
Old 07-28-2009, 01:39 AM   #29
mjmcleod
Connoisseur
Posts: 55
Karma: 11501
Join Date: Jul 2009
Location: Australia
Device: Galaxy Tab
To fix the ereader2html.py script itself, open it in a text editor and look for the line that contains:

final = '<html><body>\n'

There should be only one such line.

Change it to read:

final = '<html><head><META HTTP-EQUIV="CONTENT-TYPE" CONTENT="text/html; charset=cp1252"/></head><body>\n'

(all one line.)

Now the HTML files it produces will properly declare themselves to be in cp1252 and the Calibre GUI will handle them correctly.
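
If you cannot edit the script (for example, because you run a packaged version of it), the same declaration could be added to an HTML file after it has been generated. A minimal Python sketch, assuming the file starts with the unpatched '<html><body>' opening quoted above; the function name is made up.

```python
OLD_OPENING = "<html><body>"
NEW_OPENING = ('<html><head><META HTTP-EQUIV="CONTENT-TYPE" '
               'CONTENT="text/html; charset=cp1252"/></head><body>')


def declare_cp1252(html_text):
    """Insert the cp1252 charset declaration into ereader2html-style output."""
    # Leave the text alone if the top of the file already declares a charset.
    if "charset=" in html_text[:1024].lower():
        return html_text
    # Only the first occurrence is replaced, matching the one-line patch.
    return html_text.replace(OLD_OPENING, NEW_OPENING, 1)
```

Read the file with encoding="cp1252" and write it back with the same encoding so the bytes of the book text are unchanged.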
Old 07-29-2009, 07:44 PM   #30
Statch
Connoisseur
Posts: 96
Karma: 2084
Join Date: Aug 2008
Location: Georgia, USA
Device: Kindle PW2, Samsung Galaxy 3, Kindle Fire HD
You're all so smart! Post #21, I looked at the resolution of the help ticket, and I'm afraid I just don't get it. What you said to add shows up for me as a slider bar?

#29, what a beautiful solution! I'm using a version of ereader2html that some nice person made into a little gui, so it doesn't work for me, but I wish it did. (Unfortunately, I don't remember where I downloaded the thing from, so can't credit the creator. I click on a file called "er" and just answer some questions.)

#28, using MS Word and saving the file as a 'web page, filtered' worked for me. It loses some of the metadata, but it's easy to put that back in Calibre. Problem solved.
All times are GMT -4.

