Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 12-15-2013, 09:51 PM   #1
allowingtoo
Bibliophile
allowingtoo began at the beginning.
 
Posts: 154
Karma: 10
Join Date: Sep 2010
Device: Amazon
Exclamation Can't convert Epub to Mobi

I've tried ALL DAY every way possible. It's the last book of a 5 part series and I bought the other 4 books from Amazon. This one has not been converted through their site yet unfortunately or I would just buy it! I have the Epub version but while it opens perfectly in Adobe Digital - it will not convert to Mobi format.

In Sigil I get EACH and every LINE starts out with

<p class="MsoNormal1 sgc-1"><span class="calibre2" xml:lang="EN-US" xmlns:xml="http://www.w3.org/XML/1998/namespace"> and then like six words and </span></p>

Which ends up looking like this:

The President's men said the original timing of the raid was bad because

of the initial, critical talks with Communist China. It was early

September before training began, and late November-after the

I have tried "remove Margins" in Calibre. Doesn't work. I've tried to format it to .txt. Doesn't work. Tried to save it as a Word Doc and change it there as something else, but I see that each Paragraph has a "border" around it.

If I hadn't given my Nook away, I'd just fire that up and finish reading the series. Sadly, that's not an option. Is there anyway to remove all those Paragraphs without hand editing? Thanks
allowingtoo is offline   Reply With Quote
Old 12-15-2013, 10:05 PM   #2
theducks
Well trained by Cats
theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.theducks ought to be getting tired of karma fortunes by now.
 
theducks's Avatar
 
Posts: 30,908
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
Quote:
Originally Posted by allowingtoo View Post
I've tried ALL DAY every way possible. It's the last book of a 5 part series and I bought the other 4 books from Amazon. This one has not been converted through their site yet unfortunately or I would just buy it! I have the Epub version but while it opens perfectly in Adobe Digital - it will not convert to Mobi format.

In Sigil I get EACH and every LINE starts out with

<p class="MsoNormal1 sgc-1"><span class="calibre2" xml:lang="EN-US" xmlns:xml="http://www.w3.org/XML/1998/namespace"> and then like six words and </span></p>

Which ends up looking like this:

The President's men said the original timing of the raid was bad because

of the initial, critical talks with Communist China. It was early

September before training began, and late November-after the

I have tried "remove Margins" in Calibre. Doesn't work. I've tried to format it to .txt. Doesn't work. Tried to save it as a Word Doc and change it there as something else, but I see that each Paragraph has a "border" around it.

If I hadn't given my Nook away, I'd just fire that up and finish reading the series. Sadly, that's not an option. Is there anyway to remove all those Paragraphs without hand editing? Thanks
That is a cr*py quality book

This is a JOIN problem

</span></p> <<<< that is the (bogus) end of the Paragraph

The pre-packaged 'Join Paragraph' will not work because of the span there

Code:
Name=Cleanup/Join span Paras
Find="(?sm)([[:alpha:],])</span></p>\s*<p class=\"MsoNormal1\"><span class=\"calibre5\">([a-z])"
Replace=\1 \2
Adjust the green between the quotes to match your document
theducks is offline   Reply With Quote
Old 12-15-2013, 10:26 PM   #3
allowingtoo
Bibliophile
allowingtoo began at the beginning.
 
Posts: 154
Karma: 10
Join Date: Sep 2010
Device: Amazon
Sigh. Thanks for validating me about the quality

Doesn't work or I'm just doing something wrong.

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">summaries (known as DISUMs) until General Norman returned an hour later,</span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">wearing a look of grim satisfaction.</span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">&nbsp;</span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">"Now he believes me," Norman said.<span>&nbsp;</span> "We're putting our forces on alert</span></p>
allowingtoo is offline   Reply With Quote
Old 12-16-2013, 02:40 AM   #4
cybmole
Wizard
cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.cybmole ought to be getting tired of karma fortunes by now.
 
Posts: 3,720
Karma: 1759970
Join Date: Sep 2010
Device: none
I usually do these things in stages
1. remove all the class="calibre2" xml:lang="EN-US" i.e. find this replace all with nothing. check that looks good & save, then
2. remove the spans : find <span>[(,*)]</span> & replace with \1
3. now do the joins

i'd skip step 2 if there were n=maybe other, valid spans, & write something that looked for a non-sentence end & remove the closing span tag, the line break, & the next opening p tag. for dealing with the line breaks I usually cust copy a chunk from code view, which has the line break embeded & put that into find, then tweak it

so I'd grab:
,</span></p>

<p class="MsoNormal1"><span>

put that in find, then edit the , to be ([a-z,])
and replace with \1 ( with a trailing space)
cybmole is offline   Reply With Quote
Old 12-16-2013, 12:20 PM   #5
allowingtoo
Bibliophile
allowingtoo began at the beginning.
 
Posts: 154
Karma: 10
Join Date: Sep 2010
Device: Amazon
Thank you all for your help. I really appreciate it.

I'm a little embarrassed to admit I gave up. There were also pictures embedded in the original books and well...anyway. So I started looking again today for another copy of the book. I thought maybe I could find the "original" copy before they did that awful conversion on it. And lo and behold - Smashwords had a copy for sale! I downloaded a preview version to run through Sigil to make sure it was not the same one, and this one was a true HTML copy.

Some day maybe I will learn Regex but this will not be the day

allowingtoo is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Mobi to Epub Failed to Convert Klaw Conversion 4 10-26-2013 02:49 AM
Best way to convert from Mobi to ePub? crashnburn Kindle Formats 36 08-29-2013 10:52 AM
CLI Convert EPUB -> Mobi tuebl Conversion 6 01-09-2012 03:14 PM
Unable to convert Mobi to Epub Japes Conversion 27 07-19-2011 12:14 AM
Failure to convert from mobi to epub jlmwrite Calibre 9 01-06-2011 11:12 PM


All times are GMT -4. The time now is 01:02 PM.


MobileRead.com is a privately owned, operated and funded community.