Can't convert Epub to Mobi

allowingtoo · 12-15-2013, 09:51 PM

I've tried ALL DAY every way possible. It's the last book of a 5 part series and I bought the other 4 books from Amazon. This one has not been converted through their site yet unfortunately or I would just buy it! I have the Epub version but while it opens perfectly in Adobe Digital - it will not convert to Mobi format.

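In Sigil I get EACH and every LINE starts out with

<p class="MsoNormal1 sgc-1"><span class="calibre2" xml:lang="EN-US" xmlns:xml="http://www.w3.org/XML/1998/namespace">

and then like six words and

</span></p>

Which ends up looking like this: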

The President's men said the original timing of the raid was bad because

of the initial, critical talks with Communist China. It was early

September before training began, and late November-after the

I have tried "remove Margins" in Calibre. Doesn't work. I've tried to format it to .txt. Doesn't work. Tried to save it as a Word Doc and change it there as something else, but I see that each Paragraph has a "border" around it.

If I hadn't given my Nook away, I'd just fire that up and finish reading the series. Sadly, that's not an option. Is there any way to remove all those Paragraphs without hand editing? Thanks

theducks · 12-15-2013, 10:05 PM

That is a cr*py quality book

This is a JOIN problem

</span></p> <<<< that is the (bogus) end of the Paragraph

The pre-packaged 'Join Paragraph' will not work because of the span there

Code:

Name=Cleanup/Join span Paras
Find="(?sm)([[:alpha:],])</span></p>\s*<p class=\"MsoNormal1\"><span class=\"calibre5\">([a-z])"
Replace=\1 \2

Adjust the class names between the escaped quotes (MsoNormal1 and calibre5 here) to match your document
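For anyone who wants to try the join outside Sigil first, here is a rough Python sketch of the same idea. It is only an illustration under a few assumptions: the sample string and the function name are made up, the class names are set to a document that uses calibre2 with an xml:lang attribute, and [[:alpha:]] is swapped for [A-Za-z] because Python's re module does not support POSIX character classes.

```python
import re

# Hypothetical sample: two "paragraphs" that are really one sentence
# broken at a line end, in the Word-style markup this kind of book uses.
sample = (
    '<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">'
    'summaries (known as DISUMs) until General Norman returned an hour later,'
    '</span></p>\n'
    '<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">'
    'wearing a look of grim satisfaction.</span></p>\n'
)

# A letter or comma right before the closing tags, then a new paragraph
# that starts with a lowercase letter: treat that as a bogus break.
JOIN = re.compile(
    r'([A-Za-z,])</span></p>\s*'
    r'<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">([a-z])'
)

def join_broken_paragraphs(html):
    """Replace each bogus paragraph break with a single space."""
    return JOIN.sub(r'\1 \2', html)

joined = join_broken_paragraphs(sample)
print(joined)
```

The opening tag in the pattern has to match the document exactly, attributes included, otherwise nothing is replaced.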

allowingtoo · 12-15-2013, 10:26 PM

Sigh. Thanks for validating me about the quality

Doesn't work or I'm just doing something wrong.

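<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">summaries (known as DISUMs) until General Norman returned an hour later,</span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">wearing a look of grim satisfaction.</span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US"> </span></p>

<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">"Now he believes me," Norman said.<span> </span> "We're putting our forces on alert</span></p>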

cybmole · 12-16-2013, 02:40 AM

I usually do these things in stages
1. remove all the class="calibre2" xml:lang="EN-US" i.e. find this & replace all with nothing. check that looks good & save, then
2. remove the spans: find <span>(.*?)</span> & replace with \1
3. now do the joins

I'd skip step 2 if there were maybe other, valid spans, & write something that looked for a non-sentence end & removed the closing span tag, the line break, & the next opening p tag. for dealing with the line breaks I usually just copy a chunk from code view, which has the line break embedded, & put that into find, then tweak it

so I'd grab:
,</span></p>
<p class="MsoNormal1"><span>

put that in find, then edit the , to be ([a-z,])
and replace with \1 (with a trailing space)
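The three stages could be sketched in Python roughly like this. It is a sketch, not a tested recipe for the actual book: the sample text and function name are hypothetical, and stage 2 strips every bare span tag instead of using a capture group, so that a span nested inside another span does not leave a stray closing tag behind.

```python
import re

# Hypothetical sample in the same markup as the snippet posted above.
sample = (
    '<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">'
    'summaries (known as DISUMs) until General Norman returned an hour later,'
    '</span></p>\n'
    '<p class="MsoNormal1"><span class="calibre2" xml:lang="EN-US">'
    'wearing a look of grim satisfaction.</span></p>\n'
)

def staged_cleanup(html):
    # Stage 1: replace the span attributes with nothing, leaving <span>.
    html = html.replace(' class="calibre2" xml:lang="EN-US"', '')
    # Stage 2: drop the now attribute-less span tags, keeping their text.
    html = re.sub(r'</?span>', '', html)
    # Stage 3: join where a paragraph ends on a lowercase letter or comma
    # (a non-sentence end) and replace the break with one space.
    html = re.sub(r'([a-z,])</p>\s*<p class="MsoNormal1">', r'\1 ', html)
    return html

cleaned = staged_cleanup(sample)
print(cleaned)
```

Checking the result after each stage, as suggested above, makes it easier to see which pattern is not matching.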

allowingtoo · 12-16-2013, 12:20 PM

Thank you all for your help. I really appreciate it.

I'm a little embarrassed to admit I gave up. There were also pictures embedded in the original books and well...anyway. So I started looking again today for another copy of the book. I thought maybe I could find the "original" copy before they did that awful conversion on it. And lo and behold - Smashwords had a copy for sale! I downloaded a preview version to run through Sigil to make sure it was not the same one, and this one was a true HTML copy.

Some day maybe I will learn Regex but this will not be the day
