View Single Post
Old 08-18-2015, 05:29 AM   #25
Nick_1964
Bookworm
Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.
 
Nick_1964's Avatar
 
Posts: 975
Karma: 768585
Join Date: Aug 2010
Location: Netherlands
Device: Sony prs-650, Kobo Glo HD (2x), Kobo Glo
Quote:
Originally Posted by PeterT View Post
I'd almost suggest returning the original epub to whence it came and buying a better version. It sounds like it's been through a lot PDF to epub conversion.
Found this inside:
/* Styles for document saved to a stream */
/* Generated by Aspose.Words for Java 11.11.0.0 */

So I know it is made by that Aspose.

*All other, I will try the suggestions for sure but until now the new version of Sigil (0.8.7.) does do a lot.
It is almost clean,but now left with a lot of blanc lines.
A lot of the <p class="dlct-000"> (from 000 till 972) contains margins like margin:12pt 0pt 3pt 5.5pt; but they are renumbered in every html so removing the values of margin:12pt 0pt 3pt 5.5pt; does help but also destroy whitespaces where they do belong.
So I am now trying to find html for html if there is one "dclt" that stands for a true paragraph so I can at least replace them for a </br> to maintain the real paragraphs,before removing them all.
If I remove all code for .css, rather in the html of as separate file, there is no way I can find and keep some of the original page lay-out.

For now the first part of the suggestion made by rubeus to use the search and replace with <span class="dlct-\d\d\d"> worked the most, then the auto repair of Sigil did the rest. Only there are now more then 100 different values for <p class="dlct-000">

As far as I know there is no "warranty" for an epub,it is coming from a small publisher, I can't ask a webshop "i want to buy this book,but can I see the code first" so if I buy it somewhere else it is no guarantee that I don't get exactly the same one.
For sure I want to try the plugin suggestions and cleanup but the internal editor works more different then the Sigil and I am used to that.
I do gonna use it but I try with Sigil first rather then to learn to use another editor.
Once I cleaned with Sigil I gonna import the "damaged" books into Calibre and start again there to see if it produces beter and work faster, so I do gonna use all the given options,because then I can learn it for future use.

*But I want to ask you kindly, for someone with my type of dislection, using 2 editors next to each other or learn to work with a new one is not so easy, that is why I use only one filemanager (Total commander) one mail program, one usenet program because if I am not i am overwhelmed by the new look and my brain will reset and I have only one year english lessons so sometimes I need a bit more then a oneliner,have some patient with me.
At the end, I could do 2 things,delete the books and say sorry to the girl,or give it to someone that will do it for me,but then..I will not learn.

Last edited by Nick_1964; 08-18-2015 at 05:35 AM.
Nick_1964 is offline   Reply With Quote