View Single Post
Old 08-17-2015, 08:47 AM   #3
Nick_1964
Bookworm
Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.Nick_1964 ought to be getting tired of karma fortunes by now.
 
Nick_1964's Avatar
 
Posts: 975
Karma: 768585
Join Date: Aug 2010
Location: Netherlands
Device: Sony prs-650, Kobo Glo HD (2x), Kobo Glo
Quote:
Originally Posted by Ripplinger View Post
If there are a lot of other instances of "dict-###" with various numbers, you could use regex the same way and get them all.

Then just let Sigil clean up the stylesheet to delete any unused styles and try it.

I've never found any automated software that can make the proper decisions about what stays and what should go.
I tried with Sigil which I use the most but the changing "dicts" are everywhere and by removing a lot of span's I always get the warning that the html is not well formed and need to be cleaned.

Due to my dyslection I really don't understand the regex way of searching.
In the very beginning Sigil was working with wildcards like dict* but that isn't the case anymore..
Also it doesn't use any .css all the rules are in the html:
Spoiler:

p.dlct-085 {line-height:110%; margin-bottom:3pt; margin-right:3pt; margin-top:12pt; text-align:justify}
p.dlct-060 {line-height:110%; margin-bottom:3pt; margin-right:2.35pt; margin-top:12pt; text-align:justify}
p.dlct-046 {line-height:110%; margin-bottom:3pt; margin-right:0pt; margin-top:12pt; text-align:justify}
p.dlct-083 {line-height:110%; margin-bottom:3pt; margin-right:3.15pt; margin-top:12pt; text-align:justify}
p.dlct-091 {font-size:1.27em; line-height:110%; margin-bottom:3pt; margin-right:0pt; margin-top:12pt; text-align:justify}
p.dlct-058 {line-height:110%; margin-bottom:3pt; margin-right:2.5pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.8249999em; margin-bottom:0pt; text-align:center}
p.dlct-050 {line-height:110%; margin-bottom:3pt; margin-right:3.45pt; margin-top:12pt; text-align:justify}
p.dlct-035 {line-height:110%; margin-bottom:3pt; margin-right:3.1pt; margin-top:12pt; text-align:justify}
p.dlct-051 {line-height:110%; margin-bottom:3pt; margin-right:3.4pt; margin-top:12pt; text-align:justify}
p.dlct-086 {line-height:110%; margin-bottom:3pt; margin-right:2pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.8249999em; margin-bottom:0pt; text-indent:11pt}p.dlct-061 {line-height:110%; margin-bottom:3pt; margin-right:2.05pt; margin-top:12pt; text-align:justify}
p.dlct-056 {line-height:110%; margin-bottom:3pt; margin-right:2.2pt; margin-top:12pt; text-align:justify}
p.dlct-090 {font-size:1.09em; line-height:110%; margin:12pt 0pt 3pt 17.1pt; text-align:justify}

p.null {line-height:0.8249999em; margin-bottom:0pt}
p.dlct-078 {line-height:110%; margin:12pt 0pt 3pt 5.5pt; text-align:justify}
p.dlct-031 {text-align:justify}
p.dlct-055 {line-height:110%; margin-bottom:3pt; margin-right:3.05pt; margin-top:12pt; text-align:justify}
p.dlct-076 {line-height:110%; margin-bottom:3pt; margin-right:2.45pt; margin-top:12pt; text-align:justify}
p.dlct-020 {font-size:0.91em; line-height:115%; margin-bottom:0pt; text-align:justify}
p.dlct-053 {line-height:110%; margin-bottom:3pt; margin-right:2.9pt; margin-top:12pt; text-align:justify}
p.dlct-029 {font-size:1.09em; line-height:110%; margin-bottom:3pt; margin-right:2.35pt; margin-top:12pt; text-align:justify}
p.dlct-088 {line-height:110%; margin-bottom:3pt; margin-right:2.6pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.9916666em; margin-bottom:0pt; margin-top:0.85pt; text-align:center}
p.dlct-084 {line-height:110%; margin-bottom:3pt; margin-right:3.3pt; margin-top:12pt; text-align:justify}
p.dlct-022 {font-size:1.27em; line-height:110%; margin-bottom:3pt; margin-top:12pt; text-align:justify}
p.dlct-057 {line-height:110%; margin-bottom:3pt; margin-right:2.1pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.92499995em; margin-bottom:0pt; margin-left:5.8pt; margin-right:4.75pt; text-align:justify}
p.dlct-077 {line-height:110%; margin-bottom:3pt; margin-right:2.25pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.7833333em; margin-bottom:0pt; margin-top:0.35pt; text-align:center}
p.dlct-074 {line-height:110%; margin-bottom:3pt; margin-right:2.3pt; margin-top:12pt; text-align:justify}
p.dlct-059 {line-height:110%; margin-bottom:3pt; margin-right:2.4pt; margin-top:12pt; text-align:justify}
p.dlct-094 {font-size:0.91em; line-height:110%; margin-bottom:3pt; margin-top:12pt; text-align:justify}
p.dlct-079 {line-height:110%; margin:12pt 2.4pt 3pt 5.8pt; text-align:justify}
p.dlct-103 {font-size:1.09em; line-height:110%; margin-bottom:3pt; margin-right:0pt; margin-top:12pt; text-align:justify}
p.dlct-096 {line-height:110%; margin-bottom:3pt; margin-right:3.35pt; margin-top:12pt; text-align:justify}
p.dlct-032 {line-height:110%; margin-bottom:3pt; margin-right:2.15pt; margin-top:12pt; text-align:justify}
p.dlct-089 {line-height:110%; margin:12pt 0pt 3pt 17.1pt; text-align:justify}
p.dlct-105 {font-size:1.09em; line-height:110%; margin-bottom:3pt; margin-right:2.3pt; margin-top:12pt; text-align:justify}
p.dlct-034 {font-size:1.09em; line-height:110%; margin-bottom:3pt; margin-right:3.1pt; margin-top:12pt; text-align:justify}
p.dlct-080 {line-height:110%; margin-bottom:3pt; margin-right:2.65pt; margin-top:12pt; text-align:justify}
p.null {line-height:0.9333333em; margin-bottom:0pt; margin-left:5.8pt; margin-right:2.7pt; text-align:justify}
p.dlct-010 {font-size:0.91em; line-height:normal; margin-bottom:0pt; margin-left:5.8pt; margin-right:223.3pt; text-align:justify}
p.null {line-height:0.9083333em; margin-bottom:0pt; margin-left:5.5pt; margin-top:0.1pt}
p.dlct-082 {line-height:110%; margin-bottom:3pt; margin-right:3.25pt; margin-top:12pt; text-align:justify}
p.dlct-027 {font-size:1.64em; line-height:110%; margin-bottom:3pt; margin-right:0pt; margin-top:12pt; text-align:justify}
span.dlct-102 {font-family:Garamond, serif}
span.dlct-064 {font-family:Verdana, sans-serif; font-style:italic; letter-spacing:-0.1pt}
span.dlct-018 {font-family:'Times New Roman', serif; font-size:0.91em; letter-spacing:-0.2pt}
span.dlct-003 {font-size:0.86em}
span.dlct-013 {font-family:'Times New Roman', serif; font-size:0.91em; letter-spacing:-0.1pt}
span.dlct-049 {font-family:Verdana, sans-serif; letter-spacing:0.15pt}
span.dlct-065 {font-family:Verdana, sans-serif; font-style:italic; letter-spacing:0.05pt}

This is only a small part of one html... and they use the margins-right to position the words (or it seems that way) maybe it is better to put it all in Calibre and convert it to txt and then build it again proper with Sigil if that is possible.
I really don't know what the publisher tries to do with it...

Yes.. I really tried to understand the regex method.. but it just doesn't stay in the part I reserved for it in my brains
And Imagine what happens with all the classes when I put it out with the KoboTouchExtended driver to a kepub ..

Last edited by Nick_1964; 08-17-2015 at 08:53 AM.
Nick_1964 is offline   Reply With Quote