Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 08-12-2013, 04:10 PM   #31
Bob Houston
Junior Member
Bob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it is
 
Bob Houston's Avatar
 
Posts: 8
Karma: 2062
Join Date: Aug 2011
Location: Queens, NYC
Device: Kindle
Quote:
Originally Posted by meme View Post

If I recall correctly, the nbsp character is replaced with an entity as soon as the file is opened, otherwise it would be lost by later routines.
I inserted the Doctype entry at the beginning of the xhtml and html files inside my test epub and retested.

With the inclusion of the Doctype entry, as has been stated previously, everything worked fine. There was no 'not well formed' message, the nbsp entries were included, and the file validated.

My question is - Could the presence of 'Doctype' be noted, and inserted if not present, as the very first thing when the xhtml or html files are parsed?

Bob
Bob Houston is offline   Reply With Quote
Old 08-12-2013, 04:59 PM   #32
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 6,308
Karma: 4898871
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
Quote:
Originally Posted by Bob Houston View Post
My question is - Could the presence of 'Doctype' be noted, and inserted if not present, as the very first thing when the xhtml or html files are parsed?
I'm afraid it's not that easy. For example, if the file contains any SVG code (as in an SVG-wrapped cover image), then, as far as I know, there is no appropriate DOCTYPE and none should be used.

But I'm not so sure a DOCTYPE is needed in order to use entities. It might be needed if you want to ensure XHTML compliance on its own, but inside an ePub the requirements for XHTML documents seem to be a bit different, and I'm not claiming I fully understand it. What does epubcheck say?
Jellby is offline   Reply With Quote
 
Advertisement
Old 08-12-2013, 06:58 PM   #33
Bob Houston
Junior Member
Bob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it is
 
Bob Houston's Avatar
 
Posts: 8
Karma: 2062
Join Date: Aug 2011
Location: Queens, NYC
Device: Kindle
Quote:
Originally Posted by Jellby View Post
I'm afraid it's not that easy.
Jellby, I think you lost me not to far after the above. I did run my test epub through EpubCheck after it was cleaned by Sigil and there were no problems noted.

At this point, I think I'm beyond my knowledge level. I just wanted to put out there the idea to see if someone may want to look at the order of how the files are parsed when cleaned, whether it's an xhtml or html file, to get the nbsp replacement code in v0.7.3 to work.

Bob
Bob Houston is offline   Reply With Quote
Old 08-13-2013, 04:13 AM   #34
Jellby
frumious Bandersnatch
Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.Jellby ought to be getting tired of karma fortunes by now.
 
Jellby's Avatar
 
Posts: 6,308
Karma: 4898871
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
I meant that adding a DOCTYPE when there was none to start with might be incorrect (whether or not flightcrew and/or epubcheck complain about it is a different matter).

Quote:
I did run my test epub through EpubCheck after it was cleaned by Sigil and there were no problems noted.
You mean that a document with   and no DOCTYPE passed epubcheck? That sort of agrees with the second part of my message: a DOCTYPE might be unneeded.
Jellby is offline   Reply With Quote
Old 08-13-2013, 10:17 AM   #35
Bob Houston
Junior Member
Bob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it is
 
Bob Houston's Avatar
 
Posts: 8
Karma: 2062
Join Date: Aug 2011
Location: Queens, NYC
Device: Kindle
Quote:
Originally Posted by Jellby View Post
I meant that adding a DOCTYPE when there was none to start with might be incorrect (whether or not flightcrew and/or epubcheck complain about it is a different matter).
After a reread, I did get your point that you thought that adding a DOCTYPE might be incorrect.

Isn't it added though, every time an epub is cleaned by Sigil?

Quote:
Originally Posted by Jellby View Post
You mean that a document with   and no DOCTYPE passed epubcheck? That sort of agrees with the second part of my message: a DOCTYPE might be unneeded.
For clarification, I inserted the DOCTYPE into a new test epub before sending it to Sigil. Because (as mentioned by others) the DOCTYPE was already there, the cleaning process did not error-out and the &nbsp were inserted correctly. That cleaned epub is what passed flightcrew and EpubCheck.

I'm still thinking that if DOCTYPE could be checked for first (If at all possible) at the start of the cleaning process, everything that followed would work like DOCTYPE had always been there. I realize it's easier said than done!
Bob Houston is offline   Reply With Quote
Old 08-13-2013, 12:06 PM   #36
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 1,482
Karma: 846401
Join Date: Jan 2009
Device: KoboGlo
Hi

With the new version of Sigil 0.7.3 (Linux 64 bits compiled by DiapDealer), I can use normally nbsp like it was before, for example for version 0.5.3. They are no more lost on opening and this is very good.

However, I failed to insert nnbsp (either in &#_x202f or &#_8239; forms - without the _ of course) in an EPUB. When I saved and opened the EPUB they were gone and replaced by a white space.

Up to now, I cannot point this problem squarely to Sigil and I still have some checks to do. I hope to be able to post a test EPUB soon. This is because with ADE 2.0, I experiment also some other unexplained problems: the text-indent is forgotten, and a space between paragraphs has been added without any code for it in the CSS. So, there could be some other murky things going on...

Last edited by roger64; 08-13-2013 at 12:09 PM.
roger64 is offline   Reply With Quote
Old 08-14-2013, 03:15 AM   #37
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 1,482
Karma: 846401
Join Date: Jan 2009
Device: KoboGlo
Hi

nnbsp (narrow no-break space) bug in Sigil 0.7.3

Usually with Sigil, the nnbsp entities are not displayed in code view: this means that you can ascertain the presence of a nnbsp only using your cursor. For example, when a nnbp has been inserted between e and ; you will need three cursors moves to go from e to ; but you will see only two characters (in code view). In book view, you can see the - small- white space.

For many months now, I have been consistently producing epubs for MR using some nbsp and mostly nnbsp, according to French typographic rules. For this, I always used some version of Sigil to fine-tune the book and insert the nnbsp within the EPUB.

The version of Sigil (0.7.3) I use has been compiled by DiapDealer for Linux 64 bits, like the previous one I used. It just seems to ignore the nnbsp:

- when I insert a nnbsp (either as &#_x202f; -or as &#_8239; - without the _ of course ) it does not appear again once the EPUB has been saved and reopened. I consistently get a normal white space.

- worse, when I reopen one of my former ebooks which has been published with nnbsp throughout, these entities are not displayed. One can see white spaces instead of nnbsp. Here for example.

@DiapDealer

In the meantime, I would need to reinstalll your former deb version (0.7.0) Could you provide me with a link?

Last edited by roger64; 08-14-2013 at 03:28 AM.
roger64 is offline   Reply With Quote
Old 08-14-2013, 04:40 AM   #38
meme
Sigil developer
meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.
 
Posts: 1,275
Karma: 1101600
Join Date: Jan 2011
Location: UK
Device: Kindle PW, K4 NT, K3, Kobo Touch
Quote:
Originally Posted by DiapDealer View Post
So far as I understand, the DOCTYPE is not a requirement of the ePub spec (but it certainly needs to be correct if it IS present). So since Bob's attached, sample ePub is entirely compliant before 0.7.3 touches it, shouldn't the question be; "Why is Sigil inserting entities (by converting nbsp characters) into a document when doing so will make said document become 'not well formed?'"

I understand the reasoning behind wanting to preserve the intent of the non-breaking space character (which 0.7.2 did not do at all), but the fix shouldn't really come at the cost of making otherwise valid epubs, invalid. Should it?

I don't know. This is a bit of weird one. I always have the DOCTYPE in my files, myself, so this issue doesn't really affect me, but still... it seems like it's a bit of a catch-22.
Ideally the "fix" shouldn't cause the epub to become invalid. But unfortunately we seem to be stuck with that effect in this case. It's just a limitation of the Qt code Sigil is using to edit the HTML files: that it replaces nbsp characters with normal spaces.

The alternative, before 0.7.3, was that if you had an nbsp character in a UTF8 file, Sigil would remove it and replace it with a normal space - regardless of cleaning settings. So you were definitely losing information (enough that at least some people wanted it fixed).

Now, Sigil is preserving the nbsp character but to do so it has to convert it from a character to an entity so it doesn't get lost. For files with DOCTYPE already defined it isn't an issue. But in files that don't have the DOCTYPE set it means you either need to manually add the DOCTYPE or allow Sigil to clean the file. You still have the issue that if you manually insert an nbsp character in Sigil (not entity) it will immediately become a normal space. I think the biggest issue was not knowing why it was suddenly giving the error - at least now it's a little clearer why the error message is shown.
meme is offline   Reply With Quote
Old 08-14-2013, 05:01 AM   #39
meme
Sigil developer
meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.meme ought to be getting tired of karma fortunes by now.
 
Posts: 1,275
Karma: 1101600
Join Date: Jan 2011
Location: UK
Device: Kindle PW, K4 NT, K3, Kobo Touch
Quote:
Originally Posted by roger64 View Post
Hi

nnbsp (narrow no-break space) bug in Sigil 0.7.3

Usually with Sigil, the nnbsp entities are not displayed in code view: this means that you can ascertain the presence of a nnbsp only using your cursor. For example, when a nnbp has been inserted between e and ; you will need three cursors moves to go from e to ; but you will see only two characters (in code view). In book view, you can see the - small- white space.

For many months now, I have been consistently producing epubs for MR using some nbsp and mostly nnbsp, according to French typographic rules. For this, I always used some version of Sigil to fine-tune the book and insert the nnbsp within the EPUB.

The version of Sigil (0.7.3) I use has been compiled by DiapDealer for Linux 64 bits, like the previous one I used. It just seems to ignore the nnbsp:

- when I insert a nnbsp (either as &#_x202f; -or as &#_8239; - without the _ of course ) it does not appear again once the EPUB has been saved and reopened. I consistently get a normal white space.

- worse, when I reopen one of my former ebooks which has been published with nnbsp throughout, these entities are not displayed. One can see white spaces instead of nnbsp. Here for example.
0.7.3 does show nnbsp characters as a normal space in Code View, and does not show them in 0.7.2. The nnbsp characters are still there as nnbsp characters, even after saving and re-opening, they just look different now in Code View. They are all still there if you open your test epub. Qt must have changed how it is showing them.

The attached example epub contains nbsp, nnbsp, and mdash entities and characters (the html has been hand edited outside of Sigil since Sigil won't save an nbsp character - if you open the epub in 0.7.3 you will see that both nbsp entries turn into entities).

The nnbsp character and entity show up as a space in Code View. But if you actually do a Find for an nnbsp character it will find the character, even after saving and re-opening. (As you probably already know, you can create this character by, for example, opening gedit on linux, typing ctrl-shift-u followed by 202f [RETURN], and can then paste that into Sigil. Or just copy it from Book View.)
Attached Files
File Type: epub nbsp_and_nnbsp_example.epub (2.5 KB, 25 views)
meme is offline   Reply With Quote
Old 08-14-2013, 05:22 AM   #40
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 1,482
Karma: 846401
Join Date: Jan 2009
Device: KoboGlo
@meme

Thanks very much for your quick and informative reply. nnbsp seems to get better and better support.

For this version of Sigil (0.7.3) this new and unannounced display of nnbsp in Sigil confused me and made me look for an -unexistent and unneeded- solution quite a long time.

For your information, a bug report has been sent about the display of nnbsp on Bugzilla for LibreOfice users (odt files). https://bugs.freedesktop.org/show_bug.cgi?id=67669

It is probable that this conversion in Code view of nnbsp into a plain white space will confuse some other people. It would be probably better to use some kind of greyed character. Hopefully, this will be for Qt 6. ;-)

Last edited by roger64; 08-14-2013 at 10:23 AM.
roger64 is offline   Reply With Quote
Old 08-14-2013, 10:49 AM   #41
Bob Houston
Junior Member
Bob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it isBob Houston knows what time it is
 
Bob Houston's Avatar
 
Posts: 8
Karma: 2062
Join Date: Aug 2011
Location: Queens, NYC
Device: Kindle
Quote:
Originally Posted by meme View Post
But in files that don't have the DOCTYPE set it means you either need to manually add the DOCTYPE or allow Sigil to clean the file.
IF I'm comfortable with how my epub was constructed prior to opening it with Sigil, then all I need to do is just answer YES to fixing the 'not well formed HTML', my nbsp will be back, and I'll be a happy camper?
Bob Houston is offline   Reply With Quote
Old 08-14-2013, 09:16 PM   #42
Tex2002ans
Fanatic
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 535
Karma: 562971
Join Date: Jul 2012
Device: Nook
Feature Requests:

Tools - Reports - Characters in HTML Files.

Currently, this report correctly goes from HTML entity names/decimal/hexadecimal and actual characters -> the report. What it does not do, is go the opposite.

When double clicking on a character in the report, it brings you to the next instance of only the actual character, but it does not find named/decimal/hexadecimal instances. I attached an image showing how clicking on '˘' does not lead to "ô" in the HTML file.

Click image for larger version

Name:	CharacterReport.png
Views:	80
Size:	23.1 KB
ID:	109396Click image for larger version

Name:	CharacterReport2.png
Views:	69
Size:	60.9 KB
ID:	109397

Tools - Reports - Links

While clicking on a link in this report, you would expect it to lead you directly to the location. Instead, it seems to only open up the HTML file in which the link occurs. From there, you have to manually search for the link.

Entity -> Character + Character -> Entity

Also, a nice thing to have added might be a setting for Sigil to automatically go from entity (names/decimal/hexadecimal) -> character, character -> entity (names/decimal/hexadecimal).... currently, I am doing this the slow way by mass running a huge batch of Saved Searches.

Click image for larger version

Name:	EntitiestoCharacter.png
Views:	81
Size:	10.4 KB
ID:	109400
Tex2002ans is offline   Reply With Quote
Old 08-15-2013, 01:28 AM   #43
roger64
Wizard
roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.roger64 ought to be getting tired of karma fortunes by now.
 
Posts: 1,482
Karma: 846401
Join Date: Jan 2009
Device: KoboGlo
Hi

@Tex2002ans

To my shame, I never used the "report" feature of Sigil and it's indeed a nice one. You have trouble displaying French (mostly) characters - Ó, Ú, Ŕ, ˘, etc.

I show you part of my report (figure one), how ˘ is displayed in
code view (arrow as an example in fig 2) but also Ú, Ŕ, Ó, and the declaration I use for every xhtml file (Note).

I make no use of a complicated translation table for entities. It all goes smoothly and displays conveniently all French characters. Maybe this could help.

Note:
<?xml version="1.0" encoding="UTF-8" standalone="no" ?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="fr-FR">
Attached Thumbnails
Click image for larger version

Name:	rapport.png
Views:	59
Size:	153.6 KB
ID:	109405   Click image for larger version

Name:	accent circonflexe.png
Views:	60
Size:	56.5 KB
ID:	109407  

Last edited by roger64; 08-15-2013 at 01:45 AM.
roger64 is offline   Reply With Quote
Old 08-15-2013, 02:29 AM   #44
Tex2002ans
Fanatic
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 535
Karma: 562971
Join Date: Jul 2012
Device: Nook
Quote:
Originally Posted by roger64 View Post
To my shame, I never used the "report" feature of Sigil and it's indeed a nice one.
Yeah the Reports are great! I use the Character Report at least once per EPUB just to make sure I can catch any anamolies (I do a lot of PDF -> OCR -> EPUB). Most of the time, my source already has the actual characters.... I rarely use the entity names, so I never ran into this problem.

Over the past week and a half I have been importing 15 years worth of articles (~6500) into Sigil, and cleaning them all up to prepare a few gigantic yearly EPUB releases (~300 articles per EPUB). In this case, the original HTML used entity names. I wanted to do some cleanup in Sigil, then do code comparison to the originals (this is why I want entities there), then I want to easily be able to swap back to characters before proofreading and releasing the EPUB (actual characters allow me to read the code much easier, and be able to catch more mistakes).

I rarely use the Link Report, but in this case, there are THOUSANDS of links pointing everywhere on the internet. The Link Report allows me to easily spot links which do not belong in the EPUB, footnotes I have not normalized (over 15 years... you can imagine all the different tools/programs that were used to generate these things).

The Class Reports allow me to catch outliers in the code itself (a weird class name that was only used once in all the articles, etc.).

I will definitely be using it more in the future, it is really helping me consolidate code. HUGE time savers.

Quote:
Originally Posted by roger64 View Post
You have trouble displaying French (mostly) characters - Ó, Ú, Ŕ, ˘, etc.

[...]

<?xml version="1.0" encoding="UTF-8" standalone="no" ?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" "http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd"><html xmlns="http://www.w3.org/1999/xhtml" xml:lang="fr-FR">
Looks to me like that XML declaration is just setting the language of the file to French. I believe Sigil uses the XML declaration at the top of the file to do some auto cleanup of entities. I know I read the logic behind the Sigil auto-clean of entities was somewhere on these forums (maybe meme or user_none in one of the older Sigil release topics?).

And I just thought of another slight tweak on the Entity -> Character, Character -> Entity request. Perhaps it can be added to the Right Click Menu -> Reformat HTML. So you will get 4 extra options there:

Characters to Entities
Characters to Entities - All HTML files
Entities to Characters
Entities to Characters - All HTML files

Last edited by Tex2002ans; 08-15-2013 at 02:31 AM.
Tex2002ans is offline   Reply With Quote
Old 08-15-2013, 06:25 PM   #45
Tex2002ans
Fanatic
Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.Tex2002ans ought to be getting tired of karma fortunes by now.
 
Posts: 535
Karma: 562971
Join Date: Jul 2012
Device: Nook
Feature Requests:

Duplicate Images Yes/No to All:

When Adding Images which already exist, you get this dialog:

Click image for larger version

Name:	DuplicateImageDialog.png
Views:	52
Size:	3.6 KB
ID:	109443

When adding tens/hundreds of potential duplicate files, you have to press "ok/cancel" tens/hundreds of times. It would be nice if there was a "Yes to All" and "No to All" added.

Duplicate HTML files Overwrite:

When adding duplicate XHTML files, you get this dialog:

Click image for larger version

Name:	DuplicateFileDialog.png
Views:	49
Size:	3.2 KB
ID:	109444

You have zero choice in what you want to do (in this case I want to overwrite, but I just have to press "ok"). This dialog also falls into having to click "ok" tens/hundreds of times. Most of the time I just kill Sigil, instead of dealing with hurting my mouse finger.

This dialog can also benefit from an "Overwrite", "Overwrite All", "Ok" choice. Or maybe it can complain and in one shot just give a list of files it cannot insert.

Currently, I either:

1. Open the EPUB in 7-zip, overwrite the XHTML files manually, and then reopen the EPUB in Sigil and continue working.
2. Use Sigil to mass delete all the XHTML, then add all the XHTML again using Sigil.

Easier Recognition of Not Well Formed Files

When you open up an EPUB with malformed files, and tell Sigil to not automatically clean the file, you get this dialog:

Click image for larger version

Name:	NotWellFormed.png
Views:	51
Size:	4.9 KB
ID:	109448

Would be nice if somehow it was possible to get a dialog such as this one, without having to exit/reenter Sigil. Also, it would be nice if this dialog would alphabetically sort the malformed files. Currently, it seems like they are just randomly placed there:

5296.xhtml, 5357.xhtml, 5100.xhtml, 5050.xhtml, 5400.xhtml, ...

Currently while you are in Sigil, and try to save when you have a malformed document in your EPUB, it is very hard to tell which file is the exact culprit. Perhaps this sentence can have at least some mention of which HTML file is causing the problem:

Click image for larger version

Name:	MalformedSigil.png
Views:	50
Size:	7.3 KB
ID:	109447

"EPUB saved, but not all HTML files are well formed: 5396.xhtml"

Yes yes, I know, you could always FlightCrew and try to spot the broken file, but in the case of having hundreds/thousands of other non-EPUB compliant code, it becomes impossible to spot which file is malformed. Perhaps the FlightCrew output can put some CRITICAL problems at the top of the list.

HTML File Report Problem (?)

Also, this column in the HTML Files Report seems to be me to be worthless. You cannot get into Reports unless the files are all well formed.

Click image for larger version

Name:	HTMLFilesReportWellFormed.png
Views:	52
Size:	17.1 KB
ID:	109449

Last edited by Tex2002ans; 08-15-2013 at 07:30 PM. Reason: More Suggestions
Tex2002ans is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Sigil 0.7.2 Released user_none Sigil 40 06-25-2013 12:35 AM
Sigil 0.7.1 Released user_none Sigil 64 03-26-2013 11:02 PM
Sigil 0.6.0 Released user_none Sigil 93 11-24-2012 07:50 PM
Sigil 0.5.3 Released user_none Sigil 85 05-13-2012 06:29 AM
Sigil 0.4.2 Released user_none Sigil 41 10-26-2011 07:03 AM


All times are GMT -4. The time now is 05:40 PM.


MobileRead.com is a privately owned, operated and funded community.