04-19-2012, 02:20 PM | #1 |
Fanatic
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
|
Different search results by file and by all html files?
I edit many, MANY ebooks on a daily basis using Sigil. Most I have no problems with at all. There are a few, a rare few though, that don't seem to search properly. Usually it's just special characters like an em dash or an ellipsis. When I search for them in a current file, the results are returned just fine. If I search for them over all html files they return a result of zero. I know there are hundreds of em dashes and ellipsis in the ebook but I can only find them searching file by file not by all html files. It is only the rare ebook that does this. I've checked them across various versions of Sigil across various computers and various operating systems. The only commonality seems to be the epub itself but I have no idea what about the epub would cause this.
Has anyone else ever heard of or experienced this issue? It's very strange and I'd like to figure it out and try to correct if at all possible. Thanks for any assistance anyone may be able to provide. Sincerely, - Byron Followell |
04-19-2012, 04:42 PM | #2 | |
Berti
Posts: 1,196
Karma: 4985964
Join Date: Jan 2012
Location: Zischebattem
Device: Acer Lumiread
|
Quote:
AFAIK, sigil converts automatically to utf8. Have you ever tried to save the epub with sigil, open it again and search (count) again ? |
|
Advert | |
|
04-19-2012, 05:03 PM | #3 | |
Fanatic
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
|
Quote:
Still, it seems very strange that it would work just fine when searching file by file but not when searching by all html files. I'll post here if this helps. - Byron |
|
04-19-2012, 06:49 PM | #4 |
Fanatic
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
|
I've got it figured out. OK, saving didn't change anything and didn't help any at all. I was using an older version, something prior to 0.5. One thing I noticed is that, when I looked at an ellipsis or an em dash in code view, I saw an em dash or an ellipsis. I downloaded the most up-to-date version, 0.5.3, and had the same problems and saving and trying again didn't make any difference. Now I've noticed that when looking at any of the special characters in code view, I don't see the characters but I see their character entities. For a left quote I see “ For an em dash I see —
It just seems strange that the older versions didn't show these entities in code view and that I could find these characters just fine searching file by file in code view but not by searching all html files. Anyway, it's fixed. I can now search for these character entities and change them to the actual character if I want or whatever I want to do with them using all html files and everything seems to be working just fine. Hopefully this might help someone else in the future. Thanks. - Byron |
04-20-2012, 02:03 AM | #5 | |
Berti
Posts: 1,196
Karma: 4985964
Join Date: Jan 2012
Location: Zischebattem
Device: Acer Lumiread
|
Quote:
It's funny to see how the spellcheck runs mad with these entities .... |
|
Advert | |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Merging multiple HTML files into one HTML file | skoobwoman | Workshop | 45 | 07-11-2014 10:46 AM |
XHTML files not listed, only one HTML file | SmartyGuy | Sigil | 6 | 06-21-2011 12:32 PM |
Converting multiple HTML files into one EPUB file | bigdukesix | ePub | 3 | 03-08-2011 12:12 PM |
Recognition of author and title from html files/reading metadata from a seperate file | Lethe | Calibre | 5 | 04-03-2010 08:35 AM |
Merging several Html files into one file | nesseainie | Calibre | 8 | 06-03-2009 02:06 PM |