Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 04-19-2012, 02:20 PM   #1
bfollowell
Fanatic
bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.
 
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
Different search results by file and by all html files?

I edit many, MANY ebooks on a daily basis using Sigil. Most I have no problems with at all. There are a few, a rare few though, that don't seem to search properly. Usually it's just special characters like an em dash or an ellipsis. When I search for them in a current file, the results are returned just fine. If I search for them over all html files they return a result of zero. I know there are hundreds of em dashes and ellipsis in the ebook but I can only find them searching file by file not by all html files. It is only the rare ebook that does this. I've checked them across various versions of Sigil across various computers and various operating systems. The only commonality seems to be the epub itself but I have no idea what about the epub would cause this.

Has anyone else ever heard of or experienced this issue? It's very strange and I'd like to figure it out and try to correct if at all possible.

Thanks for any assistance anyone may be able to provide.

Sincerely,
- Byron Followell
bfollowell is offline   Reply With Quote
Old 04-19-2012, 04:42 PM   #2
mmat1
Berti
mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.
 
mmat1's Avatar
 
Posts: 1,196
Karma: 4985964
Join Date: Jan 2012
Location: Zischebattem
Device: Acer Lumiread
Quote:
Originally Posted by bfollowell View Post
Has anyone else ever heard of or experienced this issue? It's very strange and I'd like to figure it out and try to correct if at all possible.
Never heard about this. The only thing i can imagine, that there is a failure with the Character-Set. All the html should be utf8, but what if -for some unknown resason- it is not ? S/R would maybe behave as you decribed.

AFAIK, sigil converts automatically to utf8. Have you ever tried to save the epub with sigil, open it again and search (count) again ?
mmat1 is offline   Reply With Quote
Advert
Old 04-19-2012, 05:03 PM   #3
bfollowell
Fanatic
bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.
 
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
Quote:
Originally Posted by mmat1 View Post
AFAIK, sigil converts automatically to utf8. Have you ever tried to save the epub with sigil, open it again and search (count) again ?
Nope. I wasn't having a lot of luck making the changes I needed to make because of this issue. Since I hadn't really made any changes yet, I didn't save. I'll give that a try and see if it fixes anything.

Still, it seems very strange that it would work just fine when searching file by file but not when searching by all html files.

I'll post here if this helps.

- Byron
bfollowell is offline   Reply With Quote
Old 04-19-2012, 06:49 PM   #4
bfollowell
Fanatic
bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.bfollowell ought to be getting tired of karma fortunes by now.
 
Posts: 541
Karma: 1152752
Join Date: Aug 2010
Location: Evansville, IN, USA
Device: Samsung Galaxy Tab 4 Nook & Samsung Galaxy Tab S 10.5
I've got it figured out. OK, saving didn't change anything and didn't help any at all. I was using an older version, something prior to 0.5. One thing I noticed is that, when I looked at an ellipsis or an em dash in code view, I saw an em dash or an ellipsis. I downloaded the most up-to-date version, 0.5.3, and had the same problems and saving and trying again didn't make any difference. Now I've noticed that when looking at any of the special characters in code view, I don't see the characters but I see their character entities. For a left quote I see “ For an em dash I see —

It just seems strange that the older versions didn't show these entities in code view and that I could find these characters just fine searching file by file in code view but not by searching all html files.

Anyway, it's fixed. I can now search for these character entities and change them to the actual character if I want or whatever I want to do with them using all html files and everything seems to be working just fine.

Hopefully this might help someone else in the future.

Thanks.

- Byron
bfollowell is offline   Reply With Quote
Old 04-20-2012, 02:03 AM   #5
mmat1
Berti
mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.mmat1 ought to be getting tired of karma fortunes by now.
 
mmat1's Avatar
 
Posts: 1,196
Karma: 4985964
Join Date: Jan 2012
Location: Zischebattem
Device: Acer Lumiread
Quote:
Originally Posted by bfollowell View Post
Now I've noticed that when looking at any of the special characters in code view, I don't see the characters but I see their character entities. For a left quote I see “ For an em dash I see —
Yupp, older Versions of Sigil (0.3.4) sometimes replace entities (i. e. if you import a html). The latest verson does not !

It's funny to see how the spellcheck runs mad with these entities ....
mmat1 is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Merging multiple HTML files into one HTML file skoobwoman Workshop 45 07-11-2014 10:46 AM
XHTML files not listed, only one HTML file SmartyGuy Sigil 6 06-21-2011 12:32 PM
Converting multiple HTML files into one EPUB file bigdukesix ePub 3 03-08-2011 12:12 PM
Recognition of author and title from html files/reading metadata from a seperate file Lethe Calibre 5 04-03-2010 08:35 AM
Merging several Html files into one file nesseainie Calibre 8 06-03-2009 02:06 PM


All times are GMT -4. The time now is 12:07 PM.


MobileRead.com is a privately owned, operated and funded community.