Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 12-11-2015, 01:06 AM   #1
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 35,428
Karma: 145525534
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Issue opening HTML file

I'd picked up an ebook (well, 3/4 of an ebook) from Baen and went to open it's html files in Sigil. Sadly, it refused to open and gave an error message suggesting that I change the clean source preference to Pretty Print Tidy or HTML Tidy and reloading the file. See attached image. While neither Pretty Print Gumbo or Google Gumbo-Parser worked to allow opening the file, the message might be changed to reflect those two options.
Attached Thumbnails
Click image for larger version

Name:	1635_APoR.jpg
Views:	198
Size:	33.8 KB
ID:	144502  
DNSB is offline   Reply With Quote
Old 12-11-2015, 07:44 AM   #2
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
Hi,
We will update that error message. But you turned on Clean on Open, and still nothing could open that .htm file? Wow! Was it encrypted? Google's Gumbo parser has been able to read/parse literally billions of pages on the web. I would be interested in knowing why that page could not be parsed.

KevinH
KevinH is offline   Reply With Quote
Advert
Old 12-11-2015, 08:42 AM   #3
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,549
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
I'd be willing to purchase this book from Baen in order to troubleshoot, if you (@DNSB) could tell me the exact format of the 3/4 version of this book you purchased.

EDIT: actually, never mind. It doesn't appear that the 3/4 version of the book is available any more.

Last edited by DiapDealer; 12-11-2015 at 08:52 AM.
DiapDealer is offline   Reply With Quote
Old 12-11-2015, 09:44 AM   #4
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
FYI: That error message has now been fixed in Sigil master and the fix will appear in the next release.
KevinH is offline   Reply With Quote
Old 12-11-2015, 11:31 AM   #5
eschwartz
Ex-Helpdesk Junkie
eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.eschwartz ought to be getting tired of karma fortunes by now.
 
eschwartz's Avatar
 
Posts: 19,422
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
You can try @jackie_w's ScrambleEbook: Getting help with copyrighted books troubleshooting utility.

Hopefully that will give you a copyright-free EPUB that can replicate the problem.


EDIT: Oh wait, unpacked HTML?

Last edited by eschwartz; 12-11-2015 at 11:33 AM.
eschwartz is offline   Reply With Quote
Advert
Old 12-11-2015, 02:13 PM   #6
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
Hi,
Found a sample chapter of that book on the Baen website and used its html and received the following when I tested it for being well-formed by Gumbo:

--------
line: 2 col: 1 type 40 @1:1: This is not a legal doctype.
@2:1: This is not a legal doctype.
<!DOCTYPE html PUBLIC "+//ISBN 0-9673008-1-9//DTD OEB 1.2 Document//EN" "http://openebook.org/dtds/oeb-1.2/oebdoc12.dtd">

So all it is complaining about is the DOCTYPE not being an epub2 DOCTYPE. If you turn on Clean On Open in Sigil preferences it will still happily load and parse that file.

Your error message in the image cites the exact same line so your problem is probably the same non-standard doctype as well. Simply turn on Cleaning and it should load just fine into CodeView. You can then modify it if need be, otherwise leave it alone.

KevinH
KevinH is offline   Reply With Quote
Old 12-11-2015, 02:38 PM   #7
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
Ah! I see the issue. Gumbo detects the bad docytpe but allows it to pass through unchanged which just triggers the issue again. I will have to modify the gumbo code to not pass through known bad doctypes. No doctype at all would be better,

If you cut and paste the the html with this doctype into Sigil, Preview and BookView will barf until it is fixed.

I will look into fixing this.

KevinH
KevinH is offline   Reply With Quote
Old 12-11-2015, 03:11 PM   #8
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,644
Karma: 5433388
Join Date: Nov 2009
Device: many
Hi,
This is now fixed in master. Bad doctypes will no longer survive a gumbo parse/serialize sequence which means that Clean On Open with Gumbo will allow this book to load.
Thanks for the bug report!
KevinH
KevinH is offline   Reply With Quote
Old 12-17-2015, 03:13 PM   #9
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 35,428
Karma: 145525534
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Forma, Clara HD, Lenovo M8 FHD, Paperwhite 4, Tolino epos
Quote:
Originally Posted by KevinH View Post
Hi,
This is now fixed in master. Bad doctypes will no longer survive a gumbo parse/serialize sequence which means that Clean On Open with Gumbo will allow this book to load.
Thanks for the bug report!
KevinH
Thanks for the fix!
DNSB is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Opening Ebooks from within HTML document kerravonsen PocketBook 2 01-25-2014 09:03 PM
beginner wants help with html not opening on notepad m468949 Sigil 3 08-13-2013 04:37 AM
HTML input plugin stripping text within toc tags in child html file nimblebooks Conversion 3 02-21-2012 03:24 PM
Kindle and Overdrive - how to use? Opening windows issue netrate Amazon Kindle 8 01-04-2012 10:27 PM
Convert HTML to MOBI (HTML recognized as ZIP file) pdubois Conversion 1 01-25-2011 12:55 PM


All times are GMT -4. The time now is 04:10 AM.


MobileRead.com is a privately owned, operated and funded community.