![]() |
#1 |
Dylanologist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 200
Karma: 146754
Join Date: Apr 2010
Location: Hanover, New Hampshire, USA
Device: none/all/any
|
![]()
When opening a 16,000 word html document, Sigil only opens the first 6,000 words. I'vechecked the original text document and the html marup document, bith are free of errors or extranious bits. The html document opens complete and in tact in a browser before opening in Sigil. After six attempts to make this work, I thought I'd ask here - What am I missing? Thanks.
P.S. In the past. I've sucessfully opened much larger html documents in Sigil. |
![]() |
![]() |
![]() |
#2 |
frumious Bandersnatch
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 7,543
Karma: 19001583
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
Does the HTML pass the HTML validator?
|
![]() |
![]() |
Advert | |
|
![]() |
#3 |
Dylanologist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 200
Karma: 146754
Join Date: Apr 2010
Location: Hanover, New Hampshire, USA
Device: none/all/any
|
Great question Jellby. I'll test it now...
...FAILED! I fell into the trap of, "it always worked in the past, so I can skip that step." Thanks Jellby for giving be a basic heads up. --- The error was in the metadata, which I easily fixed and it passed. I then saved the html as an epub in Sigil. I unzipped the new epub and took a look at the html file inside. Lo 'n behold the file was truncated in the same location. hmmm Last edited by Fabe; 07-05-2010 at 11:24 AM. |
![]() |
![]() |
![]() |
#4 |
Created Sigil, FlightCrew
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
Create an issue on the tracker and attach the epub. Usually it doesn't matter if the HTML is valid or not, Sigil (Tidy) should be able to fix it.
There were truncation problems caused by unescaped ampersands (& needs to be & in HTML documents), but I fixed that a while ago. Which version of Sigil are you using? |
![]() |
![]() |
![]() |
#5 |
Dylanologist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 200
Karma: 146754
Join Date: Apr 2010
Location: Hanover, New Hampshire, USA
Device: none/all/any
|
Valloric - The issue was in the html. It took me a while to get all the bits lined up. Interestingly, the last error was an ampersand. "&c" was in the document for etcetera. I changed it to etc. and all went smoothly after that. 100% of the document showed up in Sigil and the epub files were fine. My Sigil version is 0.2.1. I'm looking forward to 2.0.0 :-)
If I run into this issue again, I will attach the epub to the tracker. (Now what the heck is the tracker?) - Fabe |
![]() |
![]() |
Advert | |
|
![]() |
#6 |
Created Sigil, FlightCrew
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
|
![]() |
![]() |
![]() |
#7 |
Zealot
![]() ![]() ![]() ![]() Posts: 107
Karma: 396
Join Date: Jul 2008
Location: Meuse, France
Device: Pocketbook 623, Kobo Glo HD, Aura One,Aura H2O2
|
I have noticed that sigil does not like html with & in them. Opening an html to discover it was truncated happened to me a few times before I figured this one out. You have to change all the & into & .
Now maybe there are other similar issues, but I only ran into this one. |
![]() |
![]() |
![]() |
#8 |
Reader
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 520
Karma: 24612
Join Date: Aug 2009
Location: Utrecht, NL
Device: Kobo Aura 2, iPhone, iPad
|
This is standard HTML stuff. It has nothing to do with Sigil. In fact it is standard SGML/XML stuff. However, the parser could have been more forgiving. But it is Webkit that does the parsing, I suppose, so mostly out of Sigil's control.
|
![]() |
![]() |
![]() |
#9 |
Created Sigil, FlightCrew
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
QDom does the parsing, and dies on unescaped ampersands. But I added code to work around that a while ago. You shouldn't get this ampersand-related truncation anymore in recent versions of Sigil (if you do, report it).
|
![]() |
![]() |
![]() |
#10 | |
Zealot
![]() ![]() ![]() ![]() Posts: 107
Karma: 396
Join Date: Jul 2008
Location: Meuse, France
Device: Pocketbook 623, Kobo Glo HD, Aura One,Aura H2O2
|
Quote:
![]() |
|
![]() |
![]() |
![]() |
#11 |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 184
Karma: 2572
Join Date: Aug 2010
Device: Kindle
|
I have the problem of a truncated file. I can find no ampersands except for those in the html entities. It's true that the file ended at one such, which happened to be – but I tried removing that entire phrase and opening the file again. It stopped at about the same place--actually at the beginning of that particularly paragraph.
When I view Code in the truncated file, it ends rather oddly: <p class="sgc-6"></p> </blockquote> </body> </html> What's that /blockquote doing there? Hm.... Okay, I found and removed the extra blockquote tag. Seems unlikely that was the problem, however. Just to clear up another possibility: there's no such thing as a fully operative paid version of Sigil as opposed to the free download, right? |
![]() |
![]() |
![]() |
#12 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,880
Karma: 59840450
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
![]() But it is still the same Sigil ![]() A mis-match blockquote tag? Were you editing in CV? Did you "indent" the whole (or a big chunk) document? (Setting a style with different margins is probably the better way) |
|
![]() |
![]() |
![]() |
#13 |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 184
Karma: 2572
Join Date: Aug 2010
Device: Kindle
|
Thanks. I was worried that perhaps I had a crippled version of Sigil, and it would only do say 20,000 words of text. If that's not the case, what are the possible reasons for a file's being truncated?
About the blockquote: I found a case midway where <blockquote> appeared twice before an indented section. I eliminated the extra tag and likewise the closing tag at the end of the truncated file. That doesn't seem to have been my problem. Should I validate my xhmtl/html before opening it in Sigil? I don't see any problem where the truncated book ends--all the paragraphs end with </p> etc. |
![]() |
![]() |
![]() |
#14 | |
Created Sigil, FlightCrew
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,982
Karma: 350515
Join Date: Feb 2008
Device: Kobo Clara HD
|
Quote:
That's not necessary. Sigil can usually open even files with severe errors. |
|
![]() |
![]() |
![]() |
#15 |
Groupie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 184
Karma: 2572
Join Date: Aug 2010
Device: Kindle
|
Thank you. I've uploaded both the truncated epub file and the underlying html document.
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
need advice doc/html to epub | ebooker | Workshop | 2 | 08-14-2010 10:06 PM |
HTML doc type kindle? | poshm | Workshop | 2 | 02-17-2010 01:59 AM |
Can't convert this html doc (attached) | phunkysai | Calibre | 8 | 07-19-2009 10:59 PM |
iLiad Linking to other files from HTML doc? | marussell01 | iRex Developer's Corner | 2 | 01-14-2008 08:45 PM |
html or doc better? | spear | Fictionwise eBookwise | 11 | 12-16-2007 09:43 PM |