Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 09-06-2021, 11:45 PM   #1
iieeann
Member
iieeann began at the beginning.
 
iieeann's Avatar
 
Posts: 12
Karma: 10
Join Date: Dec 2020
Device: Android
Question Unable to open an epub file

Hi,

I have not been able to open a specific epub file (10MB) successfully with Sigil as well as Calibre. Sigil can't show any viewable page while Caibre only show 1% of the pages. When i change the epub extension to zip, the extracted files seem incomplete as well (4.3MB), 99% of the text files are missing.

I use an Android epub reader Moon+ Reader Pro to open the file, it works perfectly fine.

Sigil error


Calibre Viewer error


Calibre Editor screen (no error reported)


If the epub file is corrupted, Moon+ Reader won't be able to open it. What is wrong here?

The epub file is attached as below.
Moderator Notice
Copyrighted content removed by moderator

Last edited by DiapDealer; 09-07-2021 at 04:07 PM.
iieeann is offline   Reply With Quote
Old 09-07-2021, 07:43 AM   #2
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
For Sigil's part: it is the insane xhtml filenames that are the problem. Plus, it appears there may be tons of improperly encoded characters for the UTF-8/16 encoding.

I'm going to give the benefit of the doubt in assuming you haven't posted content that infringes on the rights of any potential copyright holders, here. Just know that your upload will be deleted as soon as it's determined otherwise.
DiapDealer is offline   Reply With Quote
Advert
Old 09-07-2021, 07:48 AM   #3
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
The fact that Moon Reader will open it has no bearing on whether or not it's a valid, spec-compliant, epub
DiapDealer is offline   Reply With Quote
Old 09-07-2021, 09:43 AM   #4
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,636
Karma: 5433388
Join Date: Nov 2009
Device: many
The file names are ridiculous and something about them is making Sigil think they are not the same as listed in the opf manifest. I have not figured out why (yet). If you load the epub into Sigil, it will tell you that it is seeing these files as unmanifested. So the path in the opf manifest does not seem to match the path as created when extracting file from the zip archive.

If you unzip the epub, you can actually use AddExisting to import the entire set of files and save it as an epub, and then it will work.

So it may be an encoding issue in the content.opf itself as it appears it was edited by some tool that is insane enough to add a BOM (byte order mark) to a utf-8 file! There no big endian vs little endian when it is a stream of just bytes (ie. in a utf-8 file) and no ByteOrder mark is ever needed or ever used anymore.

I did notice the Sigil 1.7 metadata at some point in the content.opf so at some point this was edited with Sigil but Sigil will not write out BOM so something has edited it after that.

Last edited by KevinH; 09-07-2021 at 09:46 AM.
KevinH is offline   Reply With Quote
Old 09-07-2021, 11:19 AM   #5
elibrarian
Imperfect Perfectionist
elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.elibrarian ought to be getting tired of karma fortunes by now.
 
elibrarian's Avatar
 
Posts: 460
Karma: 724664
Join Date: Dec 2011
Location: Ølstykke, Denmark
Device: none
I don't have a solution, but some clues for more knowledgeable people around here perhaps: The book is in Traditional Chinese (Google/Chrome tells me), and I suspect that the ridiculous file names probably was too at some point of their existence. Given the problems with spaces, commas etc. in filenames inside epubs, we've discussed ad nauseam here on Mobileread for years, I think the problem might lie here.

The Moon reader seems to be originating from Hong Kong, and thus chinese probably are not at problem for it.

Just my 2 cents - as usual.

Regards,

Kim
elibrarian is offline   Reply With Quote
Advert
Old 09-07-2021, 11:45 AM   #6
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 43,843
Karma: 22666666
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Do an EPUB to AZW3 to EPUB conversion in calibre that will take care of most issues. Or even just an EPUB to EPUB though that will not fix the invalid filenames.
kovidgoyal is offline   Reply With Quote
Old 09-07-2021, 12:50 PM   #7
Sarmat89
Evangelist
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 482
Karma: 2267928
Join Date: Nov 2015
Device: none
The easiest way is to use Macintosh or Linux machine/VM to edit the file and mass rename all 900+ files to some Latin letters.
Sarmat89 is offline   Reply With Quote
Old 09-07-2021, 01:00 PM   #8
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,636
Karma: 5433388
Join Date: Nov 2009
Device: many
Or just load the epub in Sigil then use AddExisting to add the .xhtml files from an unzipped version of the epub. Then you can use Sigil's regex rename and batch renaming to create better filenames while properly handling any internal links.
KevinH is offline   Reply With Quote
Old 09-07-2021, 01:10 PM   #9
Sarmat89
Evangelist
Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.Sarmat89 ought to be getting tired of karma fortunes by now.
 
Posts: 482
Karma: 2267928
Join Date: Nov 2015
Device: none
That file cannot be extracted in Windows, because it does not allow : and * in the Win32-compatible filenames.
Sarmat89 is offline   Reply With Quote
Old 09-07-2021, 03:12 PM   #10
Binchen
Connoisseur
Binchen began at the beginning.
 
Posts: 57
Karma: 10
Join Date: Jul 2021
Device: Abakus
Thats a chinese book:

Quote:
Dabong's shift worker
The little man who sells newspapers
WeChat public number-Asheng reads books
The filenames contains characters which are not allowed as Sarmat already mentioned.

Quote:
Chapter 197
Dinner Party and Loquat

A dinner party? Well, the governor's lord has been in the city for so long, it is impossible for the Yunzhou officialdom not to know ...... Xu Qi'an's mental state is much better, anyway, he can't sleep, it is boring to stay at the post, so he said.

"Good, your Excellency wait for a moment."

He put on his boots, tied the gong placed on the bed to his chest, and hung the black and gold long knife on his back waist, and opened the door of his room.

Inspector Zhang stood outside the door, wearing a scarlet official robe, with a straight posture and a distinguished air.
I asume thats acommercial book. If its copyrighted i dont know. Maybe the strange filenames are coming from a failed unicode conversion or may be some protection or failed try to get rif of a protection.
Binchen is offline   Reply With Quote
Old 09-07-2021, 03:33 PM   #11
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
If it's a commercial book that's not being given away for free, it needs to be taken down.

The filenames make zero sense in any conversion, or liberation sense. There's no reason for them to be converted to extremely long meaningless strings comprised of nothing but two characters (both of which are illegal on Windows). There's nothing human parseable in "::***::*:*::**::**::**:::***:::****::::****:.xhtm l" in any language so something like f00001.xhtml, f00002.xhtml, etc... would work equally well (in any language) and not break.

My advice is to throw it back and buy a better copy.

Last edited by DiapDealer; 09-07-2021 at 03:36 PM.
DiapDealer is offline   Reply With Quote
Old 09-07-2021, 03:48 PM   #12
Binchen
Connoisseur
Binchen began at the beginning.
 
Posts: 57
Karma: 10
Join Date: Jul 2021
Device: Abakus
I am pretty sure this is a copyrighted commercial book, the author is Xiao Lang Jun
and there currently plans to adopt this to a cinematic film for the chinese market.

Klick
Binchen is offline   Reply With Quote
Old 09-07-2021, 04:04 PM   #13
DiapDealer
Grand Sorcerer
DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.DiapDealer ought to be getting tired of karma fortunes by now.
 
DiapDealer's Avatar
 
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
Down it comes, then.
DiapDealer is offline   Reply With Quote
Old 09-08-2021, 11:39 AM   #14
KevinH
Sigil Developer
KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.KevinH ought to be getting tired of karma fortunes by now.
 
Posts: 7,636
Karma: 5433388
Join Date: Nov 2009
Device: many
And I was able to track down the issue for Sigil on macOS and Linux. When reading in a manifest item, it looks at the href for a ":" which should only exist in the url's scheme and if it exists, the href is considered to be external and not actually present in the epub. Using external resources in the manifest is allowed in some case under epub3.

So all and all a very strange beast set up to fail on almost very platform.
KevinH is offline   Reply With Quote
Old 09-10-2021, 03:59 PM   #15
iieeann
Member
iieeann began at the beginning.
 
iieeann's Avatar
 
Posts: 12
Karma: 10
Join Date: Dec 2020
Device: Android
yes get a better copy is the solution. There are 2 formats available, i get the mobi format and convert it to epub, then the file is now editable.
iieeann is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Sigil unable to open a file Andjety Sigil 39 03-20-2017 11:08 PM
Unable to open file Toreth Sigil 25 03-16-2015 06:36 PM
Unable to open database file wladdy Library Management 6 08-28-2012 05:04 AM
Unable to open database file JulieR Calibre 2 04-24-2009 04:40 AM
Unable to open file that is 8MB timyap Sony Reader 12 05-09-2008 09:51 AM


All times are GMT -4. The time now is 06:26 PM.


MobileRead.com is a privately owned, operated and funded community.