Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Sigil

Notices

Reply
 
Thread Tools Search this Thread
Old 04-03-2021, 09:47 PM   #1
wellesradio
Member
wellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavenswellesradio is a rising star in the heavens
 
Posts: 21
Karma: 13884
Join Date: Jan 2014
Device: apple ipad (3rd generation)
Can I break up an HTML file using a TOC?

I have some public domain ebooks in epub that contain a Table of Contents, but it seems that the books contains only a small handful of HTML files, each with multiple chapters in them. However my e-reader only recognizes chapter progress within “sections”, meaning within each HTML file and not according to the TOC which is just linking to paragraphs within each HTML file.

I’d like to break up the chapters into their own separate HTML files using the TOC as a guide. I’d like to be able to do it automatically rather than manually.
wellesradio is offline   Reply With Quote
Old 04-04-2021, 02:37 AM   #2
Doitsu
Grand Sorcerer
Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.Doitsu ought to be getting tired of karma fortunes by now.
 
Doitsu's Avatar
 
Posts: 5,175
Karma: 18533687
Join Date: Dec 2010
Device: Kindle PW2
Quote:
Originally Posted by wellesradio View Post
I’d like to break up the chapters into their own separate HTML files using the TOC as a guide. I’d like to be able to do it automatically rather than manually.
In Sigil, select Tools > Table of Contents > Generate Table of Contents or simply press CTRL+T.
If you see all chapter titles, you can simply insert a Sigil split marker tag before each chapter heading tag. For example, if all chapter headings are <h1> tags, you'd use:

Find:<h1
Replace:<hr class="sigil_split_marker" /><h1

and then select Edit > Split at markers followed by Tools > Table of Contents > Generate Table of Contents.

If the TOC is empty when you select Tools > Table of Contents > Generate Table of Contents, you can use KevinH's TOCSaver plugin to change paragraph tags to heading tags or insert hidden heading tags.
Doitsu is offline   Reply With Quote
Old 04-05-2021, 10:41 AM   #3
exaltedwombat
Guru
exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.
 
Posts: 874
Karma: 2457128
Join Date: Nov 2011
Device: none
If the document consistently gives chapter titles a particular tag (and doesn't use that tag elsewhere) a simple Search and Replace to insert "sigil_split_marker" will do the job.

But if the code were that organised, I suspect the TOC would have been sorted out already. You may have no practical alternative to finding the chapter titles you want to list in a TOC and applying the h1 tag manually.

How many chapters? Some jobs are really too small to be worth automating.
exaltedwombat is offline   Reply With Quote
Old 04-05-2021, 06:11 PM   #4
hobnail
Running with scissors
hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.hobnail ought to be getting tired of karma fortunes by now.
 
Posts: 1,037
Karma: 10000000
Join Date: Nov 2019
Device: none
In addition to what's said above, what I also do in order to have chapter breaks only before chapter headings is to join all of the chapter files into one large html/xhtml file, then do the splitting as per above. (But I've forgotten how I joined the separate files so hunt around and experiment.)
hobnail is offline   Reply With Quote
Old 04-05-2021, 09:26 PM   #5
DNSB
Bibliophagist
DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.DNSB ought to be getting tired of karma fortunes by now.
 
DNSB's Avatar
 
Posts: 12,146
Karma: 59280049
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Forma, Clara HD, Nexus 7 HD, iPad Pro, Tolino epos
To continue from what @hobnail said, I also tend to join all the chapter files into a single file since Gutenberg has a love for having massive files with chapters split between the files. To do this, I select the files I want to merge, and then right click and merge or Ctrl-M. After this, I insert the split markers and split.

Quite often the split markers are simple to insert but at other times, the regex to insert the split markers can be a learning experience.
DNSB is offline   Reply With Quote
Old 04-07-2021, 05:16 PM   #6
exaltedwombat
Guru
exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.
 
Posts: 874
Karma: 2457128
Join Date: Nov 2011
Device: none
Quote:
Originally Posted by DNSB View Post
Quite often the split markers are simple to insert but at other times, the regex to insert the split markers can be a learning experience.

Indeed. I'm all in favour of learning experiences. But sometimes you have to balance an hour's research into Regex with the time taken to manually insert 16 chapter breaks!
exaltedwombat is offline   Reply With Quote
Old 04-07-2021, 06:04 PM   #7
JSWolf
Resident Curmudgeon
JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.JSWolf ought to be getting tired of karma fortunes by now.
 
JSWolf's Avatar
 
Posts: 62,333
Karma: 102150074
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Aura H2O, PRS-650, PRS-T1, nook STR, iPad 4, iPhone SE 2020, PW3
Quote:
Originally Posted by exaltedwombat View Post
Indeed. I'm all in favour of learning experiences. But sometimes you have to balance an hour's research into Regex with the time taken to manually insert 16 chapter breaks!
While it might take longer to learn regex, once you've learned it, it will eventually take less time.
JSWolf is offline   Reply With Quote
Old 04-07-2021, 06:39 PM   #8
exaltedwombat
Guru
exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.exaltedwombat ought to be getting tired of karma fortunes by now.
 
Posts: 874
Karma: 2457128
Join Date: Nov 2011
Device: none
But when that hour results in the conclusion that chapter's AREN'T marked in any consistent and unique way.... :-( If these were well-constructed EPUB files we wouldn't be having to do this job in the first place.
exaltedwombat is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
HTML Entities placed in ToC break Kobo Aura trekky0623 Calibre 11 12-16-2016 04:22 PM
Kindler previewer not recognizing toc.ncx file, my html toc, or the start point... petercrowell Kindle Formats 2 05-01-2012 08:14 AM
HTML input plugin stripping text within toc tags in child html file nimblebooks Conversion 3 02-21-2012 03:24 PM
NCX file generator (and html ToC and opf) GiorgioC Workshop 0 07-12-2011 06:55 AM
can't generate a toc from an html file p3aul Calibre 13 08-27-2010 05:44 AM


All times are GMT -4. The time now is 03:42 PM.


MobileRead.com is a privately owned, operated and funded community.