Old 02-20-2010, 04:48 AM   #4
iodine9176
Junior Member
Posts: 5
Karma: 10
Join Date: Feb 2010
Device: stanza
Quote:
Originally Posted by frabjous
OK, I'm rather confused. The title of the thread is about HTML documents, and it's in the ePub forum. But then in your post, you ask about splitting PDF and DOC files into TXT files? Why would you want to change PDF or DOC to TXT? What does this have to do with HTML or ePub? Please be more specific about what you're trying to do.

I usually use <H2>...</H2> tags for chapter headings in my HTML code (or perhaps <H2 class="chaptertitle">...</H2>, etc.). Calibre allows you to set the XPath expression for chapter detection, but if memory serves, its default setting will pick up H2 tags. It'll do the splitting for you, at least with normal settings.
Thanks for your reply.
Actually, I am using eCub as a compiler to generate the ePub file. eCub can only import plain TXT or HTML files, so I wonder whether there is a way to mark chapters in a PDF or DOC so that I can then use eCub to compile the ePub. What I am really more interested in, though, is how to split a PDF or DOC into separate XHTML files, one per chapter.
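For what it's worth, here is a rough sketch of the splitting step in Python. It assumes the PDF or DOC has already been converted to a single HTML file by some other tool, and that chapter headings are marked with H2 tags as suggested in the quote above. The names split_chapters and wrap_xhtml are made up for illustration, and the wrapper does not clean up the markup, so the output is only valid XHTML if the input chunk already is.

```python
import re

# Matches an opening <h2> tag in any case, with optional attributes
# such as class="chaptertitle". Assumption: chapters start at H2 tags.
H2_OPEN = re.compile(r"<h2\b[^>]*>", re.IGNORECASE)

# Minimal XHTML wrapper (hypothetical; adjust to what your ePub tool expects).
XHTML_TEMPLATE = """<?xml version="1.0" encoding="utf-8"?>
<html xmlns="http://www.w3.org/1999/xhtml">
<head><title>{title}</title></head>
<body>
{body}
</body>
</html>
"""


def split_chapters(html_body):
    """Split an HTML body into chunks, each starting at an <h2> heading.

    Non-empty text before the first <h2> is kept as its own first chunk.
    If there is no <h2> at all, the whole body is returned as one chunk.
    """
    starts = [m.start() for m in H2_OPEN.finditer(html_body)]
    if not starts:
        return [html_body]
    chunks = []
    if html_body[:starts[0]].strip():
        chunks.append(html_body[:starts[0]])
    bounds = starts + [len(html_body)]
    for begin, end in zip(bounds, bounds[1:]):
        chunks.append(html_body[begin:end])
    return chunks


def wrap_xhtml(chunk, title):
    """Wrap one chapter chunk in the minimal XHTML template."""
    return XHTML_TEMPLATE.format(title=title, body=chunk.strip())
```

Each chunk could then be written to its own file (chapter01.xhtml, chapter02.xhtml, and so on) and imported one at a time.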