Quote:
Originally Posted by frabjous
OK, I'm rather confused. The title of the thread is about HTML documents, and it's in the ePub forum. But then in your post, you ask about splitting PDF and DOC files into TXT files? Why would you want to change PDF or DOC to TXT? What does this have to do with HTML or ePub? Please be more specific about what you're trying to do.
I usually use <H2>...</H2> tags for chapter headings in my HTML code (or perhaps <H2 class="chaptertitle">...</H2>, etc.). Calibre allows you to set the XPath expression for chapter detection, but if memory serves, its default setting will pick up H2 tags. It'll do the splitting for you, at least with normal settings.
Thanks for your reply.
Actually, I am using eCub to generate the EPUB file. eCub can only import plain TXT or HTML files, so I wonder whether there is a way to mark the chapters in a PDF or DOC file so that I can then use eCub to compile the EPUB. What I am really more interested in, though, is how to split a PDF or DOC into separate XHTML files, one per chapter.
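To make the question concrete, here is a rough Python sketch of the kind of splitting I have in mind. It assumes the PDF or DOC has already been converted to a single HTML file by some other tool, and that each chapter starts with an <H2> heading as suggested above. The function names split_chapters and wrap_xhtml are made up for illustration; this is not how Calibre or eCub do it. Anything before the first <H2> is dropped, and the output is only well-formed XHTML if the input markup already is.

```python
import re
from html import escape, unescape

# Assumption: every chapter begins with an <h2> heading (any case, any attributes).
H2_OPEN = re.compile(r"<h2\b", re.IGNORECASE)
H2_FULL = re.compile(r"<h2\b[^>]*>(.*?)</h2>", re.IGNORECASE | re.DOTALL)
ANY_TAG = re.compile(r"<[^>]+>")


def split_chapters(body_html):
    """Return a list of (title, fragment) pairs, one per <h2> heading.

    Each fragment runs from one <h2> up to (not including) the next one.
    Text before the first <h2> is ignored.
    """
    starts = [m.start() for m in H2_OPEN.finditer(body_html)]
    chapters = []
    for i, start in enumerate(starts):
        end = starts[i + 1] if i + 1 < len(starts) else len(body_html)
        fragment = body_html[start:end].strip()
        heading = H2_FULL.match(fragment)
        if heading:
            # Drop inline tags inside the heading and decode entities.
            title = unescape(ANY_TAG.sub("", heading.group(1))).strip()
        else:
            title = "Chapter %d" % (i + 1)
        chapters.append((title, fragment))
    return chapters


def wrap_xhtml(title, fragment):
    """Wrap one chapter fragment in a minimal XHTML document skeleton."""
    return (
        '<?xml version="1.0" encoding="utf-8"?>\n'
        '<html xmlns="http://www.w3.org/1999/xhtml">\n'
        "<head><title>%s</title></head>\n"
        "<body>\n%s\n</body>\n</html>\n" % (escape(title), fragment)
    )
```

Each pair returned by split_chapters could then be passed to wrap_xhtml and written out under a numbered file name (chapter_01.xhtml, chapter_02.xhtml, and so on) for import into eCub. A regular expression is only good enough for simple, consistent markup; a proper HTML parser would be safer for messy converter output.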