| 
			
			 | 
		#691 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			put all files in a zip file
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#692 | 
| 
			
			
			
			 http://is.gd/4flJX 
			
			![]() ![]() ![]() ![]() ![]() Posts: 134 
				Karma: 422 
				Join Date: Jan 2008 
				
				
				
				Device: Sony PRS-505, iPhone 3G 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Thanks for the prompt response. 
		
	
		
		
		
		
		
		
		
		
		
		
	
	When i downloaded the Web page with all dependencies (in a way to refer to local files) it all worked. So, it solves my problem. Just as a feedback: When I put the same files in a ZIP file, I'm getting this error: Processing rob-blog.zip Traceback (most recent call last): File "convert_from.py", line 1922, in <module> File "convert_from.py", line 1916, in main File "convert_from.py", line 1808, in process_file File "convert_from.py", line 266, in __init__ File "convert_from.py", line 371, in add_file File "calibre\ebooks\chardet\__init__.pyo", line 59, in xml_to_unicode TypeError: decode() argument 1 must be string, not None Have you ever considered supporting the .mht (Web Archive) format? Thanks again,  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#693 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			You have to run the zip file through any2lrf 
		
	
		
		
		
		
		
		
		
		
		
		
	
	and calibre contains very sophisticated code to download and convert web content, see feeds2lrf and web2lrf  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#694 | 
| 
			
			
			
			 http://is.gd/4flJX 
			
			![]() ![]() ![]() ![]() ![]() Posts: 134 
				Karma: 422 
				Join Date: Jan 2008 
				
				
				
				Device: Sony PRS-505, iPhone 3G 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Wow! I'm impressed with the efficiency of the news and RSS feeds conversion. It works perfectly with the blog site that I was looking for.  
		
	
		
		
		
		
		
		
		
		
		
		
	
	Thanks,  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#695 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 1 
				Karma: 10 
				Join Date: May 2008 
				
				
				
				Device: Sony Reader, Amazon Kindle 
				
				
				 | 
	
	
	
		
		
			
			 
			
			Hello, 
		
	
		
		
		
		
		
		
		
		
		
		
	
	When using html2lrf with the --chapter-regex option, can I apply the search for a tag set instead of what's between the tags? In other words, if I want anything using <H1></H1> tags to be a chapter can I set this up using the --chapter-regex option? Thanks.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#696 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			No chapter-regex only searches the contents of tags. You can force page breaks before tags based on ther tag names and attributes, using the various force-page-break options
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#697 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 9 
				Karma: 10 
				Join Date: Jul 2006 
				
				
				
				
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Maybe I'm asking a stupid question, but I couldn't find the answer using the search engine.  
		
	
		
		
		
		
		
		
		
		
		
		
	
	Here it goes: I have a .doc document. I created a TOC autodetecting its chapters. I saved and from here I created an html file. Everything worked and looked fine. Then I converted the html to lrf with Calibre and the TOC just lost all of its title descriptions. That's what I mean: I. Chapter one title.................5 II: Chapter two title................32 III. etc. in the lrf changed to: LINK.....................5 LINK....................32 LINK..etc Why am I loosing the chapter names? How could I bring them to the lrf file?  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#698 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			post the section of the HTML file that contains the TOC
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#699 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 9 
				Karma: 10 
				Join Date: Jul 2006 
				
				
				
				
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Could it be this part? 
		
	
		
		
		
		
		
		
		
		
		
		
		
			<w:Sdt SdtDocPart="t" DocPartType="Table of Contents" DocPartUnique="t" ID="21517536"> <p class=MsoTocHeading><span lang=ES>Contenido<w:sdtPr></w:sdtPr></span></p> <p class=MsoToc1 style='tab-stops:right dotted 481.55pt'><!--[if supportFields]><span lang=ES><span style='mso-element:field-begin'></span><span style='mso-spacerun:yes'> </span>TOC \o "1-3" \h \z \u <span style='mso-element:field-separator'></span></span><![endif]--><span lang=ES><span class=MsoHyperlink><span style='mso-no-proof:yes'><a href="#_Toc200384664"><span style='mso-fareast-font-family:"Times New Roman"'>Introducción</span><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'><span style='mso-tab-count:1 dotted'>. </span></span><!--[if supportFields]><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'><span style='mso-element:field-begin'></span></span><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'> PAGEREF _Toc200384664 \h </span><span style='color:windowtext; display:none;mso-hide:screen;text-decoration:none;text-underline:none'><span style='mso-element:field-separator'></span></span><![endif]--><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'>5</span><span style='color:windowtext;display:none; mso-hide:screen;text-decoration:none;text-underline:none'><!--[if gte mso 9]><xml> <w:data>08D0C9EA79F9BACE118C8200AA004BA90B02000000 080000000E0000005F0054006F006300320030003000330038 0034003600360034000000</w:data> </xml><![endif]--></span><!--[if supportFields]><span style='color:windowtext; display:none;mso-hide:screen;text-decoration:none;text-underline:none'><span style='mso-element:field-end'></span></span><![endif]--></a></span></span></span><span style='mso-ansi-language:CA;mso-fareast-language:CA;mso-no-proof:yes'><o  ></o ></span></p><p class=MsoToc1 style='tab-stops:right dotted 481.55pt'><span class=MsoHyperlink><span lang=ES style='mso-no-proof:yes'><a href="#_Toc200384665"><span style='mso-fareast-font-family:"Times New Roman"'>CAPITULO I</span><span style='color:windowtext;display:none;mso-hide:screen;text-decoration: none;text-underline:none'><span style='mso-tab-count:1 dotted'> </span></span><!--[if supportFields]><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'><span style='mso-element:field-begin'></span></span><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'> PAGEREF _Toc200384665 \h </span><span style='color:windowtext; display:none;mso-hide:screen;text-decoration:none;text-underline:none'><span style='mso-element:field-separator'></span></span><![endif]--><span style='color:windowtext;display:none;mso-hide:screen;text-decoration:none; text-underline:none'>19</span><span style='color:windowtext;display:none; mso-hide:screen;text-decoration:none;text-underline:none'><!--[if gte mso 9]><xml> <w:data>08D0C9EA79F9BACE118C8200AA004BA90B02000000 080000000E0000005F0054006F006300320030003000330038 0034003600360035000000</w:data> </xml><![endif]--></span><!--[if supportFields]><span style='color:windowtext; display:none;mso-hide:screen;text-decoration:none;text-underline:none'><span style='mso-element:field-end'></span></span><![endif]--></a></span></span><span style='mso-ansi-language:CA;mso-fareast-language:CA;mso-no-proof:yes'><o  ></o ></span></p>etc. Last edited by marwick; 06-08-2008 at 01:47 PM.  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#700 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Try saving as a "clean HTML" file from Word, that will remove all the microsoft junk and give you an easier to understand HTML file and hopefully better conversion results as well.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#701 | 
| 
			
			
			
			 zeldinha zippy zeldissima 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 27,827 
				Karma: 921169 
				Join Date: Dec 2007 
				Location: Paris, France 
				
				
				Device: eb1150 & is that a nook in her pocket, or she just happy to see you? 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			how do you save as a "clean html" file from word ? i didn't know that was possible, but it would be great if it is... i didn't find the option in the "format" dropdown list in the "save as" dialogue.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#702 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Depends on the version of word you have and unfortunately, I no longer have a copy of word on hand to check, but I'm sure someone will be able to tell you. At a guess look at the "export" options
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#703 | 
| 
			
			
			
			 zeldinha zippy zeldissima 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 27,827 
				Karma: 921169 
				Join Date: Dec 2007 
				Location: Paris, France 
				
				
				Device: eb1150 & is that a nook in her pocket, or she just happy to see you? 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			aaah... i suspected as much. i think my version is probably too old (2000). i'll just have to stick to writing my html code by hand. it's worked so far.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#704 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,784 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	|
| 
		
 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#705 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			rtf2lrf is not really as feature rich as html2lrf (it basically converts the RTF to html and then runs html2lrf on it)
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
            
| Tags | 
| html2lrf, libprs500 | 
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Change font of header for LRF Output on PRS 505 | duckbill | Calibre | 3 | 05-15-2010 12:07 PM | 
| Pissed off with LRF formatting: LRF/LRS clean tool? | grimborg | LRF | 8 | 02-15-2010 02:14 PM | 
| Fonts for LRF output | krischik | Calibre | 1 | 10-03-2009 06:01 AM | 
| CBZ > LRF (LRF>HTML/MOBI????) | sideburnt | Calibre | 4 | 09-15-2009 07:44 AM | 
| libprs500 Issues Converting .LIT to .LRF - .LRF crashes everything | vasbinde | Calibre | 6 | 02-14-2008 01:16 PM |