| 
			
			 | 
		#1 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 6 
				Karma: 10 
				Join Date: Mar 2013 
				
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
			
			 
				
				Convert HTML to RTF with Page Breaks
			 
			
			
			I've been trying to convert HTML files to RTF to be edited in Word. I would like there to be page breaks at each page but have not been able to get Calibre to do so. This is what I'm working with. Any clues? 
		
	
		
		
		
		
		
		
		
		
		
		
	
	<pagenum page="normal" id="p6" smilref="Fudge_a_Mania00001.smil#p6">6</pagenum> </level2> <level2 id="level2_000004"> <h2 id="h2_000004"> <strong id="strong_000003" smilref="Fudge_a_Mania00001.smil#strong_000003">2</strong> <span class="text" id="span_000019" smilref="Fudge_a_Mania00001.smil#span_000019">Pete and</span> <span class="text" id="span_000020" smilref="Fudge_a_Mania00001.smil#span_000020">Farl ey</span> </h2>  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#2 | 
| 
			
			
			
			 Wizard 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 4,553 
				Karma: 950151 
				Join Date: Nov 2008 
				
				
				
				Device: Sony PRS-950, iphone/ipad (Marvin/iBooks/QuickReader) 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Why do you need to convert the files to rtf?  Word can read and edit html files.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#3 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,784 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Why not edit in Sigil and create an ePub that you can use as a source format?
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		
 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#4 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 6 
				Karma: 10 
				Join Date: Mar 2013 
				
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			The objective is to turn Daisy books into large print paper books for visually impaired kids.   I get the Daisy books from Bookshare that contain 10 files including XML, XSL, CSS and OPF. Since not all our kids can have eReaders I make them paper books from this in Word. Any ideas? The way I do it now is very time consuming and I know there must be a quicker way.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#5 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,784 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			You could try using Calibre to convert to RTF and see how that goes.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		
 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#6 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 6 
				Karma: 10 
				Join Date: Mar 2013 
				
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			Calibre seems like a good option but I can't make it keep page breaks in Word. I thought maybe that was possible under Structure Detection when converting but I have had no luck. The output is a RTF with no page breaks.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#7 | |
| 
			
			
			
			 Well trained by Cats 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 31,268 
				Karma: 61916422 
				Join Date: Aug 2009 
				Location: The Central Coast of California 
				
				
				Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A 
				
				
				 | 
	
	
	
		
		
		
		
		 Quote: 
	
 Did you try setting the 'Page Setup' to 'Generic' ?  | 
|
| 
		
 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#8 | 
| 
			
			
			
			 null operator (he/him) 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 22,018 
				Karma: 30277294 
				Join Date: Mar 2012 
				Location: Sydney Australia 
				
				
				Device: none 
				
				
				 | 
	
	|
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#9 | 
| 
			
			
			
			 Resident Curmudgeon 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 80,784 
				Karma: 150249619 
				Join Date: Nov 2006 
				Location: Roslindale, Massachusetts 
				
				
				Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			That is a very good idea to add something to the source to indicate page breaks and then search/replace into a page break.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		
 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#10 | 
| 
			
			
			
			 null operator (he/him) 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 22,018 
				Karma: 30277294 
				Join Date: Mar 2012 
				Location: Sydney Australia 
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			My assumption was that the 'something' comes via the input, and I hoped my conditional statement implied that, but let's be pedantically explicit   
		
	
		
		
		
		
		
		
		
		
		
		
	
	If there's 'something' in the output RTF that ORIGINATES FROM THE INPUT that indicates 'force new page here' eg a squiggly line Code: 
	~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ BR  | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#11 | 
| 
			
			
			
			 creator of calibre 
			
			![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,609 
				Karma: 28549044 
				Join Date: Oct 2006 
				Location: Mumbai, India 
				
				
				Device: Various 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			IIRC, there is no support for page breaks in the RTF output plugin (which is largely unmaintained anyway).
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
| 
			
			 | 
		#12 | 
| 
			
			
			
			 Junior Member 
			
			![]() Posts: 6 
				Karma: 10 
				Join Date: Mar 2013 
				
				
				
				Device: none 
				
				
				 | 
	
	
	
		
		
		
		
		 
			
			This is a good idea. I will try to find something to do this. Thanks for the suggestion.
		 
		
	
		
		
		
		
		
		
		
		
		
		
	
	 | 
| 
		 | 
	
	
	
		
		
		
		
			 
		
		
		
		
		
		
		
			
		
		
		
	 | 
![]()  | 
            
        
    
            
  | 
    
			 
			Similar Threads
		 | 
	||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| page breaks in html document | michaelsmith1983 | Conversion | 1 | 03-06-2012 11:32 PM | 
| HTML to MOBI conversion ignores page breaks | LeftHanded Matt | Conversion | 2 | 12-21-2011 01:25 PM | 
| RTF conversion problem - no page breaks | jhsrennie | Conversion | 7 | 06-16-2011 02:29 PM | 
| Cannot Convert HTML to RTF | LightGuard | Calibre | 1 | 06-27-2010 11:37 AM | 
| RTF vs HTML---best way to convert my files? | ficbot | Workshop | 16 | 05-06-2010 07:05 PM |