|  11-22-2022, 03:10 AM | #1 | 
| Connoisseur            Posts: 72 Karma: 800000 Join Date: Jun 2021 Device: Kindle Paperwhite (PW1|PW3|PW4), Kindle Voyage |  Automate a page split... Find & Replace doesn't work for this 
			
			I've been trying to find a way to split a page before every <h1> heading & before & after every image with the word "insert" in the name. I have found many results but all of them end up saying the same thing "use Regex/Find & Replace" &/or "add a Page Break" but the problem is that I'm not just needing to replace some text in a page, rather what I need is to take a page & split it right before a set, repeated entry. Some of the topics I found seemed to be asking the same thing but were not clear enough to prevent misunderstanding & just seem to end with the answering party believing they solved the problem. What I am currently doing is just a Find for <h1> & clicking before the found text & selecting Edit > Split at Cursor. I then do another search, this time for <img &, after looking at it to be sure it's an image that is intended to be on it's own page, splitting at cursor before & after the image. I then go & rename the pages, with a set pattern, the image pages are named what the image is "insert1", "insert2", etc, the ones that start at headings are named "ch#_" with the consecutive number to the previous one, (usually starting at "0" if there's a "Prologue" chapter but if an automation would require it to always start at 1 I'm fine with that too), followed by the chapter's 1st 7 non-whitespace characters in the <h1> name. Is there any way to shorten this process? I have a saved search for the 2, but the time that saves seems to be none, only really helping in the potential future where I won't have to remember what to search for. I'm new to Sigil, & feel I must be missing something because this seems like regular things people would want to do, but all my searches just seem to go cold with the same non-answer. | 
|   |   | 
|  11-22-2022, 03:26 AM | #2 | 
| Guru            Posts: 899 Karma: 3501166 Join Date: Jan 2017 Location: Poland Device: Various | 
			
			Ad h1: Search: <h1 Replace: <hr class="sigil_split_marker" /><h1 + Split at Markers (F6). Ad images: Show sample code with img. The solution will be very similar. 1. Find something that is usually at the beginning of the image code. 2. Put an additional splitting code in front of it. 3. Press F6 to split the files. | 
|   |   | 
|  11-22-2022, 09:57 AM | #3 | 
| A Hairy Wizard            Posts: 3,394 Karma: 20212733 Join Date: Dec 2012 Location: Charleston, SC today Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire | 
			
			Something along the lines of: <img insert alt="" src="../Images/example.jpg"/> -OR- <img alt="" src="../Images/insertexample.jpg"/> Code: find: <img(.*?)insert(.*?)/>
replace: <hr class="sigil_split_marker" /><img\1\2/><hr class="sigil_split_marker" />click Edit/Split at Markers (F6) Last edited by Turtle91; 11-22-2022 at 10:02 AM. | 
|   |   | 
|  11-22-2022, 10:37 AM | #4 | 
| A Hairy Wizard            Posts: 3,394 Karma: 20212733 Join Date: Dec 2012 Location: Charleston, SC today Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire | 
			
			....although that brings up the question of WHY you want an image to be on it's own page?? If it truly needs to be all by itself on the page then I would recommend wrapping it in an svg...that solves most of the image issues. Spoiler: 
 You can even give Sigil a template file that will do this for you when you select TOOLS/Add Cover. It automagically adds the proper image width/height/filename Just add the COMPLETE xhtml file to your Sigil Preferences Location (EDIT/PREFERENCES/Open Preferences Location) and name it "cover.xhtml" Here is the one I use... you can modify it however you wish: Spoiler: 
 Edit: I almost forgot to mention CalibUser's Sigil plugin: InsertImageSVG Last edited by Turtle91; 11-22-2022 at 10:41 AM. | 
|   |   | 
|  11-25-2022, 09:03 PM | #5 | |
| Connoisseur            Posts: 72 Karma: 800000 Join Date: Jun 2021 Device: Kindle Paperwhite (PW1|PW3|PW4), Kindle Voyage | Quote: 
 I have found that many, if not most, official books have this format with a new xhtml for the image & another after the image. I'm not opposed to a different approach, but that seems to be a common industry standard for professionally done ePubs, so I have no intention of reinventing the wheel, there's probably a reason they do it that way, that way looks nice from my experience, & looks nice across different devices, so unless there's a different way that is clearly better for some reason I'll stick to the tried & true method personally. As far as the SVG goes... I'm not exactly sure what that's supposed to do. All I know for SVG is the Vector Graphics format & the images I have are not vector graphics. From the code you shared it looks like the image is being scaled to a specific resolution, which seems unnecessary & gives the same problem I'm trying to solve with compatibility. The issues that cause a desire for the images to be on separate pages? Sometimes the image is just under the resolution of the screen being used, for example if the epub was made from images sized to the old PW1 & are displayed on a PW3 you end up with 1-3 lines of text above or below the image. this is annoying & completely avoided when using a separate page for the image. I have run into the issue a few times where images were made smaller than my PW3 screen & they don't fill the page, but instead leave a blank area around it, that's not ideal, but better, IMO at least, to having the image with a single or 2 lines of text. Even when it had 4 lines I'd rather the text just start on the next page. If I were to use your SVG code then the image would be scaled to width="400" height="600" or height="99vh" width="99vw" which doesn't sound like it is going to be good unless it is made for only a single device & not intended to be used on another device. As someone who has 2 PW3s, 2 PW1s, my Dad's old Kindle that is lower resolution, & people who use tablets & iOS & Android phones in my house building to a single dimension for one device leaves problems for other devices. With that in mind, would you be able to give a brief explanation on why the SVG thing is what you think is better? I'm new to Sigil so I'm very much still learning. I just don't understand what it benefits from adding all that extra code when it doesn't seem to be better than what most pro-books do. Is there something I'm missing? I'm also still trying to figure out what SVG means in your context, Scalable Vector Graphics are nice, but they have major limitations, you can't really have vector graphics with a lot of detail, & they take a lot more processing power to display than JPG or PNG images, with the more detail an image has requiring more processing power, & with illustrations that is gonna be a lot, even more than WEBP possibly. So if you are converting images to SVG that sounds problematic & definitely won't be supported across many devices. Is it something else? | |
|   |   | 
|  11-25-2022, 09:28 PM | #6 | |
| Connoisseur            Posts: 72 Karma: 800000 Join Date: Jun 2021 Device: Kindle Paperwhite (PW1|PW3|PW4), Kindle Voyage | Quote: 
 At the very least it's a lot better than what I was previously dealing with. Thanks a bunch | |
|   |   | 
|  11-26-2022, 12:03 AM | #7 | |
| Wizard            Posts: 2,306 Karma: 13057279 Join Date: Jul 2012 Device: Kobo Forma, Nook | Quote: 
 What he is talking about is an "SVG Wrapper"... not an actual SVG (vector image). This will scale your bitmap image properly to fit any screen resolution + rotation. (See the thread for examples.) - - - Side Note: There are many awesome advantages of vector images too. Especially Formulas/Equations + Charts/Graphs and all sorts of computer-generated things. (These can be scaled infinitely or zoomed in with zero loss in quality.) Vector images are not intended for photographs. Last edited by Tex2002ans; 11-26-2022 at 12:05 AM. | |
|   |   | 
|  11-26-2022, 04:12 AM | #8 | |
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | Quote: 
 See the "Filter Replacements" in the Find and Replace chapter in the latest Sigil User Guide for details. Last edited by KevinH; 11-26-2022 at 04:16 AM. | |
|   |   | 
|  11-26-2022, 05:18 AM | #9 | 
| Connoisseur            Posts: 97 Karma: 21870 Join Date: Apr 2021 Location: Spain Device: Kobo Libra 2 | 
			
			When I want a single image on a page I use the following styles.  To centre the image vertically, depending on its height I use a class that has the appropriate top margin (alone00 for none, alone10 por 10%, etc). Code: figure { margin:1em auto; padding:0; text-align:center; }
img { border:0; margin:0; padding:0; width:100%; }
figcaption { 
	margin:0; padding:.4em 0 .3em; text-align:center;
	text-indent:0; font-size:.82em; font-family:sans-serif;
}
figure.alone00 { page-break-before:always; margin-top:0; margin-bottom:0; }
figure.alone10 { page-break-before:always; padding-top:10%; page-break-after:always; }
figure.alone20 { page-break-before:always; padding-top:20%; page-break-after:always; }Code:   <p>Text.</p>
  <figure class="alone00">
    <img alt="" src="../images/1.jpg" />
    <figcaption><b>Fig. 1:</b> Text.</figcaption>
  </figure>
  <p>Text.</p> | 
|   |   | 
|  11-26-2022, 06:49 AM | #10 | 
| A Hairy Wizard            Posts: 3,394 Karma: 20212733 Join Date: Dec 2012 Location: Charleston, SC today Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire | 
			
			SVG wrapper's are a workaround to provide functionality that is not included in current CSS/HTML. It does not reformat a jpg/png to SVG. One awesome function is the 'preserveAspectRatio' which, as it states, preserves the images aspect ratio...It prevents an image from getting clipped, or stretched, in the wrong direction...which is a failing of the method repilo mentioned. No matter the orientation of the display, or weather the image is portrait or landscape, the image will display properly on the 'page'. There will be blank space to fill whichever dimension needs it. the 'xMidYMid meet' is one of the options that simply says 'center the image on the available display vertically and horizontally. The image height and width is not setting, or restricting, the resolution. It is informing the svg shell what the images height and width are. This is used in conjunction with the 'viewbox' setting to determine which portion of the image to display. Most often you want to display the whole image, so you would set the viewbox from the origin (0 0) to the imageWidth imageHeight (as in '0 0 600 889' for an image that is 600 pixels wide by 889 tall). You can focus on only one portion of the image by restricting those coordinates. For more details check out tex's link. Cheers! | 
|   |   | 
|  11-26-2022, 10:41 AM | #11 | |||
| Connoisseur            Posts: 72 Karma: 800000 Join Date: Jun 2021 Device: Kindle Paperwhite (PW1|PW3|PW4), Kindle Voyage | Quote: 
 Just like I thing WEBP images are nice, you can get better compression with less quality loss, but there's a big difference in what the device reading it has to do to see it & that makes putting them in an eBook not ideal either. It's like if you have an ePub with the highest level of zip compression, it takes a tiny bit longer to open & page turns are slightly slower, & your computer uses a bit more RAM to view it. WEBP has the same problem & for a device like a kindle, with a much slower processor, that makes more of a difference. & it impacts the battery life. They save some space, but there's no such thing as a free lunch, there's a cost. It doesn't matter for a computer or modern phone, but the difference is actually measurable. Now take a device that only has ½GB or ¼GB of RAM instead of 32-128GBs & 1GHz or ¾GHz CPU instead of 3-5GHz with a single core instead of 6-32 cores & what is measurable but hardly noticeable on a computer makes a much bigger difference. Now if it's just a wrapper designed to work with vector images that's fine, I just don't see why it's necessary, not that I don't see a use for the technology. Quote: 
 Code: <?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" lang="en" xml:lang="en">
<head>
  <meta content="text/html; charset=UTF-8" http-equiv="default-style"/>
  <title>The Book Title Here</title>
  <link href="../Styles/stylesheet.css" rel="stylesheet" type="text/css"/>
</head>
<body class="nomargin center">
  <section epub:type="bodymatter chapter" id="insert1">
    <img alt="insert1" class="insert" src="../Images/Insert1.jpg"/>
  </section>
</body>
</html>In fact before trying to do more with sigil I would take my ePubs, unzip them, rotate all wide images, then batch scale the images to slightly bigger than the largest screen we will be using, convert the non-color images to greyscale to save a few ks since they generally are RGB even though they are greyscale images for some reason. I generally take a 40MB ePub & turn it into a 6MB ePub by doing just that & convverting ` and ` to ` & ` which usually cuts out 2-8 pages but probably increases the size as it actually consists of 2 more characters in data with the `&`. The ones I don't do will, just like you said, show the wide image taking up half of the screen because it's aspect ratio is preserved naturally, without any special code making that happen. If a tall image is not the screen ratio it will have empty space on the side, it won't get clipped or stretched on the kindle at all, & I've only seen that clipping on phone/tablets when the image in inline with the text. Calibre's ePub Previewer will do that to the inline images if the window is small, but not to the ones that are like what I shared, probably why the Professional publishers use that format... The only difference I see from that extra steps & extra code is centering the image... Which I don't think worth that effort in the slightest. The one that I split before & after the image: Code: <?xml version="1.0" encoding="utf-8"?>
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" xmlns:epub="http://www.idpf.org/2007/ops" epub:prefix="z3998: http://www.daisy.org/z3998/2012/vocab/structure/#" lang="en" xml:lang="en">
<head>
    <title>Part 1</title>
    <link href="book.css" rel="stylesheet" type="text/css"/>
  </head>
  <body>
<section epub:type="bodymatter chapter"><div class="main"><img src="../Images/insert_1.jpg"/>
</div>
</section>
</body>
</html>Quote: 
 Currently I solve the empty page problem with Code: Find: <br/><br/></p>
      <h1
Replace: </p><hr class="sigil_split_marker" /><h1I've heard the reasons for the SVG Wrapper, & I thank you for helping me understand it, but I'm now confident that it's not something I'm interested in. I'm more concerned with trying to find a way to improve the automation of splitting the images & find a way to automate renaming of pages if possible | |||
|   |   | 
|  11-26-2022, 11:52 AM | #12 | |||||||
| Wizard            Posts: 2,306 Karma: 13057279 Join Date: Jul 2012 Device: Kobo Forma, Nook | Quote: 
 
 Then you're already using SVG Wrappers. It's the same exact code. Is the EPUB reader exploding? Quote: 
 You do know that fonts are essentially just vectors—math curves that the device will scale to fit the screen... - - - And, as a complete sidenote, one of the bugs/quirks of SVG (Wrappers)+Amazon-conversion is giving you exactly what you were wanting... page-breaks before and after all your images! LOL. Quote: 
 And you definitely want to avoid using them in EPUBs for now (because most readers can't support that format yet/ever). For more discussion, see: Quote: 
 Quote: 
 The second you: 
 the "cutoff" image or "oompa loompa" effect more easily happens. While you can try to MITIGATE this, by shoving each image into its own HTML file... that just isn't possible when you're dealing with inline images. For example, what if you have text that's: Code: <p>As can be seen in Figure 1:</p> <div class="image"> <img [...]> <p class="caption">Figure 1: A thing</p> </div> <p>the issue can be seen. Also, see Figure 2:</p> 
 Yes, in some books, the single-file-per-image-method might work—like photographs between (or at the end of) each chapter—but in all the ebooks I've worked on, only a handful would even fall into that category. (I've professionally converted 650+ books, mostly Non-Fiction.) Quote: 
 Quote: 
 
 is the best way. But you trying to split the files before/after every image is a poor idea. Better to use good, clean HTML+CSS and rely on page-break-before + page-break-after. See my posts in: Last edited by Tex2002ans; 11-26-2022 at 12:05 PM. | |||||||
|   |   | 
|  11-26-2022, 12:28 PM | #13 | |
| Sigil Developer            Posts: 9,070 Karma: 6361556 Join Date: Nov 2009 Device: many | Quote: 
 Last edited by KevinH; 11-26-2022 at 04:08 PM. | |
|   |   | 
|  11-26-2022, 12:49 PM | #14 | 
| A Hairy Wizard            Posts: 3,394 Karma: 20212733 Join Date: Dec 2012 Location: Charleston, SC today Device: iPhone 15/11/X/6/iPad 1,2,Air & Air Pro/Surface Pro/Kindle PW & Fire | 
			
			LostOnTheLine: It appears you think there is some significance to be gained by emphasizing 'Professional Publishers'... First, there are multiple individuals here on MR that are 'Professional Publishers'...they come here to help the not so professional people...to keep them from making egregious mistakes...and to keep abreast of the goings on in the ePub world. I wouldn't dismiss what you see here as 'less than professional'.  Second, the majority of people who post here regularly are VERY experienced in cleaning up the crap code that is published by those you refer to as 'professional'...it really is quite bad sometimes... some of those here can identify the 'professional publisher' just by looking at the method of their coding.... you see, the big 'professional publishers' usually don't code the html themselves... it is sub-contracted out to someone, or who uses a template and/or a software that produces absolute junk. Anyway, assuming what you see in 'professional publisher's' books is the 'right/best way' to do things... well following 'that path leads to the Dark Side', 'Here There Be Dragons', 'Danger Will Robinson!' Third, your examples didn't have the CSS associated with them, so I'm not sure if you have some magical method of fixing the image problem that people have been fighting for many years...it would be interesting to see that. My bet is that it would take me about 3 seconds to 'break' that image display...and 2.5 of those seconds would be loading the book and flipping to that page. As always, you are completely free to use any of the techniques you find here, or not... I'm certainly not trying to say you MUST use my techniques... Just be aware that your logic behind turning down that technique doesn't quite cover it. Perhaps it's because we didn't explain it very well... Good luck to you, and Cheers! | 
|   |   | 
|  11-26-2022, 02:48 PM | #15 | 
| Resident Curmudgeon            Posts: 80,682 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			@Tutrle91, that was very well said.
		 | 
|   |   | 
|  | 
| Tags | 
| automation, find and replace, sigil file splitting, split-header, split-image | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Search and replace sometimes doesn't work across all files | GregTheGrate | Editor | 15 | 11-10-2021 01:49 AM | 
| Calibre2OPDS split tags just doesn't work | DavidTC | Related Tools | 16 | 11-04-2014 09:58 AM | 
| Regex Help: Find page number & Replace+Remove 2x Line Breaks in Sigil | Contre-jour | Sigil | 9 | 02-01-2013 10:47 AM | 
| Search & Replace doesn't work for quotes | habanr | Conversion | 11 | 04-22-2011 11:50 AM | 
| Find, replace, split chapter? | laelaps | Sigil | 8 | 01-20-2011 04:01 PM |