|  10-22-2015, 09:30 AM | #136 | ||
| frumious Bandersnatch            Posts: 7,570 Karma: 20150435 Join Date: Jan 2008 Location: Spaniard in Sweden Device: Cybook Orizon, Kobo Aura | Quote: 
 Quote: 
 EDIT: Scrambling to non-ascii characters will probably cause problems with fonts: a font may a character for "é", but not for "þ" (even though they are in the same group). And any scrambling will cause problems with subset fonts. Last edited by Jellby; 10-22-2015 at 09:35 AM. | ||
|   |   | 
|  10-22-2015, 09:44 AM | #137 | |
| The Grand Mouse 高貴的老鼠            Posts: 74,423 Karma: 318076944 Join Date: Jul 2007 Location: Norfolk, England Device: Kindle Oasis | Quote: 
 if you're using unicode characters and strings (as you should) in python, take a look at unicodedata.category(), which returns the abbreviated general category values for Unicode characters enumerated here. | |
|   |   | 
|  10-22-2015, 10:17 AM | #138 | |||
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | Quote: 
 Quote: 
 Quote: 
 When I release the v0.2 version, perhaps some multilingual people, who are following this thread, will beta test examples of non-English books and report back on perceived issues? | |||
|   |   | 
|  10-22-2015, 11:18 AM | #139 | |
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | Quote: 
  My primary goals, at the moment, are to create something which is: 
 I think refining the scrambling algorithm would be the next logical step once the above is nearer to satisfactory - which may take a little while yet. If anyone thinks differently, feel free to comment. In the long run, this Scrambling utility will only be of any practical use if the MR expert problem-solvers, who already handle most of the troubleshooting, encourage the user-with-problem-book to use it for the convenience of both. It's easy to imagine that a newly launched utility which promises to Scramble your books - the easy way may not be at the top of every user's wishlist.  Don't touch with a 10-foot bargepole might be the more likely reaction. | |
|   |   | 
|  10-22-2015, 04:33 PM | #140 | 
| Resident Curmudgeon            Posts: 80,700 Karma: 150249619 Join Date: Nov 2006 Location: Roslindale, Massachusetts Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3 | 
			
			What's going to happen to high-bit characters?
		 | 
|   |   | 
|  10-22-2015, 07:26 PM | #141 | |
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | 
			
			For the time being, exactly what is happening in v0.1. Chars like ÅÉçčè, which are valid unicode, will be converted to random ascii chars. Mdashes and curly quotes will be unchanged In the near-ish future see my above post, namely this bit: Quote: 
 One thing to bear in mind is that the calibre tools convert everything to unicode from the outset. For example, any named entities in the source book (e.g.  ), will not be present in the scrambled output. Whether this will confuse, compound or be totally irrelevant to the issue of high bits chars I'm really not sure. In the meantime maybe you could do some tests with v0.1 to gather some helpful examples of precisely where things go wrong with the current setup. It's always easier to work with facts than vague what-ifs. | |
|   |   | 
|  10-22-2015, 08:49 PM | #142 | 
| Ex-Helpdesk Junkie            Posts: 19,421 Karma: 85400180 Join Date: Nov 2012 Location: The Beaten Path, USA, Roundworld, This Side of Infinity Device: Kindle Touch fw5.3.7 (Wifi only) | 
			
			Do they really? Thinking about ebook-edit for example... I believe that is part of Beautify actually. Plus, it *can* replace them as you type -- which is an option that can be turned off in the settings. | 
|   |   | 
|  10-22-2015, 09:49 PM | #143 | |
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | Quote: 
 My comment arose from what I observed when testing with one of my AZW3 books plus its KindleUnpack epub counterpart. Neither have had any editing other than whatever KUnpk does. In both cases the books had fewer level 2 errors in the scrambled version than in the source. When I investigated the disappearing errors all related to 'named entities present'. All had been replaced with their unicode char equivalent. All I can say is that I haven't actively knowingly coded for them to be auto-replaced - but equally, I can't rule out having used default settings rather than an unknown (to me) optional setting which would have forced them to NOT be auto-replaced. If you forsee this auto-replacing as a problem I'd need to ask Kovid's advice. | |
|   |   | 
|  11-01-2015, 04:10 PM | #144 | 
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | 
				
				beta version 0.0.2 added
			 
			
			I've updated post #2. Comments welcome.   v0.0.2 - beta release Nov 1, 2015 
 | 
|   |   | 
|  11-03-2015, 06:21 PM | #145 | |
| Wizard            Posts: 1,876 Karma: 8821117 Join Date: Mar 2013 Location: Rosario - Santa Fe - Argentina Device: Kindle 4 NT | Quote: 
 In order to replace svg images (in the book to be scrambled), in the original book, where there is a svg image, you'll find yourself with the following alternatives: 1. Svg images inside svg wrappers: In this case, you have to find the following statement: Code: xlink:href="../Images/Name_of_the_original_image.svg" 2. Svg images with <img> tag: In this case is like any other image; you have to find: Code: src="../Images/Name_of_the_original_image.svg" 3. It can be others cases where you can find svg images. I will send you an epub with lots of svg images (with images and text) so you can find the pattern (there is a pattern  ) to do the replace. Regards | |
|   |   | 
|  11-18-2015, 08:16 PM | #146 | 
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | 
			
			Due to the lack of beta-test volunteers I have no plans to pursue this project any further.
		 | 
|   |   | 
|  11-18-2015, 08:19 PM | #147 | 
| Grand Sorcerer            Posts: 13,687 Karma: 79983758 Join Date: Nov 2007 Location: Toronto Device: Libra H2O, Libra Colour |     | 
|   |   | 
|  11-18-2015, 10:56 PM | #148 | 
| null operator (he/him)            Posts: 22,008 Karma: 30277294 Join Date: Mar 2012 Location: Sydney Australia Device: none | |
|   |   | 
|  11-18-2015, 11:21 PM | #149 | 
| Ex-Helpdesk Junkie            Posts: 19,421 Karma: 85400180 Join Date: Nov 2012 Location: The Beaten Path, USA, Roundworld, This Side of Infinity Device: Kindle Touch fw5.3.7 (Wifi only) | |
|   |   | 
|  11-19-2015, 10:08 AM | #150 | |
| Grand Sorcerer            Posts: 6,266 Karma: 16544702 Join Date: Sep 2009 Location: UK Device: ClaraHD, Forma, Libra2, Clara2E, LibraCol, PBTouchHD3 | 
			
			I hid the request in plain sight in post #2, perhaps I should have been more explicit, but I don't like to demand other people's free time. Or maybe it fell victim to tl;dnr syndrome. Quote: 
 I don't really think anyone who has posted in this thread is a potential user of the tool, at least not directly, but I would suggest that everyone here can help by guiding potential users towards it when the situation demands. Which, as I only just noticed this morning, PeterT did here. Thanks, Peter, that's exactly what I think is needed. My intention was to launch the calibre plugin in the main Plugin subforum, but I don't expect it to be a #1 Bestseller. Before I do there are 4 main areas for beta-testing: 
 Last edited by jackie_w; 11-19-2015 at 10:22 AM. Reason: item 4 added | |
|   |   | 
|  | 
| 
 | 
|  Similar Threads | ||||
| Thread | Thread Starter | Forum | Replies | Last Post | 
| Sony ereader troubleshoot | chrisms | Sony Reader | 3 | 10-02-2013 02:08 PM | 
| Out of copyright ebooks/writers you'd most like to see digitized? | pstjmack | Reading Recommendations | 18 | 09-14-2012 08:46 PM | 
| whispersync not working: can anyone help me troubleshoot? | rheostaticsfan | Amazon Kindle | 4 | 10-27-2011 08:09 AM | 
| troubleshoot battery life | kkinser | Amazon Kindle | 2 | 04-18-2011 09:05 PM | 
| The copyright issues of fan fiction eBooks | Kirok | Lounge | 33 | 12-08-2008 06:54 PM |