10-22-2015, 09:30 AM | #136 | ||
frumious Bandersnatch
Posts: 7,516
Karma: 19000001
Join Date: Jan 2008
Location: Spaniard in Sweden
Device: Cybook Orizon, Kobo Aura
|
Quote:
Quote:
EDIT: Scrambling to non-ASCII characters will probably cause problems with fonts: a font may have a character for "é", but not for "þ" (even though they are in the same group). And any scrambling will cause problems with subset fonts. Last edited by Jellby; 10-22-2015 at 09:35 AM. |
||
10-22-2015, 09:44 AM | #137 | |
The Grand Mouse 高貴的老鼠
Posts: 71,615
Karma: 306652114
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Voyage
|
Quote:
If you're using Unicode characters and strings (as you should) in Python, take a look at unicodedata.category(), which returns the abbreviated general category values for Unicode characters enumerated here. |
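A minimal sketch of what unicodedata.category() reports, in case it helps to see it. The helper name describe() and the sample string are made up for illustration; only unicodedata.category() itself comes from the standard library.

```python
import unicodedata

def describe(text):
    """Return (character, general category) pairs for each character in text."""
    return [(ch, unicodedata.category(ch)) for ch in text]

# Sample mixes ASCII, accented letters, an em dash (U+2014),
# a left curly quote (U+201C) and a digit.
sample = "a\u00c9\u00fe\u2014\u201c9"
for ch, cat in describe(sample):
    print(repr(ch), cat)
```

Letters come back as Lu/Ll, the em dash as Pd and the opening curly quote as Pi, so a scrambler could decide per category what to replace and what to leave alone.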
|
10-22-2015, 10:17 AM | #138 | |||
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Quote:
Quote:
Quote:
When I release the v0.2 version, perhaps some multilingual people, who are following this thread, will beta test examples of non-English books and report back on perceived issues? |
|||
10-22-2015, 11:18 AM | #139 | |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Quote:
My primary goals, at the moment, are to create something which is:
I think refining the scrambling algorithm would be the next logical step once the above is nearer to satisfactory - which may take a little while yet. If anyone thinks differently, feel free to comment.
In the long run, this Scrambling utility will only be of any practical use if the MR expert problem-solvers, who already handle most of the troubleshooting, encourage the user-with-problem-book to use it for the convenience of both. It's easy to imagine that a newly launched utility which promises to "Scramble your books - the easy way" may not be at the top of every user's wishlist. "Don't touch with a 10-foot bargepole" might be the more likely reaction. |
|
10-22-2015, 04:33 PM | #140 |
Resident Curmudgeon
Posts: 74,565
Karma: 129670952
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
What's going to happen to high-bit characters?
|
10-22-2015, 07:26 PM | #141 | |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
For the time being, exactly what is happening in v0.1. Chars like ÅÉçčè, which are valid Unicode, will be converted to random ASCII chars. Mdashes and curly quotes will be unchanged.
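As a rough illustration of that behaviour (not the plugin's actual code), here is a sketch that replaces every letter, accented or not, with a random ASCII letter and leaves punctuation such as mdashes and curly quotes alone. The function name scramble(), the case-preserving choice and the digit handling are all assumptions made for the example.

```python
import random
import string
import unicodedata

def scramble(text, rng=None):
    """Replace letters and digits with random ASCII ones; keep everything else."""
    rng = rng or random.Random()
    out = []
    for ch in text:
        cat = unicodedata.category(ch)
        if cat == "Lu":                      # uppercase letter, e.g. A or \u00c5
            out.append(rng.choice(string.ascii_uppercase))
        elif cat.startswith("L"):            # any other letter category
            out.append(rng.choice(string.ascii_lowercase))
        elif cat == "Nd":                    # decimal digit (assumed behaviour)
            out.append(rng.choice(string.digits))
        else:                                # punctuation, spaces, symbols
            out.append(ch)
    return "".join(out)

# \u00c5\u00e9 are accented letters; \u2014 is an em dash; \u201c/\u201d are curly quotes.
print(scramble("\u00c5\u00e9 \u2014 \u201chi\u201d", random.Random(0)))
```

The output has the same length as the input, which matters if the problem being diagnosed depends on layout.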
In the near-ish future see my above post, namely this bit: Quote:
One thing to bear in mind is that the calibre tools convert everything to unicode from the outset. For example, any named entities in the source book (e.g.  ), will not be present in the scrambled output. Whether this will confuse, compound or be totally irrelevant to the issue of high bits chars I'm really not sure. In the meantime maybe you could do some tests with v0.1 to gather some helpful examples of precisely where things go wrong with the current setup. It's always easier to work with facts than vague what-ifs. |
|
10-22-2015, 08:49 PM | #142 |
Ex-Helpdesk Junkie
Posts: 19,421
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Do they really? Thinking about ebook-edit for example...
I believe that is part of Beautify actually. Plus, it *can* replace them as you type -- which is an option that can be turned off in the settings. |
10-22-2015, 09:49 PM | #143 | |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Quote:
My comment arose from what I observed when testing with one of my AZW3 books plus its KindleUnpack epub counterpart. Neither has had any editing other than whatever KUnpk does. In both cases the books had fewer level 2 errors in the scrambled version than in the source. When I investigated, the disappearing errors all related to 'named entities present'. All had been replaced with their Unicode char equivalent. All I can say is that I haven't knowingly coded for them to be auto-replaced - but equally, I can't rule out having used default settings rather than an unknown (to me) optional setting which would have forced them to NOT be auto-replaced. If you foresee this auto-replacing as a problem I'd need to ask Kovid's advice. |
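For anyone wanting to reproduce the effect outside calibre, the following sketch shows the same kind of replacement using Python's standard html.unescape(). This is only an illustration of named entities turning into their Unicode equivalents; it is not a claim about how the calibre tools do it internally. The helper named_entities() and the sample markup are invented for the example.

```python
import html
import re

# Matches named entities such as &eacute; (numeric ones like &#233; are not matched).
NAMED_ENTITY = re.compile(r"&[A-Za-z][A-Za-z0-9]*;")

def named_entities(markup):
    """List the named entities found in a piece of markup."""
    return NAMED_ENTITY.findall(markup)

source = "caf&eacute;&nbsp;&mdash; na&iuml;ve"
converted = html.unescape(source)

print(named_entities(source))      # entities present before conversion
print(named_entities(converted))   # none left afterwards
print(converted)
```

Comparing the two lists before and after is a quick way to confirm whether a "named entities present" error vanished for this reason.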
|
11-01-2015, 04:10 PM | #144 |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
beta version 0.0.2 added
I've updated post #2. Comments welcome.
v0.0.2 - beta release Nov 1, 2015
|
11-03-2015, 06:21 PM | #145 | |
Wizard
Posts: 1,567
Karma: 7043711
Join Date: Mar 2013
Location: Rosario - Santa Fe - Argentina
Device: Kindle 4 NT
|
Quote:
To replace SVG images in the book to be scrambled, you'll find the following alternatives wherever the original book has an SVG image:
1. SVG images inside svg wrappers: in this case, you have to find the following statement: Code:
xlink:href="../Images/Name_of_the_original_image.svg"
2. SVG images with an <img> tag: this case is like any other image; you have to find: Code:
src="../Images/Name_of_the_original_image.svg"
3. There may be other cases where you can find SVG images. I will send you an epub with lots of SVG images (with images and text) so you can find the pattern (there is a pattern) to do the replacement. Regards |
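The first two cases above can be sketched as a single regular expression search. This is only an illustration: the function name find_svg_refs(), the file names in the sample and the choice of regex over a proper XML parser are assumptions, and it will not catch the "other cases" mentioned in item 3.

```python
import re

# Matches xlink:href="...svg" (svg wrapper) and src="...svg" (<img> tag),
# with either single or double quotes around the path.
SVG_REF = re.compile(
    r'''(?:xlink:href|src)\s*=\s*(["'])([^"']+\.svg)\1''',
    re.IGNORECASE,
)

def find_svg_refs(markup):
    """Return the paths of all SVG files referenced in the markup."""
    return [m.group(2) for m in SVG_REF.finditer(markup)]

# Hypothetical sample: one wrapped SVG, one <img> SVG, one non-SVG image.
sample = '''
<svg xmlns:xlink="http://www.w3.org/1999/xlink"><image xlink:href="../Images/cover.svg"/></svg>
<img src="../Images/diagram.svg" alt=""/>
<img src="../Images/photo.jpg" alt=""/>
'''
print(find_svg_refs(sample))
```

Each returned path could then be swapped for the name of a replacement image in the scrambled copy.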
|
11-18-2015, 08:16 PM | #146 |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
Due to the lack of beta-test volunteers I have no plans to pursue this project any further.
|
11-18-2015, 08:19 PM | #147 |
Grand Sorcerer
Posts: 12,255
Karma: 74007256
Join Date: Nov 2007
Location: Toronto
Device: Nexus 7, Clara, Touch, Tolino EPOS
|
|
11-18-2015, 10:56 PM | #148 |
null operator (he/him)
Posts: 20,668
Karma: 26966376
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
|
11-18-2015, 11:21 PM | #149 |
Ex-Helpdesk Junkie
Posts: 19,421
Karma: 85397180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
|
11-19-2015, 10:08 AM | #150 | |
Grand Sorcerer
Posts: 6,212
Karma: 16534894
Join Date: Sep 2009
Location: UK
Device: Kobo: KA1, ClaraHD, Forma, Libra2, Clara2E. PocketBook: TouchHD3
|
I hid the request in plain sight in post #2, perhaps I should have been more explicit, but I don't like to demand other people's free time. Or maybe it fell victim to tl;dnr syndrome.
Quote:
I don't really think anyone who has posted in this thread is a potential user of the tool, at least not directly, but I would suggest that everyone here can help by guiding potential users towards it when the situation demands. Which, as I only just noticed this morning, PeterT did here. Thanks, Peter, that's exactly what I think is needed. My intention was to launch the calibre plugin in the main Plugin subforum, but I don't expect it to be a #1 Bestseller. Before I do there are 4 main areas for beta-testing:
Last edited by jackie_w; 11-19-2015 at 10:22 AM. Reason: item 4 added |
|