![]() |
#76 |
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
|
![]() |
![]() |
![]() |
#77 |
Addict
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 224
Karma: 55704
Join Date: Sep 2013
Device: Kobo Glo
|
I also have a large collection of books which stalled at 80% during Kobo Processing of the entire collection.
I discovered via Sigil and Calibre that several books had major errors in their TOC and structural errors in the code, I first cleaned them in Sigil, then fixed the TOC in Calibre and bug checked in calibre. With a large unverified collection I would batch load the books. Whenever I do a clean install of my entire collection to my Glo I batch load then, 200 per batch, let the Glo process 200 at a time and it works fine. Last edited by leftright; 06-18-2015 at 12:27 AM. |
![]() |
![]() |
![]() |
#78 |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 977
Karma: 2209358
Join Date: Nov 2011
Location: London, UK
Device: Kobo Aura, Kobo Aura ONE, PocketBook InkPad Color 3
|
Breaking down the adobehost strace a little more.
First we can see some timestamp information in the clock_gettime calls. The numbers printed are time since some point in { seconds, nanoseconds }. At 269.466764989 the file is opened, the first 16KB is read, and the file's closed. There isn't anything you can do with the first bit of a zip file except check the first bytes match "PK". Then the file is opened again, and the last 2744 bytes read. I think that's a required zip file trailer, which lets the program find stuff in the rest of the file. Then the file is mmap()ed in, then closed, and then unmapped. Just before 269.521851864, 1089 bytes is written to nickel including the text "DC.creator". So this looks like some (probably all) of the relevant metadata from the OPF file. At 269.526128239, the file is opened for the third time, the first 16KB is read, and the file closed. Then the file is opened again, the last 2744 bytes read, and then the whole file is mmap()ed again. After the file is closed and unmapped, 2213 bytes is written back including some file:// URL. We can't see most of this. There's no disk writes. Could this be the TOC? However something is also logged at this point "<15>Jun 17 16:37:09 adobehost: v"... which may be worth finding. It is now 270.338781864. About 1/2 a second later at 270.801502864 the file is opened again so its first 16KB can be read again Can we deduce anything from this? Well, the metadata extraction from the OPF is quick (55ms) so that doesn't seem to be the issue. Something else big is being passed back, which contains a file URL. Given the tests in this thread I'd guess that's the TOC. Generally the process looks inefficient because it keeps opening and closing the same file. I wonder if reading 16KB matches the block size used on the filesystem. Overall the work took from 269.466764989 to 270.801502864, a little less than 1.5 seconds, but at least a third of that time is waiting for nickel. BTW nice to see some Karl May! I've only ever read English translations of his books though :-( |
![]() |
![]() |
![]() |
#79 | ||||||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
Quote:
Quote:
Quote:
Quote:
Quote:
|
||||||
![]() |
![]() |
![]() |
#80 |
Linux User
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,282
Karma: 6123806
Join Date: Sep 2010
Location: Heidelberg, Germany
Device: none
|
Don't think so. I could do a block dump but it will only catch the first read, not subsequent re-reads from cache.
Well, that's handled by the Linux VFS cache. So it's more like a memory copy than a physical read. But it raises questions regarding how the data is processed behind the scenes. Reading 16KB of already cached data might not take particularly long. But re-parsing and re-validating of the same data might add up. |
![]() |
![]() |
![]() |
#81 |
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 79,763
Karma: 145864619
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
I've read of Kindles slowing down due to problematic books when indexing.
|
![]() |
![]() |
![]() |
#82 | ||||
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 977
Karma: 2209358
Join Date: Nov 2011
Location: London, UK
Device: Kobo Aura, Kobo Aura ONE, PocketBook InkPad Color 3
|
Quote:
Quote:
Quote:
Quote:
Getting timestamps in the strace output would be useful. I can't see where the time's spent apart from in the OPF scanning (55ms). It looks a bit like the TOC stuff which is ironic if it isn't used any more. |
||||
![]() |
![]() |
![]() |
#83 | ||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
Quote:
Last edited by davidfor; 06-18-2015 at 08:03 AM. Reason: Fixed the quoting. |
||
![]() |
![]() |
![]() |
#84 |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
|
![]() |
![]() |
![]() |
#85 | |
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 101
Karma: 470830
Join Date: Nov 2012
Location: Berlin
Device: Kobo Libra 2
|
Quote:
As is, we get the worst of both options. The device is unusable, and the software won't even tell us which of the books it finds offensive. The other aspect is a question of priorities. The normal book-reading with turning a page every other minute can't be all that CPU-intensive. Seems like a no-brainer to let the user do that while you do whatever processing you think you have to do in the background? Then give them a message about the results of your processing? |
|
![]() |
![]() |
![]() |
#86 |
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 3,058
Karma: 54671821
Join Date: Feb 2012
Location: New England
Device: PW 1, 2, 3, Voyage, Oasis 2 & 3, Fires, Aura HD, iPad
|
|
![]() |
![]() |
![]() |
#87 | |
Guru
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 977
Karma: 2209358
Join Date: Nov 2011
Location: London, UK
Device: Kobo Aura, Kobo Aura ONE, PocketBook InkPad Color 3
|
Quote:
I wonder if we can figure out some tests to see what about the ToC extraction is so slow. Does nesting make a difference - as Kobo doesn't support multiple levels? |
|
![]() |
![]() |
![]() |
#88 | |
Bibliophagist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 46,273
Karma: 168983734
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Quote:
Once the ebooks are cleaned up and transferred sucessfully to a Kobo, I don't worry about smaller batches for those ebooks. |
|
![]() |
![]() |
![]() |
#89 | ||
Grand Sorcerer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 24,905
Karma: 47303824
Join Date: Jul 2011
Location: Sydney, Australia
Device: Kobo:Touch,Glo, AuraH2O, GloHD,AuraONE, ClaraHD, Libra H2O; tolinoepos
|
Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#90 | |
Ex-Helpdesk Junkie
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 19,421
Karma: 85400180
Join Date: Nov 2012
Location: The Beaten Path, USA, Roundworld, This Side of Infinity
Device: Kindle Touch fw5.3.7 (Wifi only)
|
Quote:
I could see it as being quite reasonable to branch off the code and try to update it to work with the new paradigm for their other code that they did, somehow, end up changing. |
|
![]() |
![]() |
![]() |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Aura HD Stuck at 85% "Processing content" issue | Ken Maltby | Kobo Reader | 26 | 07-07-2013 03:59 PM |
Bug in Kobo processing of epub files causing hang in "Processing content" | BensonBear | Kobo Reader | 21 | 12-21-2012 05:47 AM |
Recovery from "Processing Content" | mutant_matt | Kobo Reader | 8 | 11-17-2012 07:05 PM |
Touch Stuck on "Processing Content" and can't transfer books from Calibre?? | Feisty | Kobo Reader | 33 | 01-22-2012 10:15 AM |
"Content locked..." with Kobo books - questions | JVM | Kobo Reader | 4 | 04-18-2011 08:58 AM |