04-19-2012, 08:29 AM | #16 | |
Connoisseur
Posts: 68
Karma: 116806
Join Date: Mar 2011
Device: Sony DPT-s1, Pocketbook 903, Onyx M90, iPad 4
|
Quote:
However, since these are scannerized files, vocabularies and other softwares that work with words or data of a text cannot be applied. Kindle Doesn't support DJVU files... probably because a lot of books that you can free download from the web are on this format. |
|
04-19-2012, 05:29 PM | #17 |
Member
Posts: 11
Karma: 10
Join Date: Apr 2012
Device: iCarus eXcel
|
Rephrasing my question:
Since Kindle DX (532 Mhz) is much faster handling PDF files than Icarus eXcel/Boox M92 (800 Mhz), could that be the result of software limitations? |
04-20-2012, 06:36 AM | #18 |
Addict
Posts: 294
Karma: 1196776
Join Date: Nov 2008
Location: Bulgaria
Device: Kindle 4 NT, Onyx Boox M92
|
Software matters, too. I use Linux on a dual-core processor and with the default viewer (Evince) it takes about 1-3 seconds to render the page of a scanned pdf (I think it takes more time when the document is initially opened, then it becomes faster). If I use mupdf as an alternative viewer, the rendering of the page is less than a second. Acrobat reader also renders the pages faster.
Last edited by slex; 04-20-2012 at 06:39 AM. |
04-20-2012, 08:10 AM | #19 | |
Connoisseur
Posts: 62
Karma: 1114
Join Date: Jan 2012
Device: Onyx Boox M92
|
Quote:
It is worth noting that the M92 pdf viewer uses Acrobat's engine, that is usually quite fast, so it's a bit odd it is so slow to flip pages. I thought it rendered and cached the next page... BTW it would be nice to have an alternative PDF viewer based on the mupdf rendering engine |
|
04-20-2012, 08:17 AM | #20 |
Wizard
Posts: 1,138
Karma: 1577094
Join Date: Oct 2008
Device: Samsung EB60, Onyx M92, Onyx Max2
|
|
04-20-2012, 08:43 AM | #21 | |
Addict
Posts: 294
Karma: 1196776
Join Date: Nov 2008
Location: Bulgaria
Device: Kindle 4 NT, Onyx Boox M92
|
Quote:
|
|
04-20-2012, 09:18 AM | #22 | |
Addict
Posts: 294
Karma: 1196776
Join Date: Nov 2008
Location: Bulgaria
Device: Kindle 4 NT, Onyx Boox M92
|
Quote:
---------- pdf2djvu filename.pdf -o filename.djvu This is quick and dirty, but it doesn't produce the best results (more suitable for conversion of text-based pdfs). The output djvu file is comparable in size with the original pdf. Variant 2 ---------- You can get better results if you put the pdf file in a separate directory. Then pdfimages filename.pdf f This will produce image files named f-001.pbm (if the pdf is black and white) ... or f-001.ppm (if it has color). You have several options from then on. Variant 2A ~~~~~~ Find a copy of the windows program DjVuSolo and install it with Wine. It works without any problems. Then, if you have PBM files, you can add them directly to DjVuSolo. You choose File->Open->Type of file (pnm file) and select the first page. After the first page is added you go to Edit->Append Pages->Type of file (pnm file) and select all the other files by clicking on the second image file, then press the down arrow while holding shift until you select all the pages (if there are lots of pages, the program might appear to freeze - it doesn't - just give it some time - sometimes more than 5 minutes for a full-sized book of several hundred pages). After all the pages have been added, go to File-> Encode as DjVu -> Bundled -> [give the file a name and write it with the extension .djvu] -> Resolution (leave the default value - the program usually detects it), e.g. 300 and choose Bitonal from the radio button options. The output file will be much smaller than the original pdf - probably 10 times - without visible loss of quality. Variant 2B ~~~~~~ Get the program gscan2pdf from the repositories. You can add pbm files directly into gscan2pdf. Then export as djvu. The file will be smaller than the original pdf - probably 2-3 times - but you are unlikely to achieve the compression in Variant 2A. Variant 3 ---------- Parse the image files through ScanTailor. This will produce the best results, but it is more time consuming. First, convert the files, so that they can be recognized by the program. mogrify -format png *.pbm Then add the directory with the png files into ScanTailor. After you process them, you will get files in a subdirectory called "out/". Add the files there into gscan2pdf or DjVuSolo. Export to djvu. |
|
04-20-2012, 11:31 AM | #23 |
M92 info archivist
Posts: 238
Karma: 1398
Join Date: Jun 2008
Device: IRex Iliad 2nd ed., Onyx Boox M92, Onyx Boox Max Lumi
|
|
04-20-2012, 12:42 PM | #24 | |
Addict
Posts: 294
Karma: 1196776
Join Date: Nov 2008
Location: Bulgaria
Device: Kindle 4 NT, Onyx Boox M92
|
Quote:
That is why I write only here and leave it to others to decide if the tip should be added to the wiki. |
|
04-20-2012, 02:39 PM | #25 | |
M92 info archivist
Posts: 238
Karma: 1398
Join Date: Jun 2008
Device: IRex Iliad 2nd ed., Onyx Boox M92, Onyx Boox Max Lumi
|
Quote:
My point would be that everyone looking for a solution may feel the same when (or if) he finds it in the forum... but that ignoring irrelevant points in a structured Wiki is easier than browsing through a large number of (often endless) threads to find a tiny bit of information. Most annoying situation in my eyes: I stumbled across some solution I didn't encounter the problem to, yet... and when I need it, I know that the information was 'somewhere' in a couple of dozen pages of 3-4 candidate threads. (even better when I don't remember the right keywords to search for it :P ) E.g.: I didn't feel the need to convert documents, yet Anyhow: I'd like to encourage rather posting pointless information (which will eventually be put into a substructure by somebody else) than using the forum as a primary documentation medium. That's why most of the wiki currently consists of links to the forum - just add pointers to relevant discussions... and I'd be amongst the folks appreciating it. Cheers. |
|
04-20-2012, 04:39 PM | #26 | |
Addict
Posts: 344
Karma: 6494
Join Date: Jan 2008
Device: PRS-505, KDXG
|
Quote:
It is possible to add a hidden text layer. The quality of the OCRed text would, however, depend heavily on the quality of your scans. It does not support djvu natively but you can use either Doukan or kpdfviewer for that purpose. |
|
04-23-2012, 07:36 AM | #27 | |
Connoisseur
Posts: 68
Karma: 116806
Join Date: Mar 2011
Device: Sony DPT-s1, Pocketbook 903, Onyx M90, iPad 4
|
Quote:
About the original question of dead115, I think it is also a matter of firmwares. (a little digression: until now i think software production still reaches is best in the western world... for example, even a big company like Samsung uses for its smartphones a western OS system, like google android or Windows 7/8... so, sorry if I'm too generic but no surprise if Kindle firmware is better and faster then Onyx firmware. On the contrary, hardware production now is an eastern matter, but soon they probably will be competitive even in software). |
|
04-23-2012, 05:44 PM | #28 |
Member
Posts: 11
Karma: 10
Join Date: Apr 2012
Device: iCarus eXcel
|
|
04-23-2012, 05:49 PM | #29 |
Addict
Posts: 320
Karma: 99999
Join Date: Oct 2011
Location: Germany
Device: Onyx Boox M92, Icarus Illumina E653
|
At the moment no. But anyone that's using qt and is compatible with qt-qws at least has a chance to run on our m92. though I don't know, how hard it really is to port e.g. libpoppler which is used by evince and okular ...
the mupdf engine doesn't look very promising concerning porting to qt, to be honest. I really doubt, that it's a good idea to put so much effort into porting it... |
04-26-2012, 07:55 AM | #30 | |
Mono
Posts: 699
Karma: 13333
Join Date: Jan 2012
Device: Boox M92
|
Quote:
In time of internet the programmer may sit on the other side of the globe than the headquauters. The main problem is the timezones (to be able to communicate in real time) and cultural differences. :-( |
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Syncing PDFs to M92 | eba | Onyx Boox | 18 | 12-17-2012 11:32 AM |
Reading technical docs (PDFs/EPUBs) on M92 -bugs and workarounds | PF4Mobile | Onyx Boox | 34 | 03-11-2012 08:39 AM |
M92: Academic Reading of PDFs - how does it work? | Defenderland | Onyx Boox | 22 | 03-03-2012 12:01 PM |
Boox M92. Spring Bargains. The price of M92 stays reduced for 14 days :) | Booxtor | Onyx Boox | 5 | 03-02-2012 05:16 AM |
Review of the Icarus Bluefire G2 tablet... | Hans de Cock | Alternative Devices | 5 | 07-06-2011 04:13 PM |