|
|
#1 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,462
Karma: 30087146
Join Date: Mar 2010
Location: UK
Device: Kobo Forma, Icarus, iPad Mini 2, Kobo Touch, Google Nexus 7
|
Converting CBR PNG pages to epub
OK, this is an odd one: I've been able to track down a CBR file of a document (derived from microfilm) which I want to OCR and convert to epub. It's 7 pages, one of which is an image to be used as a cover, the other 6 pages are PNG images of text.
What I'm after is a way to OCR the textual pages so they can be converted to an epub file. Calibre CBR > epub converts the pages as images, not text, which isn't much use to me as I want a reflowable document not a fixed format document. I can't seem to find a plug-in to do what I want, and the online converters aren't useful. Does anyone have any ideas on how to handle this? If the worst comes to the worst, I can manually type the text, but I'd rather not have to do that... |
|
|
|
|
|
#2 |
|
Wizard
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,462
Karma: 30087146
Join Date: Mar 2010
Location: UK
Device: Kobo Forma, Icarus, iPad Mini 2, Kobo Touch, Google Nexus 7
|
OK, managed it using an online OCR site. I still need to compare the output to the images, but it's good enough as a start.
|
|
|
|
|
|
#3 |
|
Evangelist
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 474
Karma: 3972002
Join Date: May 2013
Location: Ontario, Canada
Device: Pop_Os!, Kobo Forma
|
You will have to use an actual OCR program to go over the images and recognise the text, then proofread it and format it using a word processor like LibreOffice Writer. Then you can add the word processor file to Calibre and convert it to an epub. Put the image you want in the word processor file as the first thing, and it will become your cover. Use proper paragraph styling and the conversion should go very well.
Luckily, you have a CBR to start with. That is just a rar file of pictures, so un-compress it and you have your pictures. Then choose your OCR program, that's a whole topic itself. If you are on Linux you probably already have Tesseract installed, and you can find an app called OCRFeeder that will work with it to let you easily go over each picture, get the text, and copy it into your word processor. If you are on a different OS, I don't know, others can help you. EDIT: I was typing as you found your OCR. Everything can be done online these days. Good job. Now you just need to proofread, format, add, and convert. Last edited by retiredbiker; 03-04-2026 at 06:42 PM. Reason: Crossed messages |
|
|
|
![]() |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Issues with double pages for cbr/cbz to epub | hanxin | Conversion | 1 | 07-26-2016 01:24 AM |
| Calibre not converting CBR to epub | Savyna | Conversion | 1 | 10-16-2014 08:21 AM |
| Blank pages after converting cbr to epub | bdoertel | Conversion | 1 | 06-05-2014 01:17 AM |
| Newbie - after converting .cbr files, kindle skips pages | n64play | Conversion | 1 | 09-10-2013 06:34 PM |
| Touch Converting multiple .jpg/.png into ePub for Manga | Gater | Kobo Reader | 11 | 01-08-2012 08:28 AM |