Order it now! Amazon prioritizes orders on a first come, first served basis.


View Full Version : do-it yourself repro v-cradle for paper books


ereszet
09-16-2007, 06:17 AM
Materials: hardwood, glass, steel rods, nuts and bolts, black spray, black cardboard.
Cost (materials and work by a friendly handyman): less than $50
The attached two pictures say it all.

ereszet
09-18-2007, 04:08 AM
Part 1 was in my post Do-it-yourself repro v-cradle

1. Why a v-cradle and not a flat bed.
For two reasons: First, you cannot spread the books flat and if you do, the quality of reproduced pages will be compromised. It is especially important if you plan to OCR the book. Second, you avoid light reflections. You need only one lamp with a diffusor just over the v-cradle (picture attached). Lighting is the most difficult part of reproduction. Over the years, I have tried various setups with my semi-profesional Manfrotto repro stand and four lamps at 45 degree angle. It doesn't come close to an overhead lamp and v-cradle. Avoid any other light in the room or take everythig to your terrace and shoot at the sunlight with no artificial light.

2. Where to put the camera
On a tripod, at 90 degree angle to the book page. The camera and the tripod should be masked with a sheet of black cardboard to avoid reflections (picture attached). The best camera is black camera anyway. Same goes about tripod. I use now 8 and 10 megapixel cameras, but 5 megapixels are good enough for most A4 documents.

3. How to shoot fast

Quick and dirty: from hand (or tripod plus remote) if somebody turns the pages for you and keeps them flat (fingers may be visible at margins but you can erase them or the margins later). Takes less than 5 seconds per page.

Precise: from your laptop connected to the camera (My Canon Powershot Pro1 makes it easy - a picture attached). The camera can shoot automatically in regular intervals, so you need no help from your buddy. Takes up to 15 seconds to flatten the page with a glass weight, re-position the cradle if necessary, and shoot. About the same time is required per photo to download (USB 1.1) in a batch after the shooting session. Before shooting use a test page to fix the focus of the camera. Re-positioning of the cradle (rather than the camera), i.e. moving it closer to or further from the tripod, is necessary after shooting 20 or so pages if you do not want the opposite page to get into the picture (a viewfinder on a laptop screen is much better for positioning than using the one on the camera). Alternatively, you can place a black cardboard over the opposite page. Removing black margins aftewards is easy with proper software.

More to follow, if there is any interest.

ereszet
09-18-2007, 04:15 AM
Camera setup: view from the front and the back.
Light diffusor.

NatCh
09-18-2007, 12:08 PM
That's a very nice system you've got there, I suppose that if you had 2 cameras, and your software could handle it, you could shoot both pages without moving much.

This looks like a pretty viable alternative to the >$10k automatic system that someone posted about a year ago. I'm certainly a lot more likely to do something like this than shell out for the library-grade machine. :nice:

The book holder looks like it's the most complicated part of the assembly -- do you find that a particular angle is best for the shots? I'm wondering how much you'd lose by going with a fixed angle holding bed, you see. :chinscratch:

Thanks for posting these, ereszet. :yes:

ereszet
09-18-2007, 03:34 PM
That's a very nice system you've got there, I suppose that if you had 2 cameras, and your software could handle it, you could shoot both pages without moving much.

This looks like a pretty viable alternative to the >$10k automatic system that someone posted about a year ago. I'm certainly a lot more likely to do something like this than shell out for the library-grade machine. :nice:

The book holder looks like it's the most complicated part of the assembly -- do you find that a particular angle is best for the shots? I'm wondering how much you'd lose by going with a fixed angle holding bed, you see. :chinscratch:

Thanks for posting these, ereszet. :yes:

The system with 2 cameras (company name is Atiz) looks very nice and costs now much less but sitll a few thousand USD plus cameras. It is semi-automatic - you turn the pages by hand, but v-shaped glass/plexi? can be moved up and down without removing it, and the shots are taken automatically.

Apart from a v-shaped glass and a mechanism to raise it, my system does exactly the same plus a regulated angle, distance, and room for books of different thickness. Two cameras are no problem technically, they can both shoot automatically every nn seconds (at least my Canon can). The problem is quality. You should concentrate on making every shot picture perfect. Otherwise there is more manual work at the processing stage. Now I shoot all the odd pages and then all even pages just by turning pages and adjusting the distance from the tripod by a few milimeters every 20 or so pages.

The book holder is just two panels with two hinged supports plus a heavy base with a lot of grooves. This is my patent :) You can regulate the distance between two panels, the angle of each panel, and the distance between the cradle and the tripod. The regulated v design is crucial to ensuring the angle between the lens and the book page at 90 degrees plus avoiding any skewing. Precise correction of angle and skewing with the tripod controls is very cumbersome. Since I installed my system, I do not touch my tripod any more (I miss my 7x zoom when taking photos of my garden birds - like the barbet in my avatar - with other 3x zoom cameras).

As for cheap and dirty alternative you can make something similar from a cardboard. It was my first design. My wife used her scissors to make it in one hour or so. It worked perfectly. Remember, the devil is in the lighting and reflections, especially for color and glossy pages (or pages under glass). The disadvantages of cheap and dirty solution are: it is very light and therefore unstable, and you have to change tripod postion controls frequently to accomodate for various sizes of original pages, since moving the cradle further away creates a trapezoidal effect. You can avoid it with changing the angle in the v-cradle. The focus is quite forgiving in such a case (+/- a few milimeters difference in distance between the lens and upper and lower parts of the page do not show in repro).
The advantage of the cardboard cradle is portability.
Finally, you can always use the simplest solution: a friend or a document holder (picture attached) and soft flash plus a lot of processing to get pictures straight. You need a steady hand and a camera with a good automatic focus (out of my two Casios, ex-z850 does a better job than ex-z1000).

Thanks for asking NatCh.

ereszet
09-18-2007, 04:32 PM
The angle between the lamp and the book page can be anything (usually 45 degrees) as long as the reflected at 90 degrees light does not go into the camera lens.
For uniformity of lighting the best thing is natural light outside, an apartment corner with two windows at 90 degrees or a bright lamp as high as possible (a bright ceiling lamp will do) over the cradle plus a diffusor. The v-cradle assures the proper angle.
Typical setting of two or four repro lamps at 45 degrees to the book page may work with very expensive lamps that I cannot find even in best photo shops (however there are some ads on the net). But in my experience, even if you see no bright spots on the page, the reproduction will show them. Human eyes provide for automatic correction, while the camera is not forgiving at all (photographers who deal with RAW pictures know it too well).

ereszet
09-19-2007, 06:03 AM
Cradle distance and angles

First picture to the left is a setup with camera lens at 90 degrees to the book page placed in the v-cradle.

The middle picture shows that moving the v-cradle closer to the lens helps to maintain 90 degrees angle for pages of bigger sizes. The zoom in the camera has to be adjusted accordingly.

The last picture shows what happens when the distance between the cradle and the lens stays the same despite a bigger page size. The distances between upper and lower parts of the page and the lens differ considerably, which causes a trapezoidal effect.

Not shown in the pictures but an obvious solution for bigger pages is to change the lens angle in order to keep the focus in the middle of the page. But when the lens angle changes then it is no longer 90 degrees to the page. That is why you can change the inclination of v-cradle panels.

ereszet
09-20-2007, 06:40 PM
Cost: nothing (just a diamond ring for my wife, but I would buy it anyway)

Enclosed pictures are worth a thousand words (and bucks if you had considered to buy an expensive substitute :) ).

phrodod
09-20-2007, 07:16 PM
Cost: nothing (just a diamond ring for my wife, but I would buy it anyway)

Enclosed pictures are worth a thousand words (and bucks if you had considered to buy an expensive substitute :) ).

This one looks doable, even for me. Does anyone have OCR software recommendations that work with camera images?

Phrodod

ereszet
09-21-2007, 04:04 AM
This one looks doable, even for me. Does anyone have OCR software recommendations that work with camera images?

Phrodod

Finereader 8
Takes images with resolution from 96 dpi upwards (I tried 72 dpi with success). Ingenious recognition algorithm with all imaginable languages. Splits double pages. Converts to black and white. Can save blocks of text and images only (removing any useless white or other space around - requires some trick though to do it in batch). Finally, saves as text under picture pdf format.

IMHO The best available OCR software. I have bought it. I put my money where my mouth is.

BTW I am considering posting some hints on processing photos to a book format but I am not sure if this is a right forum/thread. Any ideas?

Szczurek
09-21-2007, 08:14 AM
That's a very nice system but do you use special software to handle it.
Can you give us hints to make the best results.

ereszet
09-21-2007, 08:44 AM
That's a very nice system but do you use special software to handle it.
Can you give us hints to make the best results.

You either take time to position the book right in the cradle (no distortion, no background behind the page, uniform lighting - just rectangular page in your viewfinder) and then no processing is required apart from conversion of images to pdf or djvu with any freely available software. Or you shoot fast with the black background and don't care about photo quality, which you improve later with proper software. I prefer shooting fast because processing later is fun.

In both cases free Google Picasa can be very useful (downloading the pictures from the camera, automatic contrast and colour corection in batch processing, individual deskewing and cropping, etc.). Free Irfanview may be useful as well, especially for batch conversion between image formats.

I will post my suggestions for more sophisticated photo processing software (for low quality photos like the ones offered by google books in pdf format or a number of digital libraries in djvu format - you know dLibra, don't you) over the weekend.

ereszet
09-22-2007, 03:39 PM
Before photoscanning hundreds of pages, it is worth making sure that the lighting is uniform. Neither your visual inspection of the original paper document nor the camera LCD display or the computer screen (if you shoot from a camera connected to a computer) are good enough to see the bright and shadow areas due to non-uniform lighting (unless shadows and bright spots are very discernible).

There is a commercial program (Psremote) that in its preview mode can highlight the areas that are too bright according to a selected level of luminosity. However its use is limited to cameras that can shoot from the computer and it can only show which areas are too bright at a chosen threshold rather then areas with different levels of brightness.

I deviced a method which is simple and effective to check the lighting uniformity before shooting hundreds of photos.

You just take a test photo as jpg and compress it aggresively using Irfanview or any other similar progam. On the compression scale of 100 (best quality) to 1 (lowest quality) you may choose 10 or lower quality. The resulting compressed jpg will show "clouds" or areas that are similar for the sake of compression. For our purpose, it shows areas of pixels with similar luminosiity. By forcing a very low quality, we can see "clouds" even in a photo that is otherwise almost perfect. Once we know which parts of the test photo are under- or overlighted we can: move the cradle, move the lamp, use a curtain or a black bristol paper to block the light from reaching the original at various points and angles.

The attached pictures show:
Daylight.jpg - a test photo taken in the middle of the room with two windows and various objects in the way of light (the gray line along the photo image is the feature of the original paper page).
Daylight10.jpg - same photo compressed at the level of 10, showing the clouds of non-uniform brightness
DayTungsten.jpg - like daylight.jpg but with a ceiling and overhead lamps switched on (I do not recommend this setup, but I have no way to stop sunlight during the daytime - blinds are too transparent. Usually i shoot at night.
DayTungsten10.jpg - same photo compressed at the level of 10

You wll notice that the better quality DayTungsten.jpg shows some non-uniformity. It is due first of all to the sunlight but also to the fact that upper part of v-cadle panel is closer to the overhead lamp than the lower part of the panel. That is why I recommend using a bright ceiling lamp only. At at distance of 1.5 or 2 meters (depending on how high is your ceiling), the 20 centimeters between the upper and lower part of the cradle panels make little difference. You can also put a white paper (you can experiment with its dimensions) on the panel opposing the original page, and it will act as a kind of mirror, directing more light to the lower parts of the original. One method of achievieng quite uniform light is using the camera flash directed to ceiling. The light reflected from the ceiling is more dissipated. I even experimented with double reflection, i.e. flash directed to --> a sheet of bristol paper (or a sheet of matt aluminium) --> ceiling --> v-cradle.

If you are a little ovewhelmed with all this, remember that one ceiling lamp will usually be ok. The brighter is the lamp, the easier it is to get the white balance (i.e. to make the white pages white rather than gray, yellowish or pinky, while keeping the print black) in the processing afterwards. I am not going into RAW processing, white balance/ISO/camera setting problems or commercial heavy weight and costly software to process your photos. I concentrate on fast shooting and batch processing. But if you plan to convert all your family photos to digital ones, you may want to learn about RAW image format and all the tricks of professional photographers. There is abundance of advice on that on the internet but rather little of what I give you here in terms of convenient fast practical arrangements.

Please note that the paper originals were put under the glass but the photos show no reflections thanks to my v-cradle design. Better results can be achieved with proper lighting, zoom, and processing (see Tungsten processed.jpg).

The question is: do we need to make the lighting perfectly uniform? My answer is:
yes, when we deal with color originals;
rather yes, when we deal with gray photos;
not necessarily when we convert the pictures to mono.

Unless you are a perfectionist and the original is in blue dark, my setup is very forgiving even with bad lighting.

BTW. There are some commercial programs that can process the photos and help you make the lighting more uniform, but in my opinion they require fine tuning for each and every photo individually. A proper test and light setting before taking pictures is the only way to avoid disappointment in the final stage of book processing.

ereszet
09-23-2007, 04:20 PM
How it's made?
It is my wife's plastic compact powder case with mirror. I made a hole in the cover opposing the mirror to hang the deflector on the camera (I used a red hot nail to make the hole). If the light reflected by mirror and ceiling is not uniform enough even with soft flash, I cover it (the mirror) with a small piece of paper or a piece of matt aluminium, which dissipates the light even more.

Does it work?
Yes, for black and white book pages or pages with light colors (even glossy). For pages in dark colors, the camera will cast a barely visible (but annoying - if you are a perfectionist) reflection of itself on the original paper page in the v-cradle (the camera faces the original page at 90 degrees). That's why I usually obscure the camera with large black paperboard, which of course prevents the use of camera flash, and I stay with my proven overhead lamp (which is 45 degrees to the original book page and 45 degrees to the camera lens).
The advantage of compact flash is portability. You can use it anywhere together with a tripod and my "quick and dirty" portable cardboard v-cradle (see my earlier posts).

Cost: a gift to your lady, a Dior powder case perhaps?

ereszet
09-24-2007, 06:51 PM
For software tools to process photoscanned images see post #177 (Software tools to convert paper documents to lrf with thanks to cacapee for pdflrf) and the following discussion in "Yet another PDF to LRF converter" thread in Reader Content forum.
To find post #177 use the linear method for browsing.

igorsk
09-25-2007, 04:57 AM
To get a link to a specific post, click the #NNN link at top right of it:
http://www.mobileread.com/forums/showpost.php?p=99716&postcount=177

phrodod
09-25-2007, 07:31 PM
ereszet;

Thanks for the posts on the Cradle, Cardboard Cradle, Detailed Instructions, Masking Instructions, OCR Recommendations, Lighting Perfection Tips, and Software Tools!

This is quite a huge, detailed, beautiful set of instructions. Thanks!

Any chance you want to take it the next step and add it to the wiki?

You've provided an incredible tutorial for the rest of us.

:2thumbsup:2thumbsup:2thumbsup:2thumbsup:2thumbsup

Phrodod

ereszet
09-26-2007, 01:25 PM
ereszet;

Thanks for the posts on the Cradle, Cardboard Cradle, Detailed Instructions, Masking Instructions, OCR Recommendations, Lighting Perfection Tips, and Software Tools!

This is quite a huge, detailed, beautiful set of instructions. Thanks!

Any chance you want to take it the next step and add it to the wiki?

You've provided an incredible tutorial for the rest of us.

:2thumbsup:2thumbsup:2thumbsup:2thumbsup:2thumbsup

Phrodod

I am glad to here the good words from you and I welcome your suggestion to produce a wiki tutorial. However, I am afraid that I cannot devote enough time for that in the near future. If you or anybody else would like to use my posts to produce such a draft tutorial (e.g. From paper to reader: hardware and software tools), I would be only to happy to contribute to its next version. In the meantime, I will post only a few more hints on the hardware setup in this thread (after all, I see no more problems to solve unless somebody asks for advice), and more on preprocessing software in the cacapee pdflrf thread forum.

Please note that I am neither a professional photographer, nor a reprographer, nor a software developer but I try my hand in a number of fields beyond my call of duty, which covers a lot of areas. It is to be more efficient in my professional life (and pastime as well) that I follow or sometimes precede new developments.

ereszet
09-26-2007, 01:33 PM
1. Do-it yourself - see http://www.frontiernet.net/~rjacob/copystnd.htm. The picture I attach here is from that page.

2. Semi-professional copy stand by Manfrotto (Bogen in the US).
I guess that this is something similar to what some US university libraries offer for the use of students (It's been quite a few years since I last time lectured in the US, so I am not current on that).

I bought the Manfrotto in Italy a couple of years ago together with lots of accessories:
micro ball head, a mini laser level (that I attached to camera flash rail to aim the lens at the middle of the document page), extra 500 watt halogen lamps, reflector lamps, a variable friction arm, micrometric plate, various brackets, quick change plate adapter, multiclips (to hold black sheets protecting from side light), etc.

I was quite proud of the results achieved with that setting, although I was never happy with the lighting and shadows.

Since I am now a few thousand miles away from home, I had to device something provisional. So I designed and constructed a repro v-cradle as a temporary solution. It proved better than the flat bed copy stand for at least four reasons:
- it is ideal for books, magazines, etc. because of the V shape;
- it allows to position the original paper document at any distance and angle to the lens without touching the tripod or other camera stand;
- it solves the problem of lighting (see desperate solutions for flatbeds in the attached pictures taken from the internet);
- it does not require a tripod or a stand with precise positioning, anything that holds the camera steady at about 45 degree angle to the horizon level will do.

3. Professional reprographic systems. There are dozens of them with sky high prices. They may be right for automated reproduction of thousands of pages per day, but not for quality of each and every page. I assume that google books uses something like that and the results are sometimes disastrous (see yourself at google books).

I believe that with my repro v-cradle setup, I can match quality of any professional reproduction and I can photoscan hundreds if not thousands book pages a day (individual sheets of documents are just too easy to photocopy to mention).

Studio717
09-27-2007, 11:50 PM
Thank you for this! I have an old copy stand stashed away. Definitely time to pull it out. :D

ereszet
09-28-2007, 07:06 PM
Shooting photos and processing them takes time but it is not necessarily my time, I let most of the burden to my computers.

I work in my home studio at a desk with my old laptop plus an infrared external keyboard (I have to get a radio keyboard one day) and a Linksys modem/router. In another room, I have a desktop computer with dual core processor. The computers are connected as a LAN through an Ethernet cable but wireless works as well (I use wireless to surf with Archos 704 wifi when I am in bed). I use Windows terminal services to operate the desktop computer from my laptop.

To my right, on another desk is my photoscanning setup: a tripod with a camera (connected to the laptop) and the v-cradle. I can turn document pages in the cradle without moving from my chair and I shoot photos with my mouse. The shooting is the only time when I need to give my full attention. After that I switch to the other computer and start the download of pictures from the camera to the computer. It takes time, but in the meantime I work with my laptop doing other things. Next, I run all the picture processing programs (like OCR, conversion to pdf, and pdflrf) on the desktop computer. It does not take my time at all. I work with my laptop, and only occassionally I switch my screen to see what is going on with the picture processing.

I do not mention all the external hard disks, a maze of cables, power supplies, UPS towers, US, various European and other adapters for different plugs, usb extensions, card readers, etc., plus a big fan to cool my laptop.

Somehow it all works most of the time. Every evening for half an hour or so I photoscan the documents that I want to have in my computer, then I OCR and index them. I have not yet met anybody else who runs that kind of paperless office (now with the Sony Reader and Archos 704 extensions). But I assure you that it works (until hard disk crashes).

Studio717
09-30-2007, 01:03 AM
Can you be a bit more specific about the actual image-taking stage? Do you use the glass on each page?

Do you:

- place the glass on the page to be photographed
- focus the camera to the right or left page (I realize this is done at the beginning, but not continuously)
- assuming all the computer stuff is set up, take the photograph
- lift the glass, turn the page, replace the glass
- take another image of the left or right page
- continue until all left or right pages have been photographed, then repeat the process for the opposite pages
- clean up images in software
- collate images
- save as pdf (or whatever)

Is this how you're doing it?

I'm asking because I thought I understood, but when I went to step through the process, I realized how much I didn't understand.

Thanks for sharing this. I have a large (quarto-sized, I think) book that is very heavy for a traditional scanner (both for the scanner and for me!), and your method looks like it will help me a lot.

ereszet
09-30-2007, 06:56 AM
Can you be a bit more specific about the actual image-taking stage? Do you use the glass on each page?

The heavy glass serves to flatten the pages. It is five millimeters thick. An anti-glare glass would be nice, but I use just ordinary glass and it is ok with my lighting arrangement.

For single sheets of paper documents it is not necessary to use the glass unless the sheets are creased, like the ones that come folded in the envelopes.

In case of multipage documents, magazines, and books, I manually set the focus for the first page (i.e. I let the camera to focus automatically and if I am happy with the result, I fix it for the rest of the pages). The automatic focus for all the pages can work as well but some of the pages can have no print in the middle, where camera focuses, and the image will be blurred. If the first or cover page is like that and the camera cannot get the focus right, I use any printed page to put it over the book in the cradle to set and fix the focus before photoscanning the book.

My Canon connected to a computer allows me to zoom, set the focus, and change all kind of camera settings without touching the camera itself, with the results seen in a compter window (in a viewfinder, preview or downloaded image window).

For my other compact cameras (both Casio - 8 and 10 megapixels), which cannot shoot from the computer, I use a universal remote trigger that can be attached to the camera (I bought it in a photo accessories shop). I let the camera set the focus automatically for all the pages, however for the pages that have nothing to focus on in their middle, I put a half sheet of printed paper over the blank part to make the focus possible. The image has to be later cleared of that manually at the processing stage.

As for the zoom, it is a good idea to set it to such an extent that the whole page of the book (with no background visible) is in the picture frame. However, not all paper documents or books have the format proportions of the camera photos, so you wil have a background in the view whatever is your zoom, unless you zoom in to see only a part of the original image. With thick books it is advisable to adjust the cradle position every 20 pages or so in order to get the picture without the opposite page getting in the view (the middle part of the book moves a little with the turning of pages).

The background itself is a v-shaped black matt piece of paperboard that I put in the cradle. It is black because some programs make it easy to remove the black background automatically for mono images (it is much more tricky for color images).

For single sheets of documents that are not in color, there is no need to worry about zooming and background. The black border background can be taken care automatically in batch in the processing stage.

As a matter of fact, I am too impatient at the photo taking stage to adjust the cradle every so often. I let the black background margins and a fragment of the opposite page to be in the frame and I cut the margins in the processing stage. For that (cutting margins) it is important that the book pages are always in the same place in the cradle. Therefore, I use a "delimiter", a v-shaped stiff black piece of carboard one or two centimeters wide, one or two millimeters thick (v wings about 2 centimetrs high) that I put at the edge of my black backgroud paper in the cradle, and I always push the book against it.

Once the position of the cradle, camera zoom and focus are set, I shoot all the odd pages first by turning the pages and flattening them with glass, and then I reverse the book and shoot all the even pages. After downloading the images from the camera, I put the first half of images (odd pages) in a separate folder and the second part (even pages) in another folder. Now I can cut the margins in batch automatically - separately for odd and even images (there are programs that can set the cutting parameters different for odd and even pages in one batch). Next, I rename the images in the folders automatically using Irfanview, so that odd images are 0001.jpg, 0003. jpg, etc., and even images are 0952.jpg, 0900.jpg, etc. The even images are in the reverse order since the book was reversed in the cradle for the shooting. I do not bother with pages being in horizontal or upside down position. My OCR program takes care of it. However one can rotate all images in the batch (separately for odd and even pages) automatically with Irfanview of Picasa (recommended).

Now, the two batches go into one folder, and can be processed by OCR or any other program and converted to pdf.

For small booklets or magazines I do not shoot odd and even pages separately but I reverse the original after every shot. Than I crop margins manually in Picasa.

For single sheet documents the whole workflow is much more simple. Just taking a shot a replacing the sheet. Five seconds per page. After downloading images from the camera (Picasa recommended) you can review them and improve by cropping and fine tuning if you wish.

BTW. Apart from anything else, the scanners are much too slow for book copying. They are really good for a batch of single sheet documents fed by an automatic feeder.

ereszet
09-30-2007, 01:21 PM
For that (cutting margins) it is important that the book pages are always in the same place in the cradle. Therefore, I use a "delimiter", a v-shaped stiff black piece of carboard one or two centimeters wide, one or two millimeters thick (v wings about 2 centimetrs high) that I put at the edge of my black backgroud paper in the cradle, and I always push the book against it.

In one of the next post I will show how to take photos with a small compact camera not connected to the computer.

ereszet
09-30-2007, 01:28 PM
[QUOTE=Studio717;101293]Can Do you use the glass on each page? /QUOTE]

For really fast shooting, I set the focus of the camera and a time of i.e. 10 seconds for automatic shooting. Then, I only turn the pages and keep them flat with my fingers. The fingers in images can be later removed by trimming the pictures in batch. A book per hour is possible.

Studio717
10-01-2007, 07:49 AM
ereszet,

Thank you for this! I'm considering making a cradle out of foam core. Plenty of strength and yet easy to cut (at least I'm hoping so - on both counts). I have both an old copy stand and a small light tent, so once the cradle is built, it should be a matter of putting it all together and tweaking.

I'm looking forward to your next 'installment'. Thanks for taking the time to do this.

ereszet
10-01-2007, 09:04 AM
ereszet,

Thank you for this! I'm considering making a cradle out of foam core. Plenty of strength and yet easy to cut (at least I'm hoping so - on both counts). I have both an old copy stand and a small light tent, so once the cradle is built, it should be a matter of putting it all together and tweaking.

I'm looking forward to your next 'installment'. Thanks for taking the time to do this.

A foam cradle should work with two improvements.

1. Glue it to a heavy base, so that it doesn't move easily.
2. Make some wedge panels of different angles (the foam should be ok for that) to put them either under or over the cradle to avoid trapezoidal/perspective effect. The v-cradle should be at an angle greater than 45 degrees to make it possible to use the wedges. The wedges can make the angle smaller but not larger, unless you put them under the cradle at the opposite side of the book page (in that case you can use anything, e.g. for my cheap and dirty cradle I used books of various thickness under one end of the cradle to change the angle.

Attach the pictures of your setup, when it is ready.

For my do-it-yourself v-cradle I am going to add four bolt legs, to regulate precisely the horizontal position of the v-panels to match that of the camera lens. In theory a small level tool should help to get both the camera and the cradle in the right horizontal position, but in practice the camera postion changes a little when you fasten the ball head (unless you buy a precise and expensive 3-dimensional ball head for your tripod or copy stand). That is why a preview on a computer screen is the only way to make sure that the picture is a perfect rectangle.

ereszet
10-02-2007, 04:10 PM
Can you see the differences in lighting at the margins of the original picture? Hardly so.
But the computer can, as displayed in the decorative frame of the second picture.

For my example how to remove black background see my post #223 in the thread "Yet another PDF to LRF converter" (Reader Content) (http://www.mobileread.com/forums/showthread.php?p=102036#post102036).

user
10-04-2007, 08:45 AM
I am very interested in your project

can you tell me please some info: what camera you use, what lighting, what equipment, is OCR satisfactory? like scanner's scans?

thanks

HarryT
10-04-2007, 09:04 AM
If you read from the start of the thread, user, you'll find all your questions answered.

user
10-04-2007, 11:13 AM
I didnt find anything about the camera

ereszet
10-04-2007, 03:07 PM
Yes, this thread is kind of a tutorial, but nevertheless it is nice to get a feedback from the readers. So here is summary of some points.

1. Scanner vs. digital camera

Scanner is ok for single page documents, and a scanner with a feeder is very good for a pile of single sheet documents. Scanners can also be used for scanning your old photos to digital form. They do not require any lighting arrangements and take care of the white balance (usually produce white background as white - but not always). However, getting the right colors with a scanner requires as much care and processing as with a digital camera. You can learn a lot about scanning from the author of Scanview and the links on his pages: http://www.hamrick.com/vsm.html

I have a HP scanner on my desk that I don't use anymore (I dont't use a photocopier anymore either). The main reason is that the scanner is very slow in color mode. The mono mode is faster but still too slow for me. The other reason is that you cannot spread flat a book in the scanner even if you use a heavy weight on the scanner's cover. If a book page is not flat, the lines of text will be curved, and OCR will be unreliable.

Sometimes (e.g. for professional printing) you may need a scanner resolution of 1200 dpi or more. Few (very expensive) digital cameras can match that resolution for A4 pages today.

2. Digital cameras

I started with a 1.2 Mpixel digital camera only five years ago. Totally useless. (Even before that I had an opportunity to casually talk about the idea of using digital cameras as scanners with some high level HP executives. They didn't "get the picture".)

My next camera was 5 Mpixel Canon PowerShot G5. Image sensor 1/1.8 inch CCD. Image size up to 2592 x 1944 pixels. Very good.

My current camera is 8 Mpixel CanonPowerShot Pro 1. Image Sensor 2/3 inch CCD. Image size up to 3264 x 2448 pixels. Canon does not want my money for the successor of Pro 1. Apparently it competes with high priced reflex cameras.

Most Canon cameras can be connected to a computer to shoot pictures (not just to download them). Most reviewers negelect this advantage and also do not understand the need for higher resolution and never refer to repro applications in their reviews. While 8 Mpixels are good enough for A4 pages (210x297 millimeters) and more than good (with zoom) for A5 pages (you can calculate the resulting resolution from image size and paper size), it would be good to have as high resolution as possible for A3 and even A2 pages. As a matter of fact with my Canon I can shoot a picture of a distant object over the heads of the crowd and read the text which I cannot see well with my eyes. It is also the only way to read small print instructions that come with some products (recently I used my camera LCD to read a serial number on my Archos 704 wifi). The higher resolution (in terms in Mpixels) cameras should come with larger sensors. Higher resolution with the same size sensor is sometimes a disadvantage for repro applications. A nice feature for repro applicatons would be the manual focus. It is very awkard in my Canon (requires three hands).

When shooting, always use a neutral mode (no color or contrast improvement in the camera - you can always improve it with proper software) and the lowest ISO possible at a given lighting condition. In my experience, it is better to underexpose than to overexpose. Always take care of proper focus. For all practical reasons, you cannot improve on that once the shot is taken.

As for cheaper compact cameras, it looks from the specification that 10 Mpixel Canon A640 or a succesor may be good for repro applications.

There are cameras available with a special text mode. It is just a selection of camera settings. Good for novices.

My other two cameras: 8 Mpixel Casio Ex-Z850 and 10 Mpixel Casio Ex-Z1000 provide the text mode. They have a very nice 9 point focusing screen. Since they use the same size sensor, the 8 Mpixel Casio is better for repro applications. It is more difficult to focus with the higher resolution Casio.

3. Lighting

With the v-cradle design use a bright light as high over the cradle as possible. A diffusor to dissipate lighting and make it more uniform is useful. A soft flash (possibly redirected against the ceiling) can be used for black and white pages. The most uniform lighting you can get is the sunlight outdoors, with the original paper document at a proper angle to the sun..

4. OCR

Abby Finereader 8 is a marvelous piece of software but sometimes it is vicious and puts strange and obscene words in the recognized text (it is just the way it interprets the picture). It is very risky to use the OCR-ed document without correcting the misspellings manually. I never waste time for correction since I produce the output as the picture-true pdf copy of the original with OCR-ed text layer underneath. In that way you can index and search the documents in your computer and always read and print a perfect copy of the original document.

Studio717
10-04-2007, 03:36 PM
I've done some research on the web about cradles and discovered that rare book libraries use several different kinds, including book pillows. I'm going to do some experiments (I hope this weekend) to see if these alternatives will work for this type of 'scanning'.

My camera is a 7.1 megapixel Olympus C-7070 with a good macro capability. I'm hoping it will work. I have a remote for it, which will help I think, but the software I'm going to have to look into further. No doubt the solution will be PC only, which is always annoying but not a deal killer. (I usually use Macs, but I have PCs.)

For OCRing I usually just use Acrobat. I used to use a version of Iris but there were limitations that I just didn't want to deal with (they are probably gone in later versions, but I haven't looked). I've also used Omnipage in the past, but not at all recently. My biggest issue is that for many books OCRing is a waste of time. The 18th century long 's' totally, uh, messes up the results.

I'll be printing out ereszet's instructions and trying to replicate with on-hand equipment before buying (or building) new stuff. This looks like a great solution to the problem of scanning page by page in a scanner, even one as useful as the Opticbook (also PC only :disappoin ).

ereszet
10-04-2007, 05:43 PM
A book pillow? Never heard of that. I can imagine a black soft pillow as a "cradle" for books but it would be a daunting task to keep the book pages at the same position for sequential shots. What about a glass to flatten the pages? Can you give more info on that?
As for OCR, you apparently use it for English texts which are so easy that Acrobat Capture (and even free ocr programs) may be ok. I ocr all kind of languages (even old Russian cyrilic) and books as old as 17th century. I tried Omnipage, Iris, Textbridge, an IBM program whose name I forgot, and any demo/evaluation I could find on the internet. There is no match to Finereader (they even offer a version for Gothic Fraktur and Old European Script but it is too expensive for personal use - apparently they sell it to libraries).
OCR is crucial if you search thousands of books for specific terms and purpose, like genealogical research. You just cannot read thousands of books page by page.

Studio717
10-04-2007, 09:56 PM
Here's the first instance of a soft book cradle, complete with instructions on how to make it:

Velvet book cradle (http://aic.stanford.edu/sg/bpg/annual/v04/bp04-01.html)

And here's the site that sells the book pillow:

Book support bag / cushion (http://www.conservation-by-design.co.uk/sundries/sundries35.html)

There's also the more traditional library cradle:

Acrylic book cradle (http://www.gaylordmart.com/adblock.asp?abid=8606&sid=82B2B66A1E184839BDC8F939DFE873&search_by=desc&search_for=cradle)

That last one does not have adjustable sides, so that would have to be dealt with. (There is another model that has front 'stops' on it to keep a book from sliding.)

The last cradle (I should say cost-effective cradle) was one made for book binders. A sewing cradle is very close to what's needed, though I think the sides may be a bit too steep and, in the ones I saw, the sides weren't adjustable.

I saw some of the more commercial cradles for digitizing and wish they weren't so expensive! The most common one seems to be flat blocks of varying heights, where one side of the book can be raised and lowered depending on where the book is open to. This would be especially nice for thick books.

If I understand the process clearly, the main concern is to keep the plane of the camera lens even with the plane of the page being imaged, along with keeping the book steady. To keep the efficient flow ereszet has worked out, it's necessary to not have to readjust after each image, so those are the two things I keep in mind when evaluating the usefulness of the cradles and pillows, etc.

I really appreciate the work ereszet has done. It's amazing!

(And you're right, I'm only doing English books - with a few scattered French ones - and nothing earlier than the 18th century. After reading your list, I realize I have it easy! :grin2: )

user
10-05-2007, 02:54 AM
I am afraid I will have to echo what everyone else replied in this thread: mr ereszet your work in marvelous

I was thinking weeks ago something similar to what you made, but a bit different, here is my idea:

first, there are two ways to digitize afaik: cameras and scanners

the camera problem is that cameras cant create an image of the quality of a scanner (low resolution, much distortion, many artifacts, needs specific lighting conditions, specific camera angles, camera needs megapixels 35MP or more to reach scanners' 300dpi resolution) and we need expensive equipment to achieve results as a scanner easily does, however they are very fast

the scanner problem is that pages cant be flat, which results in distortion at the edges and that it takes too much time to scan each page, however the scans are top quality

my idea is about a scanner that will scan books easily, fast and accurate, without damaging the book

it will scan the book with best quality including the area near the binding
it will scan the book in the half time of a scanner, since it will scan two pages at once

this scanner will look as if you take two flatbed scanners and bind them as a book

here you can see a scheme of it:
http://i17.tinypic.com/4yjykpy.jpg

you can imagine it as an A3 scanner (that scans both pages of an opened book) but it will be foldable

the scanner I propose wont damage the book at all, since the book will be in its normal position as when you read it

you may also look at this picture:
http://www.diy.atiz.com/images/new_site/cladle.jpg
http://www.diy.atiz.com/diy_detail_info.php
http://www.diy.atiz.com/images/new_site/component/pic_self-stabilizing_cradle.gif
http://www.diy.atiz.com/diy_components.php?page=V-shaped,_self-stabilizing_book_cradles

it looks like the scanner I propose, but these glasses dont scan, they are only used to eliminate the shadow and to make the bookpages flat so that a digital camera will take a picture of the pages

opticbook solution is not adequate imo, although I havent used it
it is just a simple scanner with small border:
http://www.plustek.com/product/images/SEE.jpg

microtek scanner (http://www.mustang.com.hk/bottons/Microtek/scanner/s280.htm) has the same 'small border feature' plus its slim, so these scanners maybe the appropriates to create the dual scanner I say

it looks simple, however I dont know if it can be implemented and if there will be any benefit

looking forward to hearing your opinion
thanks

ereszet
10-05-2007, 05:57 AM
I am proud to see that both the velvet and the acrylic cradle can not match my design in terms of precision positioning and flexibility. You can add a soft padding to my v-cradle panels if you wish. With my new improvement (four regulated bolt legs) my v-cradle will allow to change horizontal and vertical angles to any precision required without touching the tripod. And it allows you to shoot very fast without adjusting the postion for every page. Instead of glass you can use your fingers or a clip or a transparent strap.
If I had a few thousand bucks to spend, I would buy an Atiz two camera v-cradle, but my design does the same (with less automation), with more precision and flexibility.

user
10-05-2007, 06:09 AM
maybe a plate that will stand on an axis which will allow the plate to rotate it self 180o (and the book on it) would make more easy to shoot the next page

user
10-05-2007, 07:07 AM
although it will be the first time I will shoot a book, I cant say all the work ereszet is proposing is needed

this simple and cost-effective scheme will be more than enough imo, since it enables you to make any adjust in the shooting angle you may ever want to make, but ofcourse please feel free to make any comments

http://i23.tinypic.com/2yuww3n.jpg

ereszet
10-05-2007, 09:03 AM
Anything similar to what is in your drawing will work. I tried before and suggested a number of substitutes in my previous posts. When I am in my office, I don't bother to go to a copier. I frequently shoot a paper document placed on a table over a black background, keeping a camera in my hand. As I have already mentioned, I run a paperless office. Fifty or so pages of documents that come to my desk every day can go to my camera and my computer in a matter of 15 minutes. Any distortions I can correct later with batch software processing, but I prefer to do the photos right from the very beginning with the v-cradle in my home studio).

For precise positioning and fast shooting of a book you need more than your design. When trying to aim and zoom your camera from a tripod you will learn how difficult it is to adjust the tripod angles precisely (in three dimensions). I did it once and I don't touch my tripod and camera anymore, I can just move my v-cradle forward and backward, left and right, up and down, change the angle of v-panels (in the next model, even precisely change the length of micrometer legs). Also remember that you need to obscure your camera behind a black curtain or a paperboard, otherwise the camera will be reflected in a glass or a glossy surface of the (color) paper document/book.

ereszet
10-05-2007, 09:12 AM
A rotating plate was my idea for shooting two opposite pages one after another. A marble rotating plate for serving cheese can be all right for that. However you would need a v-shaped glass and a book with wide margins between the pages, beacuse a joint of the glass would obscure the middle part. What's more, turning the rotating table 180 degrees will take about the same time as turning a page. On top of that, the positioning of opposite pages will be different for books thicker than pamphlets. So I dismissed the idea, and I shoot all the odd pages first and even pages next.

user
10-05-2007, 09:25 AM
nice

since you are more experienced, can you post here some:

1) program macros for batch image correction (what to correct)
2) calculations for the optimum angles, camera distances, amount of lighting (I suppose prof photographes may help)

something else, your camera costs over 1k USD, will cheaper 12MP cameras will do the job? anyone tested?

also, has anyone tried to work with any of these (http://www.amazon.com/s/ref=nb_ss_p/103-1905967-0107833?initialSearch=1&url=search-alias%3Dphoto&field-keywords=copy+stand&Go.x=0&Go.y=0)?
they are supposed to work well, since many industrial strenght book scanners, look like these, eg (http://www.imageware.de/)

last, I suppose all people occupied with book shooting have tried the snapter program from atiz, JFYI

ereszet
10-05-2007, 09:36 AM
the camera problem is that cameras cant create an image of the quality of a scanner (low resolution, much distortion, many artifacts, needs specific lighting conditions, specific camera angles, camera needs megapixels 35MP or more to reach scanners' 300dpi resolution) and we need expensive equipment to achieve results as a scanner easily does, however they are very fast
.....
this scanner will look as if you take two flatbed scanners and bind them as a book

With 10 Mpixels camera you get over 300 dpi for A4 format, over 600 for A5, and over 1200 dpi for your collection of small print business cards.

Cost is minimal. Just a tripod and do-it-yourself v-cradle. Today, you need a digital camera anyway on all kind of occasions.

A v-shaped scanner plus automated turning of pages (mechanical or pneumatic) would be a breakthrough. HP already offers some see-through scanners, and that technology would be appropriate for the v-shaped design. I hope somebody from HP or their competitors will read this and make it work (you should better go to the Patent Office before them).

With all that, the v-cradle will be a choice for home and non-corporate users, and will help you copy even large size documents or newspaper pages without spending a lot of money.

ereszet
10-05-2007, 09:51 AM
nice

since you are more experienced, can you post here some:

1) program macros for batch image correction (what to correct)
2) calculations for the optimum angles, camera distances, amount of lighting (I suppose prof photographes may help)

something else, your camera costs over 1k USD, will cheaper 12MP cameras will do the job? anyone tested?

also, has anyone tried to work with any of these (http://www.amazon.com/s/ref=nb_ss_p/103-1905967-0107833?initialSearch=1&url=search-alias%3Dphoto&field-keywords=copy+stand&Go.x=0&Go.y=0)?

last, I suppose all people occupied with book shooting have tried the snapter program from atiz, JFYI

For batch processing, I am posting some hints and problems in the pdflrf by cacapee thread in Content rather Accessories forum.

You do not need any calculations for angles, lighting, etc. You just shoot your photos and compare results.

My Canon Powershot 1 is today less than 500 USD (I guess), Canon A640 is even less. My two compact Casios are good enough as well, alas there is no way to connect them to a computer for shooting and they have no remote.

I have Bogen/Manfrotto copy stand at home away from where I am now, and I am going to use it instead of tripod with my v-cradle when I am back home.

Atiz hardware looks very good, and Snapter is a promising tool but it is too slow and buggy in version 2.0

ereszet
10-05-2007, 10:50 AM
Picture 1. A flexible cheap arm bought in a photo accessories shop, attached to a chair. I do not recommend that particular type of arm - it is neither flexible or stiff enough even for small cameras.

Picture 2. Details of a wired remote (again a computer accessories shop) attached to the camera.

Picture 3. Professional flexible arm by Manfrotto (it is for lighting support rather than a camera). It is very solid but costs a lot.

I tried all that some years ago, and I recommend buying a good tripod whith short legs (if you want to keep it on a table together with the cradle) instead.

ereszet
10-05-2007, 01:34 PM
I almost forgot about them, they are really nice. I use them occasionally.

1. A Manfrotto super clamp with a ball head. You can attach it to almost anything: a shelf, a table, a chair, a lamp stand, etc., and you can take it with you anywhere. It is very solid and stable. I recommend a Manfrotto Photo Catalogue available on the internet. You will find there all kind of photo supports and accessories, including precise ball heads.

2. Same as above plus a quick removal plate

3. A microphone stand (image from the internet). You replace a microphone holder with a ball head and get the best camera "monopod". It does not take your precious space with three legs like a tripod. You can put it on the floor. With its long arm you can reach any place you wish. You can even put your camera very high over the floor with the arm set almost vertical to shoot large posters.

Studio717
10-05-2007, 05:22 PM
Mic stand! Great idea. Just hit up my musician dh for a boom mic stand. Now I have to find the camera tri-pod mount adapter and I'm in business. :D

ereszet
10-05-2007, 05:49 PM
Mic stand! Great idea. Just hit up my musician dh for a boom mic stand. Now I have to find the camera tri-pod mount adapter and I'm in business. :D

As far as I can remember (I have no mic stand at hand now), you will need one of those adapters to change the screw thread and size from 1/x" to 1/xx", depending on your mic stand and ball head for the camera.

user
10-07-2007, 03:09 AM
I run a paperless office.

I don't know what kind of office you run, but I doubt one will ever manage to operate an office paperlessly (well, in the near future at least)

things like seals and signatures in original papers will hardly ever be digitally substituted

ereszet
10-07-2007, 03:31 AM
I don't know what kind of office you run, but I doubt we will ever manage to operate an office paperlessly (well,, in the near future at least)

things like seals and signatures in original papers will hardly ever be digitally substituted

We will have true paperless offices in the future but for now "a paperless office" is something as described in wiki: http://en.wikipedia.org/wiki/Paperless_office

For me, a paperless office is the way of processing information in the office. I store all the original stamped and signed documents somewhere in the office (it is the job of my secretary) but I never have to search for them physically if I must refer to them. I just use a few words that describe the subject and I retrieve in my computer a list of indexed files that contain those words (a boolean search is possible). I don't even assign any keywords to my document files, which is a standard practice in corporate "paperless office". Every OCR recognized word in my documents is indexed, so a combination of words and date of the file let me find anything in seconds.

ereszet
10-07-2007, 04:11 AM
To give you an idea of the volume of information I process in my paperless office. Just one of the folders in my computer contains: over 100 thousand documents (over 2 million word count) taking about 7 Gb of disk space plus 300 Mb for the index files. My 500 Gb disk is already nearly full with all kind of info. Of course I use another disk as a backup. This year, one of my disks failed, but I only lost about 10 thousand pictures from my trips abroad (I didn't bother to backup them in time - now I repent that).

To be true, not all documents come from photoscanning. A lot of them are files recerived by e-mail in digital form or conference papers, government regulations, etc. downloaded from the web.

ereszet
10-07-2007, 06:13 AM
Big size:
- A laptop with a "brick" power supply. A Linksys wireless and a 3G telephone card. A tablet laptop would be nice but for the time being the prices are disouraging.
- Canon camera that can upload pictures via USB directly to the laptop. Another "brick" power supply for Canon.

Small size:
- Sony Portable Reader for crucial b&w documents converted to lrf or png (and some books for leisure). 2 Gb memory card. No power supply necessary for browsing 7000 pages.
- Archos 704 wifi for crucial color documents converted to pdf or png (and some movies for leisure). 80 Gb hdd. A small power supply (battery life is for about 5 hours).
- A Casio compact camera with a 2 GB SD card. No power supply needed for a few hundred shots - enough for a week long business trip.
- A small external SD card reader to move the pictures from the camera SD card to Archos (for Sony the SD card itself with all the pictures can be inserted directly).

In addtion, for both big and small offices on the go one needs a cellphone and a small power supply again.

A bit of advice:
When cosidering a compact camera for the small office-on-the-go try to find a camera with direct usb connection, unlike Casios that are very compact but require a cradle and a power supply.

I hope that in a year or two a color e-ink tablet with 7" screen, long battery life and 3G (etc.) connection will replace dozens of today's devices using "crippled operating" systems..

user
10-07-2007, 08:17 AM
can you tell me please what the last 2 photos here show?
http://www.mobileread.com/forums/showpost.php?p=98093&postcount=3

how did you achieve the last photo here? it looks like scanned, is it an actual photo? its very OCRable
http://www.mobileread.com/forums/showpost.php?p=99347&postcount=13

last, how did you shoot the second of these photos? its very OCRable
http://www.mobileread.com/forums/showpost.php?p=102042&postcount=28

thanks

ereszet
10-07-2007, 09:07 AM
can you tell me please what the last 2 photos here show?
http://www.mobileread.com/forums/showpost.php?p=98093&postcount=3


It is a foldable light diffusor (photo accessories shop). When folded it is ca. 30cm in diameter. When unfolded it is over 1 meter in diameter. It is attached with strings and clamps to a lamp post.

ereszet
10-07-2007, 11:33 AM
how did you achieve the last photo here? it looks like scanned, is it an actual photo? its very OCRable
http://www.mobileread.com/forums/showpost.php?p=99347&postcount=13
thanks

Yes, it is an actual photo of a test page, to show that nice final results are achievable. I cannot remember now whether I took the DayTungsten.jpg and cut the margins or did I took another shot with a zoom to get that part of the page in the frame without black borders. I may have used Picasa to automatically improve contrast and perhaps add some brightness.

All that has little to do with OCR. It is rather for my esthetic pleasure. Finereader recommends not too manipulate the images before OCR processing.

In my experience, the only case when some preprocessing helps to get better results in OCR is when the original images are so dark that it is difficult to see details. Picasa's automatic contrast does the job.

In Finereader processing I recommend to change the resolution of the picture to what it really is and not what some cameras provide wrongly in the picture info (like 96 dpi). I usually change the resolution of all pictures in batch to 280 dpi (Canon Powershot 1) or I look at the picture in the Finereader full screen Window and change the resolution in such a way as to get the view of the picture as wide as the window is. I cannot explain the theory behind the last approach but it looks that if I can see the picture well than the OCR processing is more happy.

ereszet
10-07-2007, 11:53 AM
last, how did you shoot the second of these photos? its very OCRable
http://www.mobileread.com/forums/showpost.php?p=102042&postcount=28


Automatic black border removal and conversion to mono by ClearImage demo.

I convert color pictures of b&w pages to mono in order to reduce the size a picture takes on the disk. With thousands of document pictures stored on a disk, the size matters.

Finereader OCR can automatically convert color to mono on input (it is a legacy feature) but it prefers to deal with original color pictures.

user
10-07-2007, 12:01 PM
Finereader recommends not too manipulate the images before OCR processing.

this is very interesting, but I cant imagine what kind of logic it has, since manipulating image will make OCR easier

perhaps, ABBY recommends this because finereader already does the manipulation by its own? but in this case, I can't see how pre-OCR manipulation can harm

as for these photos:
http://www.mobileread.com/forums/showpost.php?p=102042&postcount=28
what improved the second/right image and made it perfectly OCR-able?

ereszet
10-07-2007, 12:56 PM
this is very interesting, but I cant imagine what kind of logic it has, since manipulating image will make OCR easier

perhaps, ABBY recommends this because finereader already does the manipulation by its own? but in this case, I can't see how pre-OCR manipulation can harm

as for these photos:
http://www.mobileread.com/forums/showpost.php?p=102042&postcount=28
what improved the second/right image and made it perfectly OCR-able?

Yes, finereader does in batch: rotating, correcting resolution, straightening text lines (poor algorithm), despeckling, and of course recognition of blocks and text. On input it allows to detect image orientation, split dual pages and convert to black and white.

As for preprocessing they especially advice not to fatten or otherwise manipulate the pixels around the text.

The last question I already answered in the previous post.

user
10-08-2007, 04:34 AM
JFYI, a nice OCR article aiming to specify optimum OCR conditions (http://groundstate.ca/ocr)

ereszet
10-10-2007, 02:36 PM
JFYI, a nice OCR article aiming to specify optimum OCR conditions (http://groundstate.ca/ocr)


The approach to testing Linux OCR programs in the link you provided is rather simplistic. It does not take into account that the scanned pages are frequently not "clean" and come with lower resolutions (like fax pages), they contain layouts with blocks of text and images, the text may be in different colors (the author didn't even test italic and bold print for his few selected fonts), the 8 point and even 6 point fonts are not uncommon in scanned originals, the original lines of text may be skewed and even curved, etc. One accented word he used to test the OCR performance (presumably with the French and English languages selected) does not give a clue about recognizing text with a number of different languages in one document (I frequently recognize by OCR more than six languages in old books - Latin, English, French, German, Cyrillic, various Slavic languages, plus their old varieties). While providing some hints on the performance of a few Linux OCR programs, the tests have little to do with real life situations. I am afraid that Linux OCR programs are about 20 years behind the commercial Windows OCR programs, like Finereader 8. It is not about programming, it is about recognition, cleaning, deskewing, binarization and other algorithms.

To give you an idea how correct is the recognition using photoscanning and Finereader 8, I used the line of text as in the tests above, and added 8 point plus italic fonts. Below are the results.

Number of characters (with spaces) 1852

1. Word document converted by software to jpg at 300 dpi (no scanning), picture attached - 6 uncertain characters (no errors)
2. Word document converted by software to jpg at 400 dpi (no scanning) - 16 uncertain characters
3. Paper printout of the Word document photoscanned at about 300 dpi, picture attached - 17 uncertain characters
4. Paper printout of the Word document photoscanned at about 300 dpi and binarized by ClearImage demo, picture attached - 11 uncertain characters

While most uncertain characters were recognized correctly, some where wrong - mostly wrong recognition of characters in the articles "the"). The 1% uncertain/error margin will make little difference when searching for a combination of words in an OCRred and indexed files, however it would make a difference for presenting the document as a copy of the original. That is why for the output of OCR the best option is to use pdf picture over text.

As strange as it may look, 400 dpi resolution did not improve the recognition but binarization did.

In the photoscanned picture you may notice that corners are a little darker that the rest of the page. This is a feature of camera zoom. To avoid dark corners you just use a lower zoom. The higher is the resolution of camera sensor, the lower is the zoom required to get the photo at a given resolution for a page of a given size.

user
10-10-2007, 06:13 PM
may film cameras have better results than digital cameras?

ereszet
10-10-2007, 07:00 PM
may film cameras have better results than digital cameras?

I guess that a film camera (i.e. still photo film camera) can produce a better result (in terms of resolution and color) than a digital camera. Only a few years ago, any professional photographer would dismiss digital camera as a toy. But today's digital cameras are a different breed, so we need an opinion of somebody who still uses a reflex film camera.

As far as examples of microfilm reproductions available in digital libraries are concerned, the quality of pictures is very bad. Is it because they didn't have a repro v-cradle for shooting? :)

If you need only paper copies of paper documents and not digital copies, you may achieve the same (or better quality?) with my v-cradle design and a film camera, but the lighting, focus and postioning problems will be the same for analog and digital shooting, with much more difficult and costly setting and processing for analog pictures.

If you think of a digital video camera, the zooming is impressive (x800 etc.) but when you save a frame, the resolution is rather low. However, you can probably use the zoom to read your neighbour's book over a distance. :)

user
10-11-2007, 05:35 AM
maybe printing the digital camera shoots and then scan them with a sheetfed scanner will be increase OCR accuracy and document quality?

ereszet
10-11-2007, 07:02 AM
maybe printing the digital camera shoots and then scan them with a sheetfed scanner will be increase OCR accuracy and document quality?
If you prefer to buy a camera instead of a house, you may use a Hasselblad or similar "space age" film camera, print the photo at 1200 dpi and scan it. Perhaps it can improve the rendering of 6 point and lower fonts in case you are trying to make a copy of a thumbnail size Bible. But you can use a digital Hasselblad H3D for that as well.

As you can read in my previous post, even the perfect copy of a Word document will give you some warnings/errors at the OCR stage. You can correct them manually and you can teach the OCR program to recognize most frequent warnings/errors correctly, if you ocr documents with a similar typeset.

Generally, I do not see a reason for you to concentrate on high-end cameras, scanners or expensive repro setups, unless you are in a business of printing, cartographic, geodetic, CAD or similar industry.

For a paperless office of any size a v-cradle, a proper lighting setup and a "descent" digital camera will do the job. The students can have their texbooks converted to Sony Reader in a matter of hours (copyright observed - it is different for different publications and countries) and after processing with pdflrf they are perfectly readable on a small and greyish Reader screen. The OCR and indexing will help you find any word or phrase if you store the textbooks in your laptop (unfortuntely, the Sony Reader will not allow you to search for text.

user
10-12-2007, 05:52 AM
as for ABBY finereader, while it does very good ocr job, its spell checking is mediocre:
you can't edit the dictionary entries
you cant use regex to implemet your own correction rules of common mistakes
it doesn't autocorrect 'on the fly'

ereszet
10-12-2007, 08:47 AM
as for ABBY finereader, while it does very good ocr job, its spell checking is mediocre:
you can't edit the dictionary entries
you cant use regex to implemet your own correction rules of common mistakes
it doesn't autocorrect 'on the fly'


In my Finereader 8 (version 9 is long overdue) you can use a Language Editor to create your own language based on one of the existing languages (i.e. copy of English) and add/import your own "user dictionary". There is also an option to use regular expressions. You can choose characters for your recognition set, prohibited characters, characters ignored in words. There are various options for punctuation characters and a Pattern Editor as well.

Having said all that, I admit that I have no time to play with all the options available. My current backlog of documents to ocr is over 1000 pages (and several hundred downloaded books). The 1% error margin in word recognition is acceptable for indexed text search, when you can use alternative words (or wildcards) to find and display a document with the words in context.

ereszet
10-12-2007, 03:44 PM
I owe you - the visitors to this thread - a warning. While photocopying and processing images is fun, the fun ends when you have to save and file the final pdf. You must give a meaningful name to every document and file it in a proper directory/folder. It is easy for a 500 page book. It is boring for 500 single sheet documents.

I always save my documents with a name starting with current date in the format 20071012 xxx yyy zzz, where xxx etc. are meanigful keywords like "letter from ..." etc. Although it is easy to find an indexed document by searching the key words (key words rather then keywords - adding keywords would be another boring and time consuming step), sometimes you find thousand of documents referring to a similar subject. The date helps to find the document you are looking for, sometimes even without the index search but by simple looking at or searching the names in a directory. Total Commander is a very useful tool for that.

Unfortunately, we cannot avoid this time consuming manual step of giving names to digital documents unless we delegate this job to our secretaries. I do it myself, since I am the only one who takes decisions on processing the document contents. The secretary classifies and files the documents in a traditional way, but we rarely need to rertrieve them physically from paper document folders.

Studio717
10-12-2007, 11:12 PM
Fortunately, most of what I'm saving is research materials, so I give the files names I can later search for. The hard part is remembering to add in keywords.

A cradle I ordered is scheduled to arrive Monday, so I'm hoping I can test out my system next week. (The adapter for the boom mic stand arrived today.) Getting close!

user
10-13-2007, 07:07 AM
any recommendations about the glass used to flaten the pages? what material, coatings, dimensions, density, thickness?


A cradle I ordered is scheduled to arrive Monday, so I'm hoping I can test out my system next week. (The adapter for the boom mic stand arrived today.) Getting close!

may I ask where you order the cradle from?

ereszet
10-13-2007, 07:41 AM
[QUOTE=user;105954]any recommendations about the glass used to flaten the pages? what material, coatings, dimensions, density, thickness?

My glass is about 5 millimeters thick and heavy enough to flatten even creased magazine pages. The size is A4 (210x297) plus 1 cm margin on each side. It is a kind of glass used for glass shelves, nothing special. No anti-glare glass is required with my setup, but possibly with an anti-glare glass you could avoid obscuring the camera with a black paperboard. I have tried a glass from my damaged scanner but it was too light to flatten the pages. Good enough to flatten my family photos to digitize.

user
10-13-2007, 11:13 PM
as for my scanner suggestion, I think we are close:

the HP 4670 scanner is only 0.75 inches thick -> scanning of book pages while having the book opened as minimum as possible
the Microtek s280 scanner has almost no margin (the plastic margin outside the glass scanning area) -> full page scanning without any distortion near the book spine

if we could combine the features of the two above, we will have a scanner that can be placed between book pages almost as a paper and it will scan the pages while having the book almost closed and without any distortion near the spine

ereszet
10-14-2007, 05:53 AM
as for my scanner suggestion, I think we are close:

the HP 4670 scanner is only 0.75 inches thick -> scanning of book pages while having the book opened as minimum as possible
the Microtek s280 scanner has almost no margin (the plastic margin outside the glass scanning area) -> full page scanning without any distortion near the book spine

if we could combine the features of the two above, we will have a scanner that can be placed between book pages almost as a paper and it will scan the pages while having the book almost closed and without any distortion near the spine

One company that is innovative and capable of constructing a v-shaped scanner based repro station is Atiz.

They already offer an automatic book scanner called BookDrive and a manual photo scanner called BookDrive DIY, so they have knowledge and technical skills to combine the two in a v-shaped scanner. But don't hold your breath yet.

Each solution has its advantages and disadvantages. The BookDrive is automatic but is only suitable for a certain size and thickness of books. The BookDrive DIY is semi-automatic (the v-cradle centers automatically, the shots can be taken automatically, but requires manual rasing of transparent cover and manual turning of pages). Again, the size of books is limited to the design dimensions. To make a v-shaped scanner competitive one would have to use the BookDrive DIY design with full automation, which means additional mechanisms and costs. Scanners are rather slow in scanning (especially higher resolutions and color pages), and automatic raising of the v-shaped scanner would take even more time. So the advantage of scanning 700 pages an hour (as Atiz claims) would have gone. The cost would be in the range of several thousand dollars if not more.

For a home and small office users my flexible v-cradle design has all the advantages with one disadvantage - the manual operation. There are no limits as to the size or thickness of books (you can put a big size cardboard base over v-cradle panels) and the shooting is as fast as you are in turning pages and flattening them with glass (the resolution and color doesn't matter for the speed of camera shooting). Once you set your lighting conditions right (you may experiment with a black tent to prevent any extra lights or shadows) the quality of reproductions is good enough even for professional applications. Several hundred OCR quality photos per hour is possible.

user
10-14-2007, 08:44 AM
One company that is innovative and capable of constructing a v-shaped scanner based repro station is Atiz.


a slim-and-without-margin scanner is what will do the job perfectly in every occasion, no need for V shape, cameras, etc

consider the glass you put on the book to flaten the bookpage, to perform a fast scan <3 seconds and produce a 600 dpi image, wouldnt it be marvelous? its the easiest way

I cant imagine any other more efficient and convenient way to produce high quality copies

the digital camera solution is tempting, but I suppose it needs too much work and the result will never reach even 300dpi (in all digital camera forums, in all digital camera manufactures, I contacted, I was adviced to use scanner instead of a camera)

although, I am about to buy a Canon G9 (12MP - 6x optical) and make some attempts, but I doubt I will ever reach kirtas-tech.com or atiz.com results since they use Canon EOS of at least 17MP

ereszet
10-14-2007, 09:27 AM
the digital camera solution is tempting, but I suppose it needs too much work and the result will never reach even 300dpi

You are wrong about dpi. Firstly, you do not use 600 dpi for OCR, unless your print is very small. Recommended dpi is 300-400 dpi, and in one of my previous posts I gave you an example how a perfect (Word print to image) picture gives better OCR results at 300 dpi rather than 400 dpi - with Finereader 8. Faxes at 200 dpi are OCRed with no problems as long as they are clean (no smudges).

Secondly, a 10 Mpixel camera gives you an equivalent of 300 dpi for A4 format and 600 dpi for A5. I refer you to a nice OPTICAL DPI TABLE in Atiz FAQ (http://www.atiz.com/support.php). You can easily calculate it yourself as well. However, to be true, I quote below a comment by Atiz:
"It is kind of hard to compare the dpi from cameras vs. the dpi from scanners. The results from the cameras at lower dpi can be higher quality than scanners. DPI is not the total information that is equal to the quality of the image. It is just one factor of many other factors that represent quality."

I agree with them. I have used a dozen of various scanners (including sheet feeder scanners) in my work. One of them is standing idle on my desk even now, but I have no use for it. Photoscanning is fast and produces good quality results for normal home or office use.

user
10-14-2007, 09:42 AM
mm what about barrel distortion, white balance, color distortions that are common in cameras?

scanners have minimum artifacts (mainly from dust) and inexisting distortion

I dont have an opinion about camera scanning at the moment, I just collect facts and most photographers say it wont work or it will be too much job for such a bad quality

I hope G9 will be enough and I will come back with positive feedback

the atiz supported cameras are +1k USD

ereszet
10-14-2007, 10:37 AM
mm what about barrel distortion, white balance, color distortions that are common in cameras?

scanners have minimum artifacts (mainly from dust) and inexisting distortion

I dont have an opinion about camera scanning at the moment, I just collect facts and most photographers say it wont work or it will be too much job for such a bad quality

I hope G9 will be enough and I will come back with positive feedback

the atiz supported cameras are +1k USD

1. I see no barrel distortion when shooting A5-A4 formats. I see sometimes darker edges, which can be avoided by changing the camera zoom. White balance is the problem of lighting and your perception. All the digital cameras I have, provide white balance settings for several different light conditions (daylight, fluorescent, tungsten, automatic) and a possibilty to set the parameters based on your shooting of what you believe is a white page (or a Kodak gray sheet). If your light is uniform you can easily correct white balance in RAW photos or even in jpg (see Picasa). Color depends on the mode settings in the camera. I recommend neutral (no processing inside the camera) mode. You can match the colors to your taste in a number of photo processing programs (Picasa included). Scanners need as much color attention and postprocessing as digital cameras. And yes, the scanners (especially the ones in the range of $100) distort images, because of non uniform speed of their CCD or other sensor bar. As for white balance, most of the time the white from a scanner looks white but not always, and you have no means to adjust it in the scanning process (I do not refer to high end costly scanners)

2. Most photographers dismissed digital cameras a few years ago. Times change.

3. Your Powershot G9 should be more than enough for the job. My first digital scanning camera was G5 (5 Mpixels) and was quite good for A4 format sheets and OCR. Unfortunately, Canon makes some strange marketing moves. They removed raw mode from G7 (fortunately it is back in G9), but they also removed a flexible, rotating LCD screen (which is still available on a cheaper A640). That, and also the lack of a pop-up flash in G-series, is the reason why I stay since 2004 with my Powershot Pro 1 and wonder why Canon doesn't want my money for Pro 2.

4. As for Atiz DIY BookDrive, they provide support only for selected cameras, because they are not in the camera business. Canon cameras come with the software to shoot from the computer, and some come with infrared remote. An independent developer of Breeze Systems sells the PSremote program that works with practically all Canon cameras, and is much more convenient than the Canon software.

ereszet
10-14-2007, 05:06 PM
It is time to present some comparison results scanner vs. camera.
The scanner is hp scanjet 2300c at 300 dpi color.
The camera setup is Canon Powershot Pro 1 on a tripod, v-cradle put on a box to bring it closer to the lens, a home 50 watt lamp with halogen mini bulb - about 1.5 meters over the cradle, a diffusor for dissipation of light - about 40 centimeters below the bulb (see my first posts in this thread).

Neither the scanner nor the photo setup are top quality devices/arrangements. It is just what I have at hand in my temporary residence away from home.

The scanning took a lot of time (between 30 and 60 seconds if not more - I didn't look at my watch).
The photo took a split second plus a dozen or so seconds to download the image to my computer. All camera parameters apart from zoom and white balance were automatic. White balance was set to "tungsten". No attempt was made to set the custom white balance, based on a Kodak gray card (or a white sheet of paper).

The object to scan and photo was a DVD plastic reflective case with a sleeve cover underneath and miniscule text in 10 languages. See the attached picture "original scan and photo.jpg" (lefthand image is scan, righthand image is photo).

The black margins from both the scan and the photo were removed manually. See the attached picture "images with black margines removed.jpg" (lefthand image is scan, righthand image is photo). The colors are not true neither in scan (too blueish) nor in photo (too greyish). The white is more white in the scan (possibly can be made better in the photo with custom white balance settings or with postprocessing). The photo shows a small barrel distortion due to the object being close to the lens. My camera would not allow me to zoom from a further distance (automatic focus would not work; I have not tried to use the manual focus since it is awkard and cumbersome to use in Powershot Pro 1 - despite a nice ultrasonic ring that can be rotated manually, one needs to push some buttons at the same time, which requires three hands).

The quality of images can be seen in detail in the attache picture "scanner and photo detail.jpg" (top image is scan, bottom image is photo). Amazingly, the photo renders the fonts much better than the scan.

The OCR results with Finereader 8 are (out of 10 languages, two were not installed in my recognition engine):
scan - uncertain characters 82, total characters 2130; error rate 4%
photo - uncertain characters 44; total characters 2093; error rate 2%

To make the OCR results visible, I saved the results as text and image. See the attached "text and image picture" (lefthand image is scan, righthand image is photo - both after OCR). Usually, I save text under image, which preserves picture true copy of the original.

What arrangement wins? In my opinion, both for speed and quality the camera is better. But you can draw your own conclusions.

As a bonus, I include an example of OCRed photo of text on a Scan Reader screen (original at the top, OCRed "text over image" at the bottom of the picture). That the result is perfect is just accidental. You can expect some OCR errors with other texts.

user
10-14-2007, 08:35 PM
thank you very much for this!

the results are astonishing to me, camera way wins hands down, far superior to scanner way

however, I think Canon Powershot Pro 1, as a professional SLR camera, even if the megapixels seem low for todays standards (8MP), it is better than some of todays compact/ultracompact 12MP cameras, do you get the same results with your compact casio?

here (http://www.dpreview.com/learn/?/key=sensor%20sizes) you can see why SLR cameras are superior to compact cameras and why 8MP is not the same for all cameras

so, it would be interesting to estimate the optimum camera that would do the job and that anything more would be an overkill

in "original scan and photo.jpg" the scanned image is at the left? why its so small?

ereszet
10-15-2007, 04:26 AM
in "original scan and photo.jpg" the scanned image is at the left? why its so small?

The object in the original scan is placed in an A4 scanner so it looks small. The object in the original photo is taken with the maximum zoom allowed by the camera (while retaining focus) to get as large the picture in the frame as possible.

The scanner set at 300 dpi will make 300 dpi scans no matter what is the size of the object, be it an A4 document or a 1" stamp. The camera can zoom to get the object in the frame with little or no background, so the dpi equivalent is greater for smaller object (same Mpixels for smaller size). It is a kind of self-adjusting dpi.

One can say that it is unfair to compare a set dpi in a scanner with a self-adjusting dpi equivalent in a camera, which can much exceed the scanner setting. But my comparison is not about fairness but about practicalities. I will not even try to waste my time waiting for the scanner to take a color scan at 600 dpi or higher resolution. It would take ages.

As for my camera, Powershot Pro 1, despite the name, is not a professional SLR. It is an old "prosumer" camera belonging to the same class as Canon G-series cameras (the sensor is much smaller than the one in SLRs). Later today, I will conduct some tests with the same object and my compact cameras.

user
10-15-2007, 09:12 AM
Later today, I will conduct some tests with the same object and my compact cameras.

this will be very interesting

the sensor is much smaller than the one in SLRs

the sensor size of Canon PowerShot Pro1 is 2/3", which is significantly bigger than most compact cameras' sensor size (1/7", etc)

and given the fact that, "digital compact cameras have substantially smaller sensors offering a similar number of pixels. As a consequence, the pixels are much smaller, which is a key reason for the image quality difference", the results may be disappointing

pixel count does not accurately determines image quality, pixel quality (among other factors) defines image quality
image quality may be better in low megapixels, if the pixel quality is better

also, did you save in RAW or JPG?

ereszet
10-15-2007, 10:34 AM
Following my earlier post, below are further test results including two models of compact Casio cameras.

Setup: since I do not touch my tripod and Canon Powershot Pro 1 anymore to avoid lengthy angles adjustment afterwords, for testing Casio I used a superclamp with a headball, attached to the back of a chair and my v-cradle put on the seat of the chair over a cardboard box. I took the photos outdoors (picture attached).

The settings:
1. HP scanjet 2300c 300 dpi color
2. Canon Powershot Pro 1, 8 Mpixels, resolution 3264 x 2448, sensor ccd 2/3" (8.80 x 6.60 mm), indoors with halogen 50 watt lamp, white balance set to tungsten, other settings automatic
3. Casio Ex-Z1000, 10.1 Mpixels, resolution 3648x2736, sensor ccd 1/1.8 " (7.18 x 5.32 mm), outdoors in shadow, white balance set to custom based on white sheet of paper, other settings automatic
4. Casio Ex-Z850, 8.1 Mpixels, resolution 3264 x 2448, sensor the same as Ex-Z1000, outdoors in sunlight, automatic white balance, other settings automatic.

The picture attached (margins trimmed.jpg) shows in the top row images from the scanner and the Canon, and in the bottom row images from Ex-Z1000 and Ex-Z850. Apparently my custom white balance setting for Ex-Z1000 was wrong (I used a white sheet of paper instead of Kodak gray card. However a white sheet works for me indoors) .

The details in the attached picture "detail.jpg" show that rendering of fonts with camera photos was in all cases better than in the scan (download and see the picture full screen). The images are in sequence 1 to 4 from top to bottom.

OCR results with Finereader 8 (no preprocessing of pictures)

1. scan - uncertain characters 82, total characters 2130; error rate 4%
2. Canon - uncertain characters 44; total characters 2093; error rate 2%
3. Ex-Z1000 - uncertain characters 52; total characters 2100; error rate 2%
4. Ex-Z850 - uncertain characters 80; total characters 2133; error rate 4%

The pdfs saved as text and images are attached in the last picture.

Conclusions:
- The results from a compact Ex-Z850 camera are comparable to those from the 300 dpi scan.
- Canon (8 Mpixels) gives better results than Ex-1000 (10.1 Mpixels) because of a larger sensor.
- The higher resolution (10.1 Mpixels) Casio gave better results than 8.1 Mpixels Casio, despite the same sensor size (however, it is not as good as 8.1 Mpixels Casio when focusing under dim condtions).

BTW. I have no time to deal with RAW, so I save pictures as the biggest size jpgs (compressed in the camera). Typical sizes are 2 to 4 MB.

user
10-15-2007, 04:32 PM
Very interesting results, imo they prove that high MP but cheap cameras wont work well. The 8MP Canon outperforms the 10MP Casio, so megapixels is not the only factor we must take into consideration: we need an optimum of both amount and quality of megapixels.

although 98% OCR success is superb, what if you postprocess the images with a software? do you reach a 100%? does postprocessing increase the OCR success in both scans and photos?

I am sure your work will be referenced from all those who try to digitize using a camera, and maybe in the futures scanners will extinct for home/small-office digitization

ereszet
10-16-2007, 10:29 AM
although 98% OCR success is superb, what if you postprocess the images with a software? do you reach a 100%? does postprocessing increase the OCR success in both scans and photos?

I am sure your work will be referenced from all those who try to digitize using a camera, and maybe in the futures scanners will extinct for home/small-office digitization

1. Please note, that my test was a kind of a "destructive test" (like crushing your car against the wall to see damages). The title sleeve was under a reflective plastic cover, the text was miniscule and in ten languages of which two were not installed in my OCR engine, the pictures (flags) were so small that could easily be taken as a text in color, the background was in color, etc. It is difficult to imagine similar case in practice. That the error rate with my camera setup was just 2% was amazing. One should examine the text after OCR (ten languages) by reading it to get the real taste of what Finereader is capable of doing with moderately good photos (no white balance) of extremely difficult layouts.

On average you can expect a 1% error rate (even with picture perfect photos/scans of text), unless you have hundreds of similar pages and you train the program (dictionary, patterns) with the first dozen or so pages to make sure that the rest is recognized error free.

In my experience, there is always an error here and there. Even if you scan the same document twice under identical conditions, the errors will appear in different places. Apart from sophisticated algorithms, some kind of a huge contextual database and artificial intelligence is the future. With handwritten text there are different recognition algorithms based on the movement of your pen rather then patterns.

2. Preprocessing is not required if you take the photos in proper lighting condtions and make sure that they are rectangular (removing of black borders may be required when the book size proportions are different from the photo frame). Batch contrast improvement may be useful for photos taken in dim lighting (in a hotel on your business trip). Deskewing and despeckling may be useful for faxes that you receive (a lot of people do not clean their fax machines). Correcting perspective and straightening text lines may be required for photos taken by hand or photos of double pages of thick books. Binarization will help you reduce the size of resultant pdfs. All this can be done before using Finereader and some preprocessing can be done by Finereader (but Finereader uses default parameters that may not be ideal with your specific documents/books). Finally, sometimes you may need to remove black blobs that are due to non uniform lighting conditions. When you binarize color or grey pictures with some areas where text and background are barely discernible, you will get the text surrounded by black lines, smudges and blobs. By changing the binarization parameters you can get less of that but the danger is that the text will disappear as well. The best method I found is to recognize blocks of text and pictures in Finereader and save all the images with blocks of text and picture only. Then, you load the images back to the Finerader and check the thumbnails. You will easily see those images that still have black spots and you can remove them with the eraser. After that you save again your images as blocks, load them back, recognize the text, save the final result in whatever format you want, and add white margins afterwords if you need them.

Please do not be put off by the procedures I described above. If you use my setup with adjustable v-cradle you will shoot 10 perfect photos a minute (a book in one hour - with automatic camera shooting, 6 seconds lapse time) and you can directly OCR it. If some pages go wrong (e.g. you turned two pages together instead of one, you moved the book a little and the binding shows in some pictures) you just take again the photos of those pages only.

user
10-18-2007, 12:10 PM
this is a sample from the Casio EX-Z1000 (10MP)
http://69.93.231.164/PRODS/Z1K/FULLRES/Z1KhMULTI3648F.JPG

this is a sample from the Fujifilm Finepix F50fd (12MP)
http://69.93.231.164/PRODS/F50FD/FULLRES/F50FDhMULTI4000F.jpg

in my opinion, the Casio Z1000 is slightly better (even though its 10MP and Fuji 12MP), the letters are more sharp and clear, with less noise (I dont examine color accuracy)

I cant imagine how can Casio (that is not a big brand in digital cameras) produce so fine quality and how can Casio produce so poor performance in your test (its lower than the 8MP Canon, while Casio is 10MP)

I afraid we will never reach with a compact camera, the OCR results we get from a SLR

also, is there any chance to acchieve more than 98% OCR success? 100% maybe? what is needed? more decent camera? more megapixels? more pixel quality? (98% can be alot of if we are talking about many books)

in this nice site (http://www.imaging-resource.com/IMCOMP/COMPS01.HTM) I couldnt find any 10-12MP compact camera that produces better image quality than Casio EX-Z1000 and Fujifilm F50fd (I didnt look for SLR)

I was thinking to buy one between Casio EX-1080 (10MP), Casio EX-Z1200 (12MP), Fujifilm Finepix F50fd (12MP)

but I cant find the "golden mean", the optimum combination of megapixels and image quality

it is known that cameras with more megapixels produce lower image quality photos, the more megapixels the less image quality (the main is noise and sharpness)

so would a 10MP camera perform better than a 12MP camera in OCR?

user
10-18-2007, 02:05 PM
oh, this one is very sharp too (I think the sharpest):

Sony Cyber-shot DSC-N2
http://69.93.231.164/PRODS/N2/FULLRES/N2hMULTI3648F.JPG

(unfortunately I cant find Powershot Pro1 images to compare)

ereszet
10-19-2007, 05:55 AM
Do not take my ad hoc tests as a scientific proof. The results may vary from shot to shot. My tests only show that a digital camera can be as good (and much faster) for scanning as a cheap scanner.

For professional comparison of practical resolutions see http://www.robgalbraith.com. The author provides tests for all the cameras reviewed. (The link didn't work for me today because of Gateway Timeout. Probably a temporary problem).

When deciding on which camera to buy remember about a remote capture feature, which is available for most Canon and very few other cameras. Check http://www.breezesys.com/ to see what cameras the author's Psremote program supports. Also, make sure that the camera can be powered by an external AC power supply.

From http://www.steves-digicams.com, I quote:

"RemoteCapture allows you to operate the camera while tethered to a computer via the USB port and includes a live viewfinder option. The Field angle/flash screen gives you access to settings for: Focusing Point, Flash, Macro Focus, AF-assist lamp and AF operation.

The Image Size and Quality is selected at the top. The shutter can be tripped by either a mouse click or a hot-key on the keyboard.

Captured images can be saved directly to the computer or they can be stored on the camera's CF card and then transferred to the hard drive

The Remote Capture program is also capable of interval and timer recording as well as instantaneous. Shooting interval: approx. 1 – 60 min. (1-min. increments) Number of shots: 2 – 100 shots (Maximum number of shots varies according to memory card capacity"

Avoid compact cameras that need a cradle to download images or to recharge batteries (unfortunately that is the feature of my Casios - two different cradles for two similar cameras). Camera+cradle+power supply is no longer compact.

If I were to buy a compact camera today, I would choose Canon Powershot A640 or a successor.

user
10-19-2007, 10:53 AM
mmm, I dont think Canon Powershot A640 is the best for OCR

here is a comparison of Canon Powershot A640 with Sony N2 (canon at right, sony at left):
http://shup.com/Shup/8128/untitled1.png

in my opinion, canon looks blury and less sharp than Sony's, but with good color reproduction (black is greyish, but it doesnt have purple in it as Sony's and white seems better that Sony's), however I dont know which is better for OCRing and which is more easily corrected with postprocessing, accurate colors or sharpness

user
10-19-2007, 11:31 AM
also, there is something else that must be taken into consideration:

when we are saying that Finereader needs at least a 300dpi scan, or at least a 8MP photo, I think we mean that the fonts must have a minimun size so that the OCR program will recognize them and it will be able to see the differences between the letters

so, a small font text will need more dpi/MP than a large font text

also, we must calculate as image dinemsions, not the dimensions of the paper (A4 or more), but the actual dimensions of the text column (and maybe a little bit bigger just to avoid the blurry corners than most cameras have)

so since 10MP to 12MP is a 10% increase in the font size, and at the same time a significant loss of detail and more noise, I wonder which is the golden section of MP-size sensor-font size

ereszet
10-19-2007, 03:32 PM
in my opinion, canon looks blury and less sharp than Sony's

Good research. But I am afraid that with Sony you will miss this - photos attached.

user
10-19-2007, 04:36 PM
mm, RemoteCapture would be so important reason to buy a camera?
1) to be able to adjust settings from the pc and view the files to the pc?
2) to save images directly to the pc?
(or there is any other advantage I missed)

arent any alternative programs/firmware that can do this for other cameras? or it need hardware support?

cant one just use a 4GB card and transfer the files? (I suppose it will store the whole book)
cant one change the settings from the tripod and view the images in the lcd?

I recognize that all these would be handy ofcourse, but would it justify a compromise in image quality?

ereszet
10-19-2007, 07:01 PM
mm, RemoteCapture would be so important reason to buy a camera? ...I recognize that all these would be handy ofcourse, but would it justify a compromise in image quality?

Sorry for the poor quality photo of the laptop screen in my previous post. I forgot about PrintScreen. So here you are (pictures attached).

Previewing a photo on the laptop screen and changing zoom and other settings "on the fly" to get the photo right is crucial for me. Resolution is important but lighting and focus "make or break" the photo. You cannot see the quality of the picture in the camera viewer.

If you are cost conscious you have to suffer compromises.

ereszet
10-21-2007, 07:26 AM
It is time for examples of quality photo scans. For that, I took the photo of a page from an Ukrainian album on Kiev (Kyiv). Please note, that this is only to illustrate some procedures and therefore for copyright reasons I spoiled the photo with an overlayed text. The album is larger in size than a home scanner can take, so the photo scanning is the only way to copy the page. The photo scanning setup was not ideal due to a temporary arrangement that I use now. In particular, the lighting was a 50 watt overhead halogen bulb, ceiling lamps switched on, and a lighting diffusor. An ideal setup would require a bright (possibly special photo reprographic) overhead lamp.

In the first attached picture (image processing.jpg) you can see the original photo, the same photo binarized to black and white, and then the same binarized photo with the color picture inserted. To achieve that result you just copy the picture from the color original and paste it into the binarized one. You can do it with the free Irfanview program.

Why should one use color pictures in black and white pages instead of dealing with original color pages? For two reasons. First of all, a book of several hundred pages scanned in color will produce a set of images/pdf file of enormous size. Secondly, the black print in color pages is not quite black and the white background is not quite white. For comparison see the attached pictures (aleksander ocr pdf color.jpg and aleksander ocr pdf bw and color.jpg). The three images of OCRed photos in both pictures are sequentially the following pdf formats: text under picture, text over picture, and text and picture only.

Finally, I attach printscreens (lrf color vs black and white with color picture.jpg) from Sony Reader's Connect software that show lrf pages respectively for color and binarized (with color pictures inserted) pdf pages (text under image) refined with pdflrf program. Although the large size album with small print is not something you would carry with you in the Sony Reader, this example shows vividly the diiference between rendering color and mono pages in the Reader's lrf pages.

ereszet
10-21-2007, 04:56 PM
Long overdue Finereader 9 has been released. I will test it and report the results in this thread as soon asr I buy it. In the meantime I quote below some hints regarding photo scanning from:
http://abbyy.com/DLCenter/downloadcentermanager.aspx?file=/fr90/guides/Guide_English.pdf


1. Make sure that the page fits entirely within the frame.
2. Make sure that lighting is evenly distributed across the page and that there are no dark areas or shadows.
3. Straighten out the page if required and position the camera parallel to the plane of the document so that the lens looks to the center of the text being photographed.

Minimum Requirements
. 2–megapixel sensor
. Variable focus lens (fixed–focus cameras, common in cell phones and hand–held devices, will usually produce images unsuitable for OCR)
Recommended Requirements
. 5–megapixel sensor
. Flash disable feature
. Manual aperture control or aperture priority mode
. Manual focusing
. An anti–shake system, otherwise the use of a tripod is recommended
. Optical zoom

Shooting Modes

Lighting
Make sure there is enough light (preferably daylight). In artificial lighting, use two light sources positioned to avoid shadows. (ereszet remark: this is not necessary with the v-cradle design; one bright overhead light source and a light diffusor is recommended).

Positioning the Camera
If possible, use a tripod. Position the lens parallel to the plane of the document and point it toward the center of the text.
At full optical zoom, the distance between the camera and the document must be sufficient to fit the entire document into the frame. Usually this distance will be 50–60 cm. (ereszet remark: tripod with a ball head is not preccise enough to set the camera angle and distance to match different size paper documents every time they change; repro copy stand with micrometer plates will save you a lot of positioning effort; with a tripod, once you position the camera, do not touch it anymore, do all the distance and angle positioning with the v-cradle instead).

Flash
Whenever possible, turn off the flash to avoid glare and sharp shadows on the page. In poor lighting conditions, try using the flash from a distance of about 50 cm, or, preferably, use additional lighting.
Important! Using the flash when photographing documents printed on glossy paper causes the worst glare. (ereszet remark: see my post on redirecting the flash to the ceiling; glare does not interfere with the shots taken with the v-cradle setup).

White Balance
If your camera allows, use a white sheet of paper to set white balance. Otherwise, select the white balance mode which best suits the current lighting conditions. (ereszet remark: white sheet of paper may not work in bright sun; see an example in one of my previous posts; try Kodak grey card instead).

Auto focus may not work properly in poor lighting or when photographing at a close distance. In poor lighting conditions, try using an additional light source. When photographing a document up close, try using the Macro (or Close–Up) mode. Otherwise, if possible, focus the camera manually (ereszet remark: set the zoom and preview the focus with PSRemote remote capture program; if a page is almost blank with a few lines of text use another page of text to set the focus and fix the focus.)
If only a part of the picture is blurred, try reducing the aperture value. Increase the distance between the document and the camera and use maximum zoom. Focus on a point anywhere in between the center and a border of the image (ereszet remark: as long as that point is not a blank part of the page).

user
10-21-2007, 06:45 PM
Question:
I wonder now if we really need so many megapixels or its useless or overkill. And if not, what is the most importan factor to achieve highest OCR rates.

Scanner:
The above is a scan. I dont know how this scan was produced, what scanner was used, what scanner settings were used, I only know its dimensions (A4), its resolution (850x1103, 100dpi) and that its black-and-white.

http://speakingjazz.com/PageSamples/SpkJz2ndEd-DiatPg19.jpg

Here is the scan that was OCRed:
http://shup.com/Shup/8323/107921064.png
Here is the OCRed text:
http://shup.com/Shup/8485/107921054.png
Finereader results:
Uncertain characters: 18
Total characters: 2087
OCR success rate: 99.14%

The Finereader results were superb in my opinion, considering that, as you can see, most of the uncertain characters were some music characters.

This results make me really wonder. In order to produce a 100dpi image of A4 size you only need 0.9MP. So there must be something more than just resolution. Abby cannot say "you just need a 300dpi image" in order to OCR successfuly.

Camera solution:
The Fujifilm F31fd has excellent sharpness and image quality and 6.3MP resolution. It will produce very sharp images of 3024x2016 resolution and as the article from this respected website says, "Fuji has managed, with (F31fd's) sensor and processor combination, closer than ever before to 'SLR-like' output from a compact camera (http://www.dpreview.com/reviews/fujifilmf31fd/page15.asp), when its compared with the SLR Nikon D50 (6.24MP, 23.7 x 15.6 mm sensor, +1k USD)

Images of 6.3MP resolution in the A4 size produce 260dpi, which is below Finereader's minimum requirements, but as I said, I wonder if this dpi is really not enough.

Finereader has 300dpi as the only (afaik) requirement that the image needs to have in order to perform accurate OCR.
Obviously, Finereader can't "see"/OCR too big fonts or too small fonts in the image and there must be an optimum font size or, better, range that OCR success rate is highest. But why it doesnt have specific pixel size, brightness, contrast, etc in the minimum requirements for accurate OCR?

In my opinion, Finereader not only needs the font in the captured image to be of specific size, but of specific quality as well (sharp image with good contrast, smooth boundaries of the lines, etc). We need to know more parameters in order to achieve 100% OCR rate (99.2% OCR rate is not enough, because it means 600 mistakes that need to be manually corrected in a 300 pages book, thats looks like a lot of job)

user
10-21-2007, 07:05 PM
Font size or Font quality or both?

Note:
document = the page of the book with the printed text
image = the capture of the document, either with a scanner (scan), either with a camera (photo)

Font size on the image:
The font size in the image depends on the font size in the document and the resolution that we use to capture the image.

Font size on the document:
It can be measured in points (http://en.wikipedia.org/wiki/Point_(typography))
1 point = 0.3527 mm
Usualy fonts are 5-22p in printed text pages

As bigger the font is, the lower resolution is needed to produce the same font size in the captured image (I suppose we can make a graph or the ratio for "image resolution with which the page is captured" (x) and "font size in the printed page" (y)).
For example (I dont know the exact amounts), if a 20p font needs to be captured in 5MP resolution in order to have the optimum for OCR size in the image, then a 10p font needs to be captured in 10MP resolution in order to have the optimum for OCR size in the image.

So the optimum resolution varies and depends on the font size in the document. But why should we adjust resolution? Because we need the minimum resolution that gives the best OCR rate, because cameras with many megapixels have alot of noise and this is affecting the quality of the image (shooting in bigger resolution than needed, not only is overkill, not only is a waste of money for expensive camera, but it's also known that when MP increase, IQ decrease... etc+).

So we should shoot or scan to specific resolution according to the font size in order to achieve
however maybe there are not much differences between different shooting/scanning resolutions for normal and usual (10-20p) fonts. Maybe there is no advantage. Maybe the different resolutions havent so big difference (eg 0.0x megapixels). Also, maybe Finereader "normalizes" resolutions according to font size. These are just some thoughts to increase OCR accuracy and to find the ultimate camera for the job.

I wonder if Finereader needs more megapixels because it only needs big fonts, or it needs more pixels and better pixel quality as well, or all of these.

If Finereader needs more megapixels just for making fonts bigger, we should examine magnification (http://www.cambridgeincolour.com/tutorials/digital-photo-enlargement.htm).
One method is shooting at high resolution, so if we increase the resolution the image is magnified (shooting in 10MP and then in 12MP, the image size is increased by 10%).

Another solution is that if we have a low resolution captured image, we can magnify it with an image editing program in order to make the fonts in the image bigger.
So would only magnification be sufficient for achieving optimum font size in the image?
No, because image editing software magnification results in distortion of the image (and thus fonts), because it doesnt add pixels, it just magnify the current pixels:
http://shup.com/Shup/8495/107920225751.png
So we need software that will trim megapixels and produce accurate lines and circles, and the trimming will be in proportion with the given character line of the specific font type and size.
One method to achive this is interpolation. Any photoshop guru may help us for magnify-without-distorting methods, and any other software or interpolation methods to do this.

Another thing to consider is that fonts are usualy of specific type, eg arial, verdana, etc.
I dont know if modern books mention somewhere which font are typed in.
I also dont know if there is a way (a program, or some rules) to identify by ourselfs the font type that a book is typed in.

But if we know the font type and the font size, I suppose it would be easier to tell the OCR program what to compare with (since it should have the whole character set of the specigic font type in the specific size).

I suppose and hope Finereader already does automatically all of the above (identifies font type, font size, etc and compares to the given font type all the characters line, interpolates if necessary, adjusts contrast, brightness, etc)
But, Abby's support is fast, though abysmal and not technical at all, afaik, and Finereader doesn't come with more technical requirements in order to achieve fine images and hit high OCR rates. So I would really like to know if Finereader doesnt do any of the above mentioned solutions so that we can help it more.

user
10-22-2007, 01:31 AM
I just shoot a book page with my 2MP camera of my cellphone

the only problem I encountered is that corners are blurry and dark and there is a distortion as the book page doesnt lay flat

I am quite satisfied with it, I am almost sure that we don't need too many megapixels, but sharp lens (no blurry corners, barrel distortion and dark corners) and as less as possible page curve

the flat page problem:
a glass over a bookpage doesn't seem to flaten it enough
I wonder how the kirtas (http://www.kirtas-tech.com/) machine flatens the book pages (or how it corrects the distortion with a postprocess operation (http://www.kirtas-tech.com/ocr.asp))
what I think it does is that it shoots the photo when the bookpage is vertical:
http://www.kirtas-tech.com/APT_2400/GoldAnimationCell2.gif
this is the best position that the page is completely flat and there is no distortion at all

if we manage to reproduce this, to make some kind of photocopy stand that will be able to stabilize each bookpage in that position, the main (since all others seem easily correctable) problem of book scans will be solved

EDIT: some programs are supposed to fix the curve distortion of book pages, but I havent tried any yet (if needed I 'll post some links)

EDIT2: if we put a single book page between two glasses we will do the job nicely, however it will be very time consuming to do this for every page

but only books with soft cover have problem to be flaten, the hard cover books are easier to flat

ereszet
10-22-2007, 06:41 AM
OCR success rate of 99% is acceptable in a number of applications (like book scanning) and unacceptable in other applications (like bank documents).
For a "paperless office", which today in most cases is based on picture true copies and some keywords describing the documents, an index of 99% properly recognized words is more than one could wish for (and some of the remaining 1% words are just uncertain - not necessarily errors). For special applications (like musical notes) there are special OCR programs.

OCRing books is important for two different reasons:
either to save the result in a variety of text formats (i.e. Word or html) in order to work on the text later (e.g. to convert to lrf text format)
or to save the text under the page image in order to have a perfect copy of the original and to index the words.

In the first case 1% error rate does not prevent you from reading the book. As a matter of fact, the Discovery TV channel proves in their spots that you need only the first and last letters of the word in context to make the right associations in your brain. Gutenberg project books (see Websters encyclopedia of 1911) contain a lot misrecognized letters but are usable anyway.

In the second case, you are able to search thousands of books for some combination of words, even using wildcards, e.g. you can search for all the pages with words "photo scanning" or "photo* scan*") and the displayed pages will be exactly like the original. If you miss a page or two because of ocr errors, you can live with it. I use it for genealogical reseach even with 17th century books (where e.g. "s" is printed as "f"), and I can still find a lot of references to what I search for.

As for the ideal parameters for scanning before OCR, there is simply no such requirement or need. Twenty years ago, when OCR was based on simple pattern matching, there were very strict rules for the resolution and recognizable fonts. Today, OCR algorithms are much more sophisticated and can cope with a plethora of languages and font shapes. However, they cannot make an informed decision, whether an object (like a big title or a small icon or a musical note) is a picture or text. Therefore, warnings and errors are unavoidable. I stress it again that even the perfect 300 dpi picture of the text you convert from Word directly (without printing to paper and scanning) will have OCR errors and warnings. The only way to improve on that is to teach (train) the OCR program with the first few pages of the book, so that it can easily tell the difference between e.g. "n" and "h" in the fonts used to print the book.

Quality of scans (focus, uniform lighting, proper positioning) is more important than the resolution, whether you plan to OCR the scan or not. Finereader can cope with resolutions from 96 dpi up. The new Finereader 9 can automatically set the proper brightness of greyscale images for scanning but for the camera you need to set the parameters yourself. Because every book is different you should always take some trial photos of a few pages at different camera settings and preview the results (you may scale the picture up to see the details). If your eye is happy with the results, your OCR program should be happy as well.

Finally, as books are concerned, scanners are no match to photocopying, because of limited size, slow operation, deformation of scanned book pages, damage to books, etc.

BTW. While Finereader 9 allows for 2 Mpixels camera shots, it specifically says in its guide that cellphone cameras are not fit for the purpose. I wonder why you try all kind of approaches instead of using a solution with which I started this thread, and which I tested on tens of thousands of book and document pages. Just try it, even with the "quick and dirty" cardboard v-cradle.

user
10-22-2007, 11:58 AM
I am just trying to find the optimum camera for the job

firstly, I was about to buy an expensive 12MP camera, but now I am quite sure its overkill, MP is not an important factor (since even my 2MP cellphone camera does the job)

megapixels myth (http://www.google.gr/search?q=megapixels+myth) seems to have effect in OCR as well

I wonder if a good image quality 12MP camera will do it better or worse than a 6MP best best image quality. Also I wonder which specific criteria for image quality are most important for OCR (noise, artifacts, bluriness, dark corners):

http://bpribadi.multiply.com/photos/photo/13/1
http://bpribadi.multiply.com/photos/photo/13/2

also, I am trying to make a set of rules to increase OCR rate. For example, if Finereader usually mistakes n with h in a specific document, it should ask us to train it.

ereszet
10-22-2007, 07:28 PM
I am just trying to find the optimum camera for the job


You get what you pay for (camera) but the best things in life are free (DIY v-cradle). When you buy a camera after a long research, the next day you learn about a better one.

Some cameras come with a text/document mode. Do not fall for it but just note what are the settings they provide for that mode. E.g. my Casio EX-Z850 uses the following settings: EV Shift +1, white balance Auto, ISO Auto, Sharpness +1, Saturation 0, Contrast +1. Of all these parameters the only one worth considering when shooting a document in automatic mode is EV Shift (exposure compensation), which provides for brighter pictures. The other parameters can be adjusted by software in processing stage (modifying sharpness before OCR is not recommended). With my rather dim lighting setup I usually use EV between +1 and +2. Of course, when I set the parameters manually (e.g. ISO 50, Tv 2 seconds, Av 3.5) there is no need for exposure compensation.

As for the camera resolution, there is little room for myths. The higher is the resolution the camera sensor supports (without overcrowding the pixels) the better are the details of the picture. You need the details when shooting a document with very small print. Although small print comes usually in small size printed objects (like business cards) and the camera can be zoomed to fill the frame with the picture of the object (which means dpi equivalent of 600 or even 1200 dpi), there are some publications of bigger sizes with very small print.

An example of such a small print are instructions for the use of medicines provided by pharmaceutical companies which are difficult to read even with the strong glasses. I took pictures of such a leaflet at two different zoom levels to see what difference the magnification makes to the OCR. The higher is the zoom the higher is the dpi equivalent. The OCR error/warning rate changed from 2% to 1% with the two times higher zoom. So the resolution matters.

ereszet
10-22-2007, 07:47 PM
what I think it does is that it shoots the photo when the bookpage is vertical...this is the best position that the page is completely flat and there is no distortion at all...if we manage to reproduce this, to make some kind of photocopy stand that will be able to stabilize each bookpage in that position, the main (since all others seem easily correctable) problem of book scans will be solved

You are trying to reinvent and complicate the wheel. To shoot a photo of a book page in vertical position you would have to place the camera parallel to the page while the other pages obstruct the view (as your picture of Kirtas repro stand shows). And no, page curvature is not a problem at all in a v-cradle with adjustable angle wings. You can set one of the wings almost vertically with the other one almost flat. The most difficult thing is to ensure uniform lighting, and that is what v-cradle helps to achieve. In most cases, the best angle between the v-cradle panel wings is 120 degrees.

user
10-23-2007, 11:56 AM
can you tell me please if there are other cameras apart from canon that can be remotely controled from the pc?

thanks

PS: unfortunately I got an email from Sony that says cybershots dont have such feature

ereszet
10-23-2007, 01:53 PM
can you tell me please if there are other cameras apart from canon that can be remotely controled from the pc?


Visit this page:
http://www.steves-digicams.com/digsoftware_cameractrl.html

Also, search that site for "Camera Control Software" to see discussions at some forums.

For Canon models see:
www.breezesys.com

I am aware of some Olympus cameras and some high end Nikon cameras, but I am quite happy with my Canon and PSRemote by Breezesys (original Remote Capture by Canon is no match for PSRemote), so I have not followed the subject for some time.

Studio717
10-23-2007, 04:13 PM
My Olympus C-7070 WZ works with the software Cam2Com, though it's PC only. :(

My particular camera model has been discontinued, but it's still listed as working with the software. I'd recommend finding the software you want to use, and then seeing what cameras are able to use it, if you're going to be buying a new camera anyway.

:2thumbsup

user
10-23-2007, 05:56 PM
thanks for your replies

I don't know which software I need, I suppose any software that can control the camera from the pc

however, I think camera needs to support such function as well

I am about to buy a Casio, Fuji, Pentax or Sony, not interested for others

ereszet
10-23-2007, 06:50 PM
I am afraid, you are out of luck, and none of those cameras can be operated remotely from the computer. If you want to stay with that choice, you should look for a camera with at least an infrared remote.

user
10-23-2007, 10:22 PM
neither Casio? your exilims arent remotely controled from pc?

ereszet
10-24-2007, 03:49 AM
No they aren't. But they fit in my pocket and have a long battery life. That's why I use them for my business trips, e.g. for photo scanning and e-mailing a signed agreement.

user
10-24-2007, 01:08 PM
any impressions about new finereader?

I found it to be less faster and didn't see any significant improvement on accuracy in comparison with 8.0

ereszet
10-24-2007, 02:09 PM
any impressions about new finereader?

I found it to be less faster and didn't see any significant improvement on accuracy in comparison with 8.0

I must get used to it to give a fair verdict but my first impression of Finereader 9 is a big disappointment.

Djvu files are not supported neither as input nor as output.

Saving of blocks (text and images) is possible for the selected page only (I will still have to use my external script to get rid of black lines and blobs for all the pages).

Cropping (added hastily in version 8) has not improved. Small window, rectangle with 4 lines that have to be adjusted separately (can't they learn from Picasa or Irfanview? - just one mouse movement), batch cropping does not take into account odd or even pages, preview is limited only to the selected page.

Processing window (where current processing info is displayed) cannot be put to background, so the Finereader window cannot be reduced to an icon in Windows menu bar.

But the most annoying thing is that it doesn't work most of the time. Opening a file takes so long that I try to cancel the operation, which again takes so long that I use Ctrl-Alt-Del. Is it just me? 512 KB of memory and 2.8 GHz is not enough? I attach a screenshot of an error message telling me of a timeout regarding some server. Is Finereader trying to connect to some server (on the net?) that is not responding and that halts the operation?

I hope that in a week or so I will find time to contact Finereader support to get a feedback from them. But for now, I use my old Finereader 8.

user
10-24-2007, 07:48 PM
Processing window (where current processing info is displayed) cannot be put to background, so the Finereader window cannot be reduced to an icon in Windows menu bar.

this can be easily done with an autohotkey (www.autohotkey.com) script I suppose


(I will still have to use my external script to get rid of black lines and blobs for all the pages)


I d be glad if you would like to share any time-saving or process-improving scripts

But the most annoying thing is that it doesn't work most of the time. Opening a file takes so long that I try to cancel the operation, which again takes so long that I use Ctrl-Alt-Del. Is it just me? 512 KB of memory and 2.8 GHz is not enough? I attach a screenshot of an error message telling me of a timeout regarding some server. Is Finereader trying to connect to some server (on the net?) that is not responding and that halts the operation?

But the most annoying thing is that it doesn't work most of the time. Opening a file takes so long that I try to cancel the operation, which again takes so long that I use Ctrl-Alt-Del. Is it just me? 512 KB of memory and 2.8 GHz is not enough?

I agree. It takes disproportionately too much time. T2050 Dual Core and 2GB RAM here.

I attach a screenshot of an error message telling me of a timeout regarding some server. Is Finereader trying to connect to some server (on the net?) that is not responding and that halts the operation?

I think this is the license server that needs to run while finereader runs in trial mode (NetworkLicenseServer.exe)
maybe your firewall blocks it

JSWolf
10-25-2007, 12:49 AM
It could be that Finereader 9 loads something that acts as a server, but is either not loading or not working.

user
10-30-2007, 05:39 PM
Omnipage (http://www.nuance.com/omnipage/) has introduced new technology for camera OCR (http://www.nuance.com/omnipage/professional/camera.asp)

anyone tested it?

EDIT: I just checked it, don't waste your time, Finereader is still the best :)

dbrotzen
11-14-2007, 06:21 PM
Minimum Requirements
. 2–megapixel sensor
. Variable focus lens (fixed–focus cameras, common in cell phones and hand–held devices, will usually produce images unsuitable for OCR)
Recommended Requirements
. 5–megapixel sensor
. Flash disable feature
. Manual aperture control or aperture priority mode
. Manual focusing
. An anti–shake system, otherwise the use of a tripod is recommended
. Optical zoom

Hello ereszet,

The info you provide is fantastic. I just discovered this thread. Thank you!

I travel a lot and imagine bringing along a pocketable digital cam to photo scan magazine articles, and later ocr them, enabling search. I contacted abbyy for any hardware recommendations beyond what you quote above, but they did not want to give any other advice. Previously I've used a C-pen to scan text. Slow and headings go amiss etc. Rather than being at my desk with a tripod and v-cradle, I imagine sitting aboard a plane, taking out the camera to free hand shoot a page of some text I'm reading, using the settings/features abbyy listed. I've got a few basic questions:

1. Capturing images with a handheld camera with image stabilization, (obviously the results wouldn't be nearly as good as with tripod and v-cradle) but do you think it could work? Have you tried hand held photo scanning and later OCR?

2. I wonder, would a wide-angle lens be a help or a detriment? I just noticed I had to be quite far away from the page when taking a picture from a book with a Sony DSC-200 I've got available and a wide angle lens would enable being closer to the page when taking the picture. I have not tried any OCR software.

Thank you for any feedback!

mlbspike
11-18-2007, 06:31 PM
A lot of practical information here, about the DIY option. I've been looking into this for a while. I knew Atiz was coming out with a smaller unit, at a more reasonable price. nearly $1600 is not cheap, but for a pre-built solution, it looks attractive ... the previous version was not only huge, but was nearer $4500. ereszet ... As someone with realworld experience, with building your own, and with the issues you've had to overcome, I'd be interested in your opinion of the product (obviously, there's only so much you can tell from the ATIZ website, without actually having a unit). For myself, as much as I like a DIY project, my time and patience is in limited supply currently, and the pricetag has finally come in to a very acceptable range ... regardless of the laughter from some who will no doubt call it a ripoff.
- Mike

Studio717
11-18-2007, 07:01 PM
mlbspike, if you can, please post your thoughts about the device once you receive it and have had a chance to use it. I'd like the set up, but I'm wary of spending that kind of money if a home-made set up will work almost as well.

I have one camera that I think would work fine with it, but in addition to the initial cost would be the cost of a second camera. :(

Studio717
11-18-2007, 07:02 PM
ereszet, have you posted the plans anywhere for the hinged-side cradle? My apologies if you have and I missed the link.

recycledelectron
12-12-2007, 09:03 PM
You've inspired me. I've bought a tripod, and plan to build a cradle over Christmas break (starting Dec 14, and running for 3 weeks -- I'm a college prof.)

Andy

recycledelectron
12-17-2007, 04:16 PM
OK...I built my cradle, bought a tripod, set up some lights, and tried my copy of Abbyy FineReader 8.0. I've got a Samsung S630 (6.0MP point-&-shoot) camera.

I get good results except on the bottom 2 to 5 lines of text. They seem to curve / warp, even with glass on top, and Abbyy does not handle that well.

I tried Atiz's free software to correct distortion from photographed books, but it seems to only work with both pages in book mode. (I think you have to have used the S/W to know what I mean.) Also, the lines it draws around the page are often going to take the whole left page, and only half the right page. I can not adjust the right border of the right page. Has anyone else had this problem?

Andy (the anarchist)

megalomania
01-09-2008, 03:03 PM
Greetings MobileRead community, and to ereszet in particular. I found this forum and thread via one of my periodic searches of the net for anything related to scanning books with digital cameras. For over 4 years now I have been digitizing my books on and off, and every few months I like to see if anyone else is doing the same thing. It is only within the past 1.5 years that this kind of information has started to proliferate on the Internet.

A little over a year ago I built my own V cradle based on a picture of the Atiz BookDrive DIY. It does not look as nice as the one ereszet built, but the principle is identical. I have images of the cradle and building instructions in a thread at my website: http://roguesci.org/theforum/showthread.php?t=1232 I actually posted about my book cradle starting on page 5 of the thread.

The important innovation of my cradle is a horizontally sliding base that moves the book every turn of the page to maintain the exact same distance from camera lens to paper surface. In close-up (macro) photography there is a very significant difference in the area photographed by moving an inch closer or farther away. Imagine a 1 inch thick book, where page 1 is the bottom of a triangle, and the camera is the peak of the triangle. Now picture an imaginary line of the triangle extending to the bottom of the book, to the last page. If your camera is very close, this new triangle will be 1-2 inches wider on the last page than on the first page. If you do not readjust your camera after each picture is taken, every time you turn a page the resulting surface of paper is father away from the lens, and the resulting text is smaller. The distance increase is only equal to the thickness of the book’s paper, but with hundreds of sheets of paper it adds up to a big difference.

For all practical purposes you can manually adjust your camera every 50-100 pages or so, but if you try to compile an ebook to look nice, you will discover that batch cropping the pages is difficult because you have slightly more margins to crop with every page. If you rapidly scroll down such a PDF document you can see the text get smaller, and smaller, and smaller. If you try to OCR the pages the problem really gets bad. Page 1 is perfectly centered and cropped, and it might be 305 dpi, but page 100 has wasted margins because it was much farther away, so it is only 260 dpi. That is quite a significant difference in resolution.

My original cradle used a drawer slide to move from side to side, but the sliding action is too jerky, it does not allow for the fine fraction of a mm movement of each turned page. I bought a linear slide, which is a rather expensive device that moves from side to side with minimal friction. Linear slides are made for precision fractional mm movements. There are usually a few being sold on ebay for around $50, which is a good deal considering the typically cost $200-$300 new from the manufacturers. Once I finish wiring up my lighting, I will finish the copy stand and camera mount, complete with clear glass platen. The final device should be a close approximation of planetary scanners like the BookDrive, but not for $4500.

On the link to the thread I gave above I believe I posted about calculating dpi from a digital camera. I am writing an ebook about camera scanning, so I have a considerably more thorough treatise on the meanings of DPI, PPI, megapixels, etc., and how it relates to a final image. In brief, if your digital camera has a paltry resolution of 1600 x 1200, which equals 1.92 pixels (would be labeled a 2 MP camera), each of the 1600 pixels corresponds to DPI. If you photograph a 1 inch wide piece of paper, you would have a resolution of 1600 DPI. This might work for a newspaper article, but books are not 1 inches wide, they typically range from 6 x 8 to 8 x 11 inches including margins. Since cameras have a ratio of 4 x 3 (1600/4 * 3 = 1200, hence 1600 x 1200), but books have a ratio somewhat less, you only need to consider the width part of the camera (1600 in my example) to photograph books sideways from top to bottom. An approximately 6 x 8 inch book with half inch margins can be zoomed in and close cropped to 7 inches, which still leaves a little safety room in the margins. Our 7 inch wide picture only has a resolution of 1600/7 = 228.6 DPI. This is good enough to read, not so good for OCR. If you fail to adjust your camera every so often and the later pages are farther away from the lens, say we are now digitizing the full 8 inches of the book by page 400, our resolution would be 1600/8 = 200 DPI.

Figuring out what resolution camera you need to scan books is entirely dependant on how big the book is. Indeed the more megapixels you have the higher your resolution will be when photographing the same area. However, digital cameras do not technically correspond exactly to DPI like a scanner does. The most important factor of a digital camera is the lens. The lens makes or breaks the book. When I first started digitizing books I bought a 6 MP handheld point-and-shoot camera that had settings for close-up shooting. I scanned a few books for a local bookstore that deals with very rare 19th century local history and genealogical books. Demand for these books was high because people wanted to look up their family history, but there were few of these books ever published, and the condition of the books does not allow them to being manhandled. Flatbed scanning was out of the question because the books would be damaged by laying them flat and pressing down.

A bit more about old books, overhead scanning by a digital camera is the best solution for books in bad condition, or that are very valuable. It goes without saying you can not razor valuable books and scan them in a sheetfed scanner, and getting a bound book flat on a flatbed scanner requires a great deal of spine crushing pressure. Books with a damaged spine should never be opened more than 100 degrees. As I was designing my book cradle I found professionally manufactured book cradles for libraries invariably have 90-100 degree angles. At 90-100 degrees the pages on one side of the book will be absolutely flat. Open a book and see for yourself; if it’s a hardbound book or thick paperback, open the left side to page 20 and the right side will be flat until you open the book past 90 degrees.

I am digressing from my point about lenses. My little digital camera did a very good job of digitizing the pages, until I went to OCR the pages. Much to my dismay I discovered the corners of nearly all the pictures were slightly blurry. As it turns out this is called spherical aberration, and is a common defect of cheap lenses. Consumer model handheld cameras are designed for casual portraits at distances from 5 feet and greater; a zoom lens is not made for precise corner to corner focusing because it is a general use component. These lenses are a Jack of all trades, master of none. My camera, and any consumer camera, is unable to keep both the center area and corners focused at the same time.

My solution was to move the camera back farther to increase the amount of wasted margin space, but this also lowers the effective DPI. Then I bit the bullet and bought a professional DSLR camera with a detachable lens, specifically a Rebel XT with a macro lens. Macro lenses are designed for close up corner to corner focusing. To put the importance of the lens into perspective, my macro lens costs more than your entire handheld digital camera. Even a low quality lens for a SLR camera costs as much as cheap digital cameras. The lenses on any handheld camera are only realistically able to utilize a maximum of 10 megapixels. I recommend everyone read about the megapixel myth before buying one of those 12+ megapixel camera. Without a high quality lens, as with SLR cameras, imaging sensors beyond 10 megapixels are just useless marketing fluff that trick consumers into thinking “my camera is better than your camera.”

Commercial units like the Atiz BookDrive and their new BookSnap, and industrial models like the Kirtas book scanners all use quality cameras with quality lenses. The BookDrive uses Rebel XT and XTi cameras with macro lenses as a matter of fact. This does not mean your little digital camera is useless, far from it, just don’t expect perfect ebooks just as good as the publisher can make. My top priority in digitizing books is OCR; I digitize scientific reference works so they can be searched for the rare snippet. I rather dislike reading ebooks of any kind, on any reader, I would much rather have a paper copy. I realize this position may not be popular with the members of this website. If human readability is your top priority, and OCR accuracy is only secondary, than a handheld digital camera is more than adequate.

In my early experiments with digital cameras I found 3 megapixels to be the minimum resolution to digitize 6 x 8 inch books and still be able to read the text. Because the lenses are so bad on low resolution cameras, like camera phones, digitizing a smaller area can still be difficult to read. According to my calculations I will need a 70 megapixel camera to do 600 DPI scans, two pages at once, of 8 x 11 books. I won’t be holding my breath for one of those anytime soon, not an affordable camera anyway.

I can go on and on about methods to digitize books with a camera, but I will spare everyone further reading. I reserve the numerous fine details for the ebook I am writing on the subject. It will be free and not copyrighted, and I suppose I will include an online version on my website as well. Taking the pictures is actually the easy part, post processing the images is the challenge. Post processing has to be fast and automated, otherwise you lose the time saving advantage using a digital camera gives you.

Incidentally, I read in another thread here, http://www.mobileread.com/forums/showthread.php?t=14475 , that some people were questioning why using a digital camera is a superior method. Overhead scanning is absolutely the single fastest method of scanning a book. I hate scanning books, it is the most boring, tedious, mind numbing task ever devised. Being able to digitize books in less time was just the motivator I needed to actually start scanning. A digital camera scans a page in a fraction of a second, compared to the 45-60 seconds of a scanner, or 3-4 seconds of a copy machine. However, there is more to scanning a page than just acquiring the image. On either a copy machine or flatbed scanner you have to flip the book over, turn the page, flip it over again, align the book on the glass, and then scan the page. This adds another 20-30 seconds per page, maybe more if you are not so agile. With a digital camera I can scan a book almost as fast as I can turn the pages. It takes me an average of 5 seconds to lift my glass cover, turn the page, lower the glass, and press the shutter release on my remote control. I will put a movie on and get into the zone, just turn the page and click. Using the camera is such a passive activity that it can become an easy routine, and I like to believe I am not wasting my life watching TV if I am scanning books at the same time. The constant flipping and aligning of books on a flatbed scanner requires more attention to what you are doing, and this is more like work. On a copy machine I doubt you can have the luxury of TV or music to distract you.

ereszet
01-16-2008, 02:48 PM
A little over a year ago I built my own V cradle based on a picture of the Atiz BookDrive DIY. It does not look as nice as the one ereszet built, but the principle is identical. I have images of the cradle and building instructions in a thread at my website: http://roguesci.org/theforum/showthread.php?t=1232 I actually posted about my book cradle starting on page 5 of the thread.

The important innovation of my cradle is a horizontally sliding base that moves the book every turn of the page to maintain the exact same distance from camera lens to paper surface.

Your experience and experiments have been very similar to mine. I am quite satisfied with the results I get and document/book scanning has been a routine thing for me for more than a few years now.

I have visited your thread at the site you indicated and looked at the sliding base photos but I cannot understand how the base moves the book at every turn of the page. I guess how Atiz BookDrive does it (cradle moves horizontally while the transparent cover moves only vertically) but I cannot see any cover mechanism in your solution.

As for my experience, I am using PSRemote program to control the operation of my camera from my laptop, so I can see the picture in advance on the screen and I can correct the position of the cradle a few millimiters every 20 or so book pages scanned (no sliding mechanism required).

ereszet
01-16-2008, 02:58 PM
ereszet, have you posted the plans anywhere for the hinged-side cradle? My apologies if you have and I missed the link.

Sorry for my late reply.
Unfortunately, I have made no plans of the cradle. I just told my friendly handyman what to do and I gave him approximate dimensions. You need to rely on the photos I had provided in my posts. The dimensions of my v-cradle are suited for A4 size of paper but a larger sheet of paper or book can be put in the v-cradle as well. If you are interested in exact dimensions of the cradle and its parts I can measure them and submit in a post.

Razi
03-03-2008, 08:40 AM
What about this? Costs less than £5.00 and is available in three sizes:

http://www.islamicgoodsdirect.co.uk/images/16RH31.jpg


http://www.collectingsf.com/bookworm/wp-content/uploads/2007/11/rehal2.jpg

This one for $12:

http://www.iqra.org/mmIQRA/Images/2840_500.gif

here:
http://www.iqra.org/index/items.asp?Cc=GIFT.RIHA&iTpStatus=0&Tp=&Bc=

All you need to add is a sheet of clear glass to smoothen the pages.

davidw89
03-09-2008, 07:54 PM
How much does this cost and where can i buy it?

moz
03-09-2008, 11:28 PM
I'm interested in the camera better than scanner aspects because I'm currently trying to buy an A3 scanner in order to scan a large number of magazines. I will have to buy a sheet of glass and see how well my camera works for this. If it does go well it will be cheaper to buy OCR software and build a stand than to buy a scanner, that's for sure. Even using my Canon 30D would probably be cheaper per page than a scanner with "free" software (assuming the camera only lasts 50,000 shots before the shutter needs to be replaced at ~$500, and the scanner lasts more than 10,000 pages).

I guess that a film camera (i.e. still photo film camera) can produce a better result (in terms of resolution and color) than a digital camera.

A digital camera will do substantially better than a film one for this task, largely because of the speed which which you can check and reshoot photos. Unless you work in a photo lab the time it takes to check a film exposure is at least a day.

A DSLR (the equivalent of most film cameras) will produce slightly better images than a film one for this purpose, but as your results show very clearly the problems are not with resolution or colour so much as distortion, noise and unevenness. Digital will almost certainly do better than film on all of those (there are b&w reproduction films that would probably do well but those are probably unobtainable in 35mm these days, I think you would need to use microfilm or large format cameras).

JSWolf
03-09-2008, 11:50 PM
Doesn't look like it would work all that well for a typical paperback book.

moz
03-10-2008, 12:18 AM
Doesn't look like it would work all that well for a typical paperback book.

I suspect that buying a sheet of slightly thicker glass the size of a trade paperback would help, if you're worried about holding the book in place and open. Or you could glue a sheet on at an angle to do the same thing. I vaguely recall that glass is pretty cheap if you just want a rectangular bit cut to size.

moz
03-10-2008, 04:42 AM
OK I downloaded the trial version of FineReader and I'm impressed. Even with slightly uneven lighting and no sheet of glass to flatten things out it did impressively well. On the book page I tried the only "error" was font sizing on a word in italics. It even managed a page from "The Lorax" with mostly extra non-text recognised, with that removed it was fine. Magazines not so well, lots of extra spaces in the middle of words although it did pretty well on the slanted fragment of the opposing page.

Thank you so much for your extensive reports and evaluations, ereszet, this is wonderfully useful stuff.

Overall, it's enough for me to ask for a refund on my as-yet undelivered A3 scanner and start shopping for a couple of sheets of glass. I think I'll be using my camera for this stuff. camera and schoolchildren, possibly, if I can persuade a friend that her desire for money matches my low wage offer :)

Edit: glass is cheap... 3 sheets (paperback, trade paperback and A3) cost me ~$30 including having the edges polished so they're not sharp. Given the labour costs the actual glass has to be just about free.

anvil
04-27-2008, 07:26 PM
Ereszet.

I enjoyed your article very much. It has been a big help in converting pdf's to editable text.

You mentioned that you would do a "how to" on converting poor quality pdf's to editable text if there was enough interest.


I would be very interested in this. Hope you can find the time for this tutorial\howto

anvil

megotrafigon
05-25-2008, 07:13 PM
(excuse me for hooking up here)
I've just discovered your interesting topic about using a camera instead of a scanner just while I'm trying it. My need is to digitalize old documents in historical archives, both as single sheets and books. The main need is for books since they are forbidden to photocopy. Unfortunately it is also forbidden to make use of a tripod and people guess for various devices to hold the camera tight on the documents laying on the work table. I builded a self-made device with a heavy metal base and a metal tree holding a horizontal arm to screw the camera to. I made some satisfying tests with a Nikon Coolpix L1 for single sheets, by shooting via USB cable with Nikon's Remote Control software. As for books things were more difficult due to the need to hold the book open at an angle of about 90°, so I shooted first the pair pages and then the odd ones, the photographed page lying on the table and the previous pages with the cover being held vertically by my odd hand while with my right hand I hit the key for shooting. Now unfortunately 1) I lost my L1 camera 2) Nikon has discontinued Remote Control for all models. So I'm looking for some camera with remote control sw and as you too said the only ones are Canon's.
Now I'd like to ask you if you have thought of photographing books just lying horizontally on the table and have some suggestions to improve such procedure
TIA

igorsk
05-26-2008, 02:06 PM
You can try this to steady your camera when a tripod is not available/not permitted.
http://www.ehow.com/how_2146191_make-string-tripod.html

scanner
07-15-2008, 09:39 AM
can anyone make a wiki of this vast post please ?

I mean, first what do I need, especialy what kind of lamps

second, what is the procedure to adjust my camera? no flash? what kind of white balance?

last, what is the afterwards procedure? how do I photoshop them to achieve better OCR results in finereader? or finereader will fix them all?

thank you

theomeier
09-26-2008, 01:22 AM
megalomania supposedlly has documented some building plans for a cradle.
Has anyone seen these plans and can provide the infos ?


The link quoted does not provide access and megalomania does not reply. Thus I need alternative information. Other sources and/or plans are appreciated.







........
A little over a year ago I built my own V cradle based on a picture of the Atiz BookDrive DIY. It does not look as nice as the one ereszet built, but the principle is identical. I have images of the cradle and building instructions in a thread at my website: http://roguesci.org/theforum/showthread.php?t=1232 I actually posted about my book cradle starting on page 5 of the thread.
........

Maggy
11-28-2008, 08:36 AM
I've never tried with a film camera, but my experience with those is that the film grain would probably spoil OCR results.

Please watch out for high megapixel claims especially on cheap camera's
-1st: their sensor has often less Mp but the built in software interpolates
-2nd: a cheap tiny 10 Mp sensor gets hot and produces speckling noise while more expensive 5Mp camera's have a larger and better quality sensor that will produce better results. Reports that even a 2Mp phone camera might give acceptable results might for a part be explained because these sensors are probably low energy and the pixels are not as much interfering with one another. I'm still convinced that this is not my personal favourite solution but it is worth a try if you already own such a camera phone. What's perfectly acceptable for some of us is rubbish for others.

I'm studying on the possibility to make a simple portable solution for use in archives and libraries. A kind of suitcase with fold up book stand, fold up camera monopod and battery fed permanent light that is not irritating other library visitors. Flash is in 99% of the cases strictly forbidden, wall sockets are in most cases not available. I'm thinking about something like taking off the TFT from a broken laptop that has still working backlight. I already tried this once, but after taking off the TFT the backlight refused to work as well. I did convert the backlight to 220V, much lighter weight than the entire laptop, but no longer working on batteries. The light quality hovever is the best you can get for this purpose.

All suggestions are welcome.

Instead of glass I'm thinking about using clear film fixed to the back of the book stand, glue a hard plastic edge on the front side and give it some spring hooks. I own some samples of very high quality plastics that should last very long even with high spring tension.

By the way, I have lots of PDF ebooks, looking like full picture facsimile with underlaying OCR text that's so full of errors that text searches hardly give any results. Downloaded from several different libraries and Google Books. I can't find any easy way to show just the text to make corrections. Nor do I know any easy way to redo the OCR job. Advice please!

And does anyone know any software more or less comparable to OCR but for handwritten text? I'm currently doing an awful lot of manual transcription.

KonikaMinolta Dynax/Maxxum 7D is the best camera I know for this kind of work. It's a bit heavy, but it has excellent hardware anti-shake, (most camera's have much worse software shake reduction), one of the best sensors ever made, performs excellent even free hand in low light conditions, easy both manual and automatic controls, software for PC control, interchangeable lenses and the best high resolution camera LCD money can buy. KonikaMiolta no longer exists as a camera builder so if you want one you'll have to find it second hand.

Maggy
12-02-2008, 09:45 AM
In most archives and libraries using flash is forbidden. Of course it is annoying for other visitors and they fear damage by the bursts of light to the paper. AFAIK there is hardly any UV or IR in xenon flash so it shouldn't do much damage.
I found a better solution to the horrible lighting conditions: portable LED lights with rechargeable batteries. The ones I bought were 15 Euro's, they have 30 ultra bright white LEDs, they burn up to 4 hours per charge.
I do need a way to diffuse the light however, they throw their rectangular bundle almost straight forward. And I do need to find the ideal places to avoid shining/mirroring spots on paper or glass/plastic.

xanlexian
12-02-2008, 01:43 PM
Maggy,

For diffusing the light -- I've got a 3-LED book light that is more like a bright searchligh! A small scrap of car-window tinting material fixed that right up, and gives it a not so "harsh" light. I had this stuff left over from a fiasco I went through trying to tint my truck windows (more like a solo Keystone Cops episode...)

I also use the stuff over my bedside digital clock, and just about anything I have with those annoying bright blue LEDs.

Simple solutions! Look for some car window tinting (or similar material)

tsttm
05-18-2009, 01:22 PM
This is a great thread. thanks for putting the time to sharing. It's a few months old, hopefully people are still monitoring it.

I found this great instructables that takes this idea a little further if anyone's interested. Definitely more work, but has some advantages (speed/accuracy).

http://www.instructables.com/id/DIY-High-Speed-Book-Scanner-from-Trash-and-Cheap-C/?ALLSTEPS#step0

One question, the set up you use to connect the Canon to your computer for remotely shutter etc...is it possible to control two cameras at once? Just wondering if there's a feature to trigger two cameras at the same time.

Also, since you say it will upload the photos immediately between shots..does this mean you don't requre any SD cards in the camera?

thanks!

ereszet
05-19-2009, 05:44 PM
It's been a long time since I visited this thread. I am glad it is still alive.

*http://www.instructables.com/id/DIY-High-Speed-Book-Scanner-from-Trash-and-Cheap-C/?ALLSTEPS#step0
Thiat link looks very interesting indeed. One day I must find time to read it thoroughly.

*One question, the set up you use to connect the Canon to your computer for remotely shutter etc...is it possible to control two cameras at once? Just wondering if there's a feature to trigger two cameras at the same time.

For that see http://www.atiz.com

*Also, since you say it will upload the photos immediately between shots..does this mean you don't requre any SD cards in the camera?

Well, immediately - by the way of speaking. It takes a few seconds to transfer a few megabytes from the camera to the computer. Just enough time to turn the page of the book. However, transfering all photos in batch from the SD card (Google Picasa does a good job at it) is a more convenient solution.

tsttm
08-03-2009, 12:36 PM
ereszet, can you elaborate a little when you say you save 'text under image' using finereader?

I'm just testing finereader now, so i'm only vaguely familiar with the settings...but are you saving text version & image version in the same file? just wondering how that works.

or does it mean you're simply saving a text file & an image file?

I'm curious how you work with your end products- are you reading the images using some image managing software & when you want to search/cut & paste, you go to your text version copy?

If you could just elaborate a little on how you work, it'd be great.

thanks!

coffeeneone
09-21-2009, 11:25 PM
Hey,

I just found this thread, and I want to say thank you for all the great information. I was just about to buy a plustek 3600 (http://www.plustek.com/product/book3600.asp) . I was wondering now if there is a manual for building that v-cradle? Forgive me if it's already posted and I missed it. Right now, I only have a nikon 8 mp coolpix. All I want to do is back up my paper back books and comics. 1. will this camera do the trick? 2. Is there a step by step manual from building the v-cradle to building a finished ebook?

Thanks for any help!

Juliette
09-27-2009, 01:38 PM
You can try this to steady your camera when a tripod is not available/not permitted.
http://www.ehow.com/how_2146191_make-string-tripod.html

Thank you a lot for the link, very useful!

kennyc
11-04-2009, 08:54 AM
Thanks for all this info and to Bryan for the crosslink.