#1 |
Connoisseur
Posts: 97
Karma: 110
Join Date: Sep 2010
Device: Kindle Fire HD
|
PDF pages in a box
I have a PDF book that I want to convert to text, but each page of the book is in a text box, so it will not convert. How do I get around this?
Thank you |
#2 |
Sigil & calibre developer
Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
First you need to verify that the text is not a set of images. Open the PDF in something like Acrobat and see if you can copy the text. If you can, you can use a program to crop the pages. I don’t know what you would use on Windows.
You said you wanted to convert to text. Acrobat can save as text. Give that a try too. |
#3 |
Connoisseur
Posts: 97
Karma: 110
Join Date: Sep 2010
Device: Kindle Fire HD
|
Thank you for the reply.
Yes, it looks like a set of images. I saved the PDF in Adobe as a text file, but the txt file comes up blank in Notepad or WordPad. I tried "Select All" in Adobe, but it does not select the whole file; it only selects the one page where the cursor is, and if I copy and paste it into MS Word it comes up in the same text box! I want to get rid of the boxes and keep only the content in them, so I can format it as I want.
Last edited by Mamaijee; 01-10-2011 at 02:17 PM. |
#4 | |
Well trained by Cats
Posts: 30,884
Karma: 59840450
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
If the images are high quality, you might be able to OCR each and every one. |
|
#5 |
US Navy, Retired
Posts: 9,889
Karma: 13806776
Join Date: Feb 2009
Location: North Carolina
Device: Icarus Illumina XL HD, Kindle PaperWhite SE 11th Gen
|
As you have already stated, it is not a text box but an image that you're copying into MS Word. Since it is an image, calibre can't help you, but as mentioned, you may be able to run it through OCR software to convert it to text.
|