10-18-2011, 06:56 PM | #31 | |
Groupie
Posts: 166
Karma: 5358
Join Date: Aug 2010
Location: Davis, CA
Device: Kindle 3
|
Quote:
... That being said, as a former graduate student and a researcher who reads and writes technical PDF's I can't say I am happy or not about what you can and can't do with them on a Kindle. I appreciate what they do for journals, printing, etc, but I do wish I could easily read them on my Kindle. I can imagine other tablets are a decent solution, even a DX may be a decent solution, but in the technical world color is becoming more common, and I personally hate reading off a backlit monitor. I really just wish the Kindle had a mechanism so that PDF's could be somewhat cropped to a specific size (you can do that via the zoom already) and then the page forward buttons would quickly scroll you to the bottom of the page and then the next page... similar to the way the page-up and page-down button works on a PC with Acrobat Reader. It isn't ideal, but I don't think PDF's should need converting to ebooks. I just wish they could be handled a little better. I hate having to scroll around with the arrow buttons and the page forward can have me skipping parts of pages that I don't want to be skipping. The color issue will have to wait until colored e-ink becomes an option if it ever does. By the way, I do find that turning the Kindle sideways and reading a PDF (6 in screen) is a reasonable method of reading a PDF. |
|
10-18-2011, 07:06 PM | #32 | |
Grand Sorcerer
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Quote:
So I usually have to decide between HTML with italics—but with pesky headers and footers to track down and remove (Acrobat). Or really nice, clean HTML with no pesky headers and footers, but no italics (PDFMasher). Both need regexed for paragraph fragments. Last edited by DiapDealer; 10-18-2011 at 07:08 PM. |
|
Advert | |
|
10-18-2011, 07:29 PM | #33 | |
Resident Curmudgeon
Posts: 73,896
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
PDF was never designed to have the information needed to convert it to another format and it never will. Basically, if you have a PDF, the only way to convert it is to pick a program to convert it and then A/B compare every single pixel/letter/punctuation/etc. and also do any format fixing that needs to be done. Then you'll have your conversion. There is NO program that can convert a PDF of any reasonable size error free. |
|
10-18-2011, 07:42 PM | #34 | |
Resident Curmudgeon
Posts: 73,896
Karma: 128597114
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
10-18-2011, 10:58 PM | #35 |
Junior Member
Posts: 3
Karma: 10
Join Date: Oct 2011
Device: Kindle
|
Forgive me if I missed something, but if PDF isn't the best format to convert from, what is? Is it better to convert from a Word format to .mobi or .epub?
|
Advert | |
|
10-18-2011, 11:06 PM | #36 | |
Treasure Seeker
Posts: 18,708
Karma: 26026435
Join Date: Mar 2010
Device: Kobo HD Glo, Kindles, Kindle Fires, Andriod Devices
|
Quote:
I use Word html as my source then import it into Calibre and convert to mobi and epub. |
|
10-19-2011, 07:15 AM | #37 | |
Junior Member
Posts: 6
Karma: 10
Join Date: Oct 2011
Device: Kindle 4
|
Quote:
Maybe you are doing a few things together that are helping to make a good conversion? I would be most grateful for any advice |
|
10-19-2011, 08:51 AM | #38 | ||
Wizard
Posts: 1,090
Karma: 6058305
Join Date: Sep 2010
Location: UK
Device: Kindle Paperwhite
|
Quote:
Quote:
I've found pdftohtml gives good results with some PDFs. Calibre and pdftohtml are both open source, so if you do decide to try and write something better, it might be worth having a look at how they do things. |
||
10-19-2011, 09:45 AM | #39 | |
Evangelist
Posts: 461
Karma: 956567
Join Date: Oct 2010
Location: Toronto, Canada
Device: Kindle Oasis 3
|
Quote:
|
|
10-19-2011, 01:04 PM | #40 | |
Treasure Seeker
Posts: 18,708
Karma: 26026435
Join Date: Mar 2010
Device: Kobo HD Glo, Kindles, Kindle Fires, Andriod Devices
|
Quote:
|
|
10-19-2011, 01:10 PM | #41 | |
Grand Sorcerer
Posts: 5,886
Karma: 464403178
Join Date: Feb 2010
Location: 33.9388° N, 117.2716° W
Device: Kindles K-2, K-KB, PW 1 & 2, Voyage, Fire 2, 5 & HD 8, Surface 3, iPad
|
zip
Quote:
|
|
10-19-2011, 01:39 PM | #42 | |
Treasure Seeker
Posts: 18,708
Karma: 26026435
Join Date: Mar 2010
Device: Kobo HD Glo, Kindles, Kindle Fires, Andriod Devices
|
Quote:
Code:
Do a S&R for Manual line breaks and replace with paragraph marks. MS Word it uses ^13 for a return, with wildcard box checked in the Search Box ^13([a-z]) = This checks for broken sentences ([a-zA-Z])^13 = This checks for broken sentences ([a-z])^13([A-Z]) = This checks for broken sentences Replace Box \1 and \2 if there is more then one bracket, add appropriate spaces as needed. [0-9]{1,}^13 = This checks for page numbers [0-9]{1,} = Second check for page numbers and OCR error where numbers replace letters. [A-Z]{3,} = Match Case checked, Replace 3, if needed for more word matches. I also use the Styles panel to make batch changes. Alot of back titles I buy have inconsistency when it comes to formatting this feature comes in handy to fix that quick. Highlighting a chapter heading and then click Clear formatting and clicking the appropriate style will really help it to take on the correct formatting you want. I also use Macros to make it alot faster! Last edited by Blossom; 10-19-2011 at 01:41 PM. |
|
10-19-2011, 02:41 PM | #43 |
Grand Sorcerer
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
For broken sentences in HTML, I use the following search regex:
Code:
([^.”":?’'!>—…)])</p>\s+<p[^>]*> Code:
\1 I don't trust it enough to blindly do a "Replace All" on a whole book, but I rarely have to intervene when stepping through a document an incident at a time. |
10-19-2011, 02:45 PM | #44 | |
Treasure Seeker
Posts: 18,708
Karma: 26026435
Join Date: Mar 2010
Device: Kobo HD Glo, Kindles, Kindle Fires, Andriod Devices
|
Quote:
What program does this work with? I've tried Notepad++ and Notepad2 and it can't find anything. Last edited by Blossom; 10-19-2011 at 02:48 PM. |
|
10-19-2011, 03:04 PM | #45 |
Grand Sorcerer
Posts: 27,546
Karma: 193191846
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
I use it mostly with Sigil and Komodo Edit. I like Notepad++ as a code editor, but it gives me fits when trying to use more complex, multi-line, regex S&R.
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
KINDLE DEAL: The Holy Bible: NKJV ($3.36 CANADA) | gospelebooks | Deals and Resources (No Self-Promotion or Affiliate Links) | 2 | 04-09-2011 12:07 PM |
Free Book (Kindle / Nook) - The Holy Bible | koland | Deals and Resources (No Self-Promotion or Affiliate Links) | 21 | 11-14-2010 01:51 PM |
Free Book (Kindle) - The Holy Bible | koland | Deals and Resources (No Self-Promotion or Affiliate Links) | 21 | 10-09-2010 10:31 AM |
Free Book (Kindle) - Holy Bible (GW) | koland | Deals and Resources (No Self-Promotion or Affiliate Links) | 0 | 10-04-2010 03:29 AM |
The search for the Holy Grail of reading lights continues | Bob Russell | News | 19 | 04-01-2009 01:24 PM |