Old 07-18-2015, 11:53 AM   #16
roger64
Wizard
Posts: 2,625
Karma: 3120635
Join Date: Jan 2009
Device: Kindle PW3 (wifi)
Quote:
Originally Posted by Tex2002ans

Hyphenation: I prefer just typing "-" into the search box in Sigil/Calibre's Spellcheck list to get a list of every single word with a hyphen in it, and then go through the hyphenated words to see if I can spot any blatant errors. At least one pass with "Show misspelled words" on and one off.

.../...
The calibre editor's help includes a very nice "regex-function" tool that you can import. It gets rid of unwanted/incorrect hyphens by checking each candidate against the spelling dictionary. It's pretty efficient.

You'll find it in this chapter: http://manual.calibre-ebook.com/fr/function_mode.html

Of course, to tell Eins-tein from Einstein, and for other words that are not in the spelling dictionary, you'll still need the main spelling tool. Its table presentation makes them easy to spot.

It's very easy to test: just open an EPUB, introduce some faulty hyphens here and there, and ask it to clean up the mess. Once its work is done, it's fairly easy to take care of the few remaining "inconsistencies" mentioned above, since the vast majority of the mistakes have been eradicated.
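
For anyone curious about the idea behind it, here is a minimal standalone sketch in plain Python. It is not the function from the calibre manual: the name dehyphenate and the toy known_words set are made up for illustration, and a real run would rely on the editor's spelling dictionary instead.

```python
import re

def dehyphenate(text, known_words):
    """Join 'part-part' pairs only when the joined form is a known word."""
    def fix(match):
        joined = match.group(1) + match.group(2)
        if joined.lower() in known_words:
            return joined          # the hyphen was a leftover, drop it
        return match.group(0)      # unknown when joined: leave untouched
    # Allow stray spaces around the hyphen, as often seen after OCR
    return re.sub(r"(\w+)\s*-\s*(\w+)", fix, text)

# Toy word list standing in for a real spelling dictionary (assumption)
known = {"hyphenation", "dictionary"}
sample = "Eins-tein checked the hyphen-ation in a well-known dic- tionary."
print(dehyphenate(sample, known))
```

Note that "Eins-tein" is left alone because the toy word list doesn't contain the name, which is exactly the kind of leftover you then catch with the main spelling tool.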

Last edited by roger64; 07-18-2015 at 12:24 PM.
Old 07-18-2015, 06:30 PM   #17
BetterRed
null operator (he/him)
Posts: 21,731
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by Tex2002ans
Hyphenation: I prefer just typing "-" into the search box in Sigil/Calibre's Spellcheck list to get a list of every single word with a hyphen in it, and then go through the hyphenated words to see if I can spot any blatant errors. At least one pass with "Show misspelled words" on and one off.

I sometimes go a step further. The Words report in calibre's editor (Tools->Reports->Words) has an option to export the list to a CSV file.

I load the CSV into NirSoft's CSVFileView, which has menu options for things like font, size, use of half-shadow 'paper', etc. I find I can spot more anomalies by eyeballing the same list in different fonts, weights, etc. on 'shadow paper' than I can using the lightweight small font I use in the EPUB editors.
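
If you would rather narrow the exported list down before eyeballing it, something like this rough Python sketch would pull out only the hyphenated entries. The column layout (a header row, with the word in the first column) is an assumption on my part, so check it against your actual export; hyphenated_words and sample_csv are just illustrative names.

```python
import csv
import io

def hyphenated_words(csv_text, word_column=0):
    """Return the sorted, de-duplicated hyphenated words from a CSV word list."""
    reader = csv.reader(io.StringIO(csv_text))
    next(reader, None)  # skip the header row (assumed to be present)
    return sorted({
        row[word_column]
        for row in reader
        if row and "-" in row[word_column]
    })

# Made-up sample data; a real export may use different columns
sample_csv = "Word,Language,Times used\nwell-known,en,3\nmanager,en,5\nto-day,en,1\n"
print(hyphenated_words(sample_csv))
```

For a real file you would read its contents with open(...).read() and pass that in.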

Another use of Toxaris's EPUB Tools Search and Replace is to create a rule set that finds words which are not misspelled but are nevertheless the wrong word, e.g. 'manger/manager', 'toll/tool'. Aside: the ability to have different rule sets in EPUB Tools S&R is a boon.
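
The same "right spelling, wrong word" idea can be sketched in a few lines of Python. This is not how EPUB Tools stores or applies its rule sets; CONFUSABLE and flag_suspects are hypothetical names, and the two pairs are simply the examples above.

```python
import re

# Correctly spelled words that are often typos for another word (illustrative)
CONFUSABLE = {"manger": "manager", "toll": "tool"}

def flag_suspects(text, pairs=CONFUSABLE):
    """Return (word, likely intended word, surrounding context) for each hit."""
    hits = []
    for m in re.finditer(r"[A-Za-z']+", text):
        word = m.group(0)
        if word.lower() in pairs:
            # Keep about 20 characters either side so a human can judge the hit
            start = max(0, m.start() - 20)
            end = min(len(text), m.end() + 20)
            hits.append((word, pairs[word.lower()], text[start:end]))
    return hits

for word, suggestion, context in flag_suspects("The manger picked the wrong toll for the job."):
    print(f"{word!r} -> maybe {suggestion!r}: ...{context}...")
```

It only flags candidates for review; 'toll' on a toll road is perfectly fine, so the final call stays with the proofreader.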

You can exclude words from MS Word's standard dictionaries; see "Create an Exclusion Dictionary". So if you object to 'soiree' you can exclude that spelling, and then when you type it you'll get 'soirée'.

BR

Last edited by BetterRed; 07-18-2015 at 06:36 PM.
Old 07-19-2015, 06:45 AM   #18
crutledge
eBook FANatic
Posts: 18,301
Karma: 16078357
Join Date: Apr 2008
Location: Alabama, USA
Device: HP ipac RX5915 Wife's Kindle
ePub formatting

I have tried to use the save to ePub.

There are two things that don't seem to work:

1. All italics are lost.
2. Indentation of poetry is lost.

A CSS file with a list of fonts is attached to the ePub, but it does nothing.

Can I set some switch to maintain the formatting?

I think I need a "FineReader for Dummies."
Old 09-18-2015, 03:58 PM   #19
crutledge
eBook FANatic
Posts: 18,301
Karma: 16078357
Join Date: Apr 2008
Location: Alabama, USA
Device: HP ipac RX5915 Wife's Kindle
FineReader is great

FineReader beats anything else I have used. 300-page books can be done in a reasonable time. I have done a number of PDF-to-HTML conversions with a fair amount of success.

The only thing is that each PDF comes out differently...

Is there an optimal (good???) set of options and other settings? For instance, Font Style and Symbol Table are a real PITA.

Maybe someone who is an expert could publish a "FineReader to HTML/ePub for Dummies"!
Old 09-22-2015, 01:17 AM   #20
grumbles
Addict
Posts: 238
Karma: 1500000
Join Date: Nov 2009
Location: Toronto
Device: Pandigital Novel (Black), T-2 and 3, Nexus 7
ABBYY FineReader Sprint comes with Epson scanners. I have both versions 6.0 and 9.0. For the most part they work very well. I also have version 4 of the pro version, which was on the cover disc of PC Plus many years ago. It still works well, but I find the Sprint versions more convenient.

I convert the PDF to images. PFill works well in Windows and is free; pdftopng works in Linux and Windows but is a command-line program. I use Scantailor to clean up the images and then OCR with ABBYY. From OCR I go to RTF and into LibreOffice for more cleanup, then from there to HTM, and I do all my real work in Notepad++. ABBYY and Notepad++ are the ONLY reason that I still use Windows. XP continues to work just fine for all that I need it for.

This is a cheap and very effective way to work. The only expense was the scanner, which I had to buy anyway.

I was going to buy the pro version of ABBYY a while back, but they went to an activation-based system, so I decided to continue with Sprint. With the exception of Windows, I will not buy any product that requires activation.