|09-11-2009, 06:32 AM||#1|
Join Date: Dec 2008
Device: Sony Reader PRS-505
I prefer to format my own ebooks using OpenOffice. Is there a quick and easy way of extracting the text from ebooks (as plain text or, preferably, as RTF)?
|09-11-2009, 06:41 AM||#2|
Join Date: Jul 2008
"Ebooks" is a rather broad term encompassing a metric crapload of formats. Calibre can convert many formats to and from many formats. Not all, and not all will look as dashing.
|09-11-2009, 07:14 AM||#3|
Join Date: Jan 2008
Device: Kindle 3|iPad 2|iPhone 4|Sony 600
At the moment I prefer to read PDFs on my iRex digital reader, so I have some of the same issues as you do.
How to extract the text depends on the format of your source files. First, you will need to remove any DRM. AFAIK this is not possible with .LRF (BBeB) files, but it is with many others, such as EPUB, PRC and LIT.
Then, with a DRM-free file, you can do a number of things. The easiest approach I've found is to open the file in Stanza (a reader application), copy everything, and paste it into OpenOffice. There are other ways to get at the text, but most often the source is a collection of HTML files, and with Stanza you get them all in one go. I haven't tried this with DRM'd files, but I doubt it will work.
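For the "collection of HTML files" case, here is a rough sketch of one of those other ways, assuming the book is a DRM-free EPUB (which is a ZIP archive of XHTML files). The names `epub_to_text` and `_TextCollector` are made up for illustration, and the files are read in alphabetical order rather than the book's real reading order, so chapters may come out shuffled for some books.

```python
import zipfile
from html.parser import HTMLParser


class _TextCollector(HTMLParser):
    """Collects text content, skipping anything inside <head>, <script> or <style>."""

    SKIP = {"head", "script", "style"}
    BLOCK = {"p", "div", "li", "h1", "h2", "h3", "h4", "h5", "h6"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag in self.BLOCK:
            # End of a block element: start a new line in the output.
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)


def epub_to_text(epub_file):
    """Return the plain text of every (X)HTML file in a DRM-free EPUB.

    Assumption: files are taken in sorted name order, not spine order.
    """
    chunks = []
    with zipfile.ZipFile(epub_file) as archive:
        names = sorted(
            name for name in archive.namelist()
            if name.lower().endswith((".html", ".xhtml", ".htm"))
        )
        for name in names:
            collector = _TextCollector()
            collector.feed(archive.read(name).decode("utf-8", errors="replace"))
            collector.close()
            chunks.append("".join(collector.parts).strip())
    return "\n\n".join(chunks)
```

Calling `epub_to_text("mybook.epub")` (a placeholder file name) gives a single string that can be saved as a .txt file and opened in OpenOffice. All formatting is lost, so this only covers the plain-text half of the question.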
calibre can also convert to a number of formats, but not as many as Stanza. As far as I remember, you can also convert directly to RTF in calibre, but the quality was not usable for me. Perhaps it is for you.
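If you want to try the calibre route without the GUI, calibre ships a command-line tool called `ebook-convert` that takes an input file and an output file and picks the output format from the file extension. Below is a minimal sketch of driving it from Python, assuming calibre is installed and `ebook-convert` is on your PATH. The helper names `build_convert_command` and `convert` are my own, not part of calibre.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(source, target_format="rtf"):
    """Build the ebook-convert command line for a DRM-free source file."""
    source = Path(source)
    # ebook-convert infers the output format from the target's extension.
    target = source.with_suffix("." + target_format.lstrip("."))
    return ["ebook-convert", str(source), str(target)]


def convert(source, target_format="rtf"):
    """Run the conversion and return the path of the output file."""
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found; is calibre installed and on PATH?")
    command = build_convert_command(source, target_format)
    subprocess.run(command, check=True)
    return command[-1]
```

For example, `convert("mybook.epub")` (a placeholder file name) should write `mybook.rtf` next to the original. Whether the RTF is usable will depend on the book, as noted above.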