Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre

Notices

Reply
 
Thread Tools Search this Thread
Old 08-10-2010, 06:21 PM   #31
Pushka
Wizard
Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.
 
Pushka's Avatar
 
Posts: 1,119
Karma: 1019140
Join Date: Oct 2009
Location: Australia
Device: kindle, Ipad, Iphone, Nexus and PPW
Quote:
Originally Posted by vastav View Post
Thanks for your kind words. If you were using the ePub tool, one thing I can think of is that the bounding rect wasn't big enough to fully encompass the page number, for the plugin to exclude it from conversion. I'd be happy to look at your PDF and provide you exact suggestion. You can email that to support at pdf2epub dot com or upload on this thread, if it is public domain.
You are welcome. I hope lots of people get on to it. One thing that will limit the usage is the high cost of Acrobat though.

Do I need to do this boundary marking for every page number that appears? I only did it for the first one.

I do have PDF edit, but I have to edit every page.
Pushka is offline   Reply With Quote
Old 08-10-2010, 06:34 PM   #32
hidari
MR Drone
hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.hidari ought to be getting tired of karma fortunes by now.
 
hidari's Avatar
 
Posts: 1,613
Karma: 15612282
Join Date: Oct 2007
Location: DRONEZONE
Device: PB360+, Huawei MP5, Libra H20
Quote:
Originally Posted by Pushka View Post
You are welcome. I hope lots of people get on to it. One thing that will limit the usage is the high cost of Acrobat though.

Do I need to do this boundary marking for every page number that appears? I only did it for the first one.

I do have PDF edit, but I have to edit every page.

ALSO, consider PDF transformer: http://pdftransformer.abbyy.com/


It is a standalone program from ABBYY....I use it often. It will OCR and transfer files from PDF-> Searchable PDF, RTF, HTML or TXT. Then you can use Calibre to transfer it to epub.

It cost about 100 USD but is much cheaper than Adobe if you are only looking for a PDF converter....
hidari is offline   Reply With Quote
Advert
Old 08-10-2010, 08:13 PM   #33
vastav
Member
vastav began at the beginning.
 
Posts: 18
Karma: 38
Join Date: Sep 2009
Location: San Francisco Bay Area
Device: none
Quote:
Originally Posted by Pushka View Post
You are welcome. I hope lots of people get on to it. One thing that will limit the usage is the high cost of Acrobat though.

Do I need to do this boundary marking for every page number that appears? I only did it for the first one.

I do have PDF edit, but I have to edit every page.
If you selected the 'set area as background on All Pages' option using ePub tool, you do not need to edit every page. However, since the first few pages are usually different than the rest of the pages, I would recommend repeating this for some content pages beyond cover in your document.

For people who do not have Acrobat, I do have an online conversion option on my site. However, this option misses the artifacts elimination step.
vastav is offline   Reply With Quote
Old 08-10-2010, 10:19 PM   #34
Pushka
Wizard
Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.
 
Pushka's Avatar
 
Posts: 1,119
Karma: 1019140
Join Date: Oct 2009
Location: Australia
Device: kindle, Ipad, Iphone, Nexus and PPW
Quote:
Originally Posted by vastav View Post
If you selected the 'set area as background on All Pages' option using ePub tool, you do not need to edit every page. However, since the first few pages are usually different than the rest of the pages,
Cant seem to find the option to do that? The thing I am trying to remove appears at the top of every page, so once I can do a generic delete, that will work for me!


Ah, I found it. I was using the Advanced Text tool but I see I can do this straight after the tagging process, which gives me a different type of edit tool, and a right click and was ableto set background to all pages! LEGEND!

Dammit, cant give you any more Karma just yet vastav til I spread the lurv!

Last edited by Pushka; 08-10-2010 at 10:40 PM.
Pushka is offline   Reply With Quote
Old 08-11-2010, 01:10 AM   #35
vastav
Member
vastav began at the beginning.
 
Posts: 18
Karma: 38
Join Date: Sep 2009
Location: San Francisco Bay Area
Device: none
Quote:
Originally Posted by Pushka View Post
Cant seem to find the option to do that? The thing I am trying to remove appears at the top of every page, so once I can do a generic delete, that will work for me!


Ah, I found it. I was using the Advanced Text tool but I see I can do this straight after the tagging process, which gives me a different type of edit tool, and a right click and was ableto set background to all pages! LEGEND!

Dammit, cant give you any more Karma just yet vastav til I spread the lurv!
Glad that you found the option. Thanks again for your encouragement. Here is some documentation that may be helpful in general - http://www.pdf2epub.com/help#conversion

Last edited by vastav; 08-11-2010 at 01:18 AM.
vastav is offline   Reply With Quote
Advert
Old 08-11-2010, 05:32 AM   #36
Pushka
Wizard
Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.Pushka ought to be getting tired of karma fortunes by now.
 
Pushka's Avatar
 
Posts: 1,119
Karma: 1019140
Join Date: Oct 2009
Location: Australia
Device: kindle, Ipad, Iphone, Nexus and PPW
Now it is perfect!
Pushka is offline   Reply With Quote
Old 08-11-2010, 04:21 PM   #37
vastav
Member
vastav began at the beginning.
 
Posts: 18
Karma: 38
Join Date: Sep 2009
Location: San Francisco Bay Area
Device: none
Quote:
Originally Posted by PatNY View Post
vastav, I originally did use that website. I tried it again last evening and it still resulted in a file with the paragraphs all run together. OK, so then I downloaded and installed the plugin into Acrobat 9 and the issue with the run-in paragraphs is still there.

The paragraphs in this pdf may not be formatted in a standard way, but when I do an intermediate conversion to RTF or HTML in Acrobat, they are all picked up correctly!

You can see for yourself as I am going to send you the ebook file by email so you can investigate what is going on with your methods.

--Pat
The PDF file had tagging issues - entire chapter was marked as a single paragraph which resulted into the problem described above. Simply deleting the tags from the PDF and letting the ePub converter regenerate tags fixed the issue in ePub conversion.
vastav is offline   Reply With Quote
Old 08-12-2010, 11:35 AM   #38
Dark123
Zealot
Dark123 doesn't litterDark123 doesn't litter
 
Posts: 112
Karma: 105
Join Date: Jan 2010
Device: Kindle 3 WiFi
The best way I have found is to make sure the ePub output file size is over 10mb, and then convert it from PDF to ePub, and then use the CSS and the 1 HTML file to edit wherever it needs editing (Make sure to remove header and footer). Use Notepad++ to edit out whatever is extra, like some header not being removed, or something should be capital etc.
You will need to learn how the stylesheet.css works and some CSS tags (It's not hard). Then use that HTML with the CSS and convert that to ePub again.
This is the best FREE way I know.
I think Adobe Acrobat can convert PDF to HTML really well, but you will need to pay for it.

Last edited by Dark123; 08-12-2010 at 11:38 AM.
Dark123 is offline   Reply With Quote
Old 08-12-2010, 02:03 PM   #39
PatNY
Zennist
PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.PatNY ought to be getting tired of karma fortunes by now.
 
PatNY's Avatar
 
Posts: 1,022
Karma: 47809468
Join Date: Jul 2010
Device: iPod Touch, Sony PRS-350, Nook HD+ & HD
Quote:
Originally Posted by vastav View Post
The PDF file had tagging issues - entire chapter was marked as a single paragraph which resulted into the problem described above. Simply deleting the tags from the PDF and letting the ePub converter regenerate tags fixed the issue in ePub conversion.
Just want to reiterate that first deleting the tags in the PDF document fixed the issue entirely with the pdf2epub conversion. Acrobat can re-generate tags automatically.

A big thank you to vastav for his assistance in solving the issue. He is extremely helpful, courteous, informative, and responsive.

I have to say now that Acrobat + pdf2epub seems to me to be the ideal solution for converting PDFs to epubs at the moment. It’s the quickest, easiest, most flexible and accurate way to do it. First off, pdf2epub can read bookmarks in a PDF ebook to create an accurate and workable metadata TOC. Many PDFs already contain bookmarks but if they don’t, Acrobat lets you automatically create them with one-click from its tags. And if the PDF doesn’t have existing tags you can have Acrobat scan the document to automatically create them too. This doesn’t work for all pdf ebooks. I have one that is so garbled in terms of structure that the auto-generated tags and bookmarks are useless (garbage in, garbage out). But most pdf ebooks I have looked at are pretty clean and usable for auto-generating bookmarks (if they don’t already have them). Second, in the worst case scenario, where you have to manually create bookmarks, Acrobat’s bookmark system is pretty easy to use. I found it much easier and faster to mark sections/chapters this way than in Sigil.

The biggest problem for me with all the other conversion methods I tried was the inaccurate paragraphs breaks. The Acrobat + pdf2epub solution not only is best at producing accurate breaks, but is the easiest way to get a workable metadata TOC at the same time.

Unfortunately Acrobat does cost a lot, but as I mentioned, one might be able to get an older lower-cost copy off of eBay. Or maybe you know someone who has an older copy to spare. If you can’t get it, you can at least go to the pdf2epub website and use their online converter to run your PDFs through. This would at least give you the benefit of accurate paragraph conversions and a workable TOC (if bookmarks already exist). To see if your pdf is already bookmarked, you can open it up in the free Adobe Reader and go to View>Navigation Panels>Bookmarks. You won’t be able to edit bookmarks with the free reader, but it will at least show you if they exist.
PatNY is offline   Reply With Quote
Old 08-12-2010, 03:41 PM   #40
AgiZ
Junior Member
AgiZ began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Aug 2010
Device: Nook
So in short, what did you guys find as the best way to convert PDFs into some ebook compatible format apart from the expencive Adobe Acrobat conversion?

Thx.
AgiZ is offline   Reply With Quote
Old 08-14-2010, 04:32 PM   #41
Agama
Guru
Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.
 
Agama's Avatar
 
Posts: 776
Karma: 2751519
Join Date: Jul 2010
Location: UK
Device: PW2, Nexus7
Quote:
Originally Posted by easyrider View Post
I found today a good way to convert PDF to ePUB
It's in 3 steps:

1. mobipocket creator: PDF -> html
2. mobipocket creator: html -> PRC
3. Calibre PRC -> ePub
Thank-you easyrider, this is very helpful. I hadn't come across Mobipocket Creator but it works really well. I take the html output from step 1 and manually edit it to match the conventions that I use for all e-books so that I can link in my standard style sheet. Then calibre works it magic and I have an excellent ePub version of the pdf in a much shorter time span than by using my previous methods.
Agama is offline   Reply With Quote
Old 08-15-2010, 08:42 PM   #42
SCBrazil
Junior Member
SCBrazil began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Aug 2010
Device: none
Hi,
I'm new to the forum. I live in Brazil and have never even seen an ereader. They are very expensive over here ($600 for Cool-er, $900 for PRS 600) but I have decided to invest in one as I have many textbooks for my profession in PDF but cannot read them on my PC.
Using this method of conversion, will I be able to satisfactorily read my books after conversion on either of these units? I don't mind spending time re-formatting manually to correct wrong line breaks.

Last edited by SCBrazil; 08-16-2010 at 06:16 AM.
SCBrazil is offline   Reply With Quote
Old 08-16-2010, 02:40 AM   #43
Agama
Guru
Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.Agama ought to be getting tired of karma fortunes by now.
 
Agama's Avatar
 
Posts: 776
Karma: 2751519
Join Date: Jul 2010
Location: UK
Device: PW2, Nexus7
My experience with PDF to ePub is that the result is very good for text based books, e.g. small format paperback novels, but not so good for PDFs that contain a lot of diagrams or a large page size with complex layouts, (a 6-inch screen cannot compete with an A4 textbook). Many readers have PDF capability built in so, before investing a significant sum of money, I would try to get a hands-on with one if you can and see if your text books display well.
Agama is offline   Reply With Quote
Old 08-16-2010, 06:20 AM   #44
SCBrazil
Junior Member
SCBrazil began at the beginning.
 
Posts: 6
Karma: 10
Join Date: Aug 2010
Device: none
Quote:
Originally Posted by Agama View Post
My experience with PDF to ePub is that the result is very good for text based books, e.g. small format paperback novels, but not so good for PDFs that contain a lot of diagrams or a large page size with complex layouts, (a 6-inch screen cannot compete with an A4 textbook). Many readers have PDF capability built in so, before investing a significant sum of money, I would try to get a hands-on with one if you can and see if your text books display well.
They are psychology textbooks and are mostly text only. There are some tables to show statistics but not many so I guess I should be okay.
Thanks very much for your help.
SCBrazil is offline   Reply With Quote
Old 08-16-2010, 01:18 PM   #45
AgiZ
Junior Member
AgiZ began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Aug 2010
Device: Nook
Indeed the conversion described above worked very well! Thank you.
1. Mobipocket Creator - PDF to html
2. Calibre - html to epub

About this:
Quote:
Originally Posted by SCBrazil View Post
Hi,
...They are very expensive over here ($600 for Cool-er, $900 for PRS 600).
Why not buy some cheaper device like Kindle or something like that? That works just as well, no?
AgiZ is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Convert a 2 column PDF into epub thorm42 Conversion 18 06-03-2014 05:37 AM
Using Calibre to convert pdf to epub varelov Calibre 2 10-15-2010 02:20 AM
Would it be better if I convert pdf into epub? fantasyvn Sony Reader 7 04-15-2010 07:43 AM


All times are GMT -4. The time now is 10:05 AM.


MobileRead.com is a privately owned, operated and funded community.