![]() |
#1 |
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
![]()
Hello,
I got some longer books as .txt-files. These are quite large (1.5MB or more). Calibre can convert them to my preferred format (epub or lrf) but stops at a point. The remaining text will be discarded. Only a partial text is converted. Is this a limitation of the file format? I think not since I have bought some files originally in lrf that are even larger. Is there a point in the Calibre-Options I overlooked - or something else? I would really appreciate your help, since I would like to read the whole books. Hammerwell PRS-505 Last edited by Hammerwell; 06-13-2011 at 09:41 AM. Reason: solved |
![]() |
![]() |
![]() |
#2 | ||
Sigil & calibre developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
Quote:
Quote:
Please open a ticket at https://bugs.launchpad.net/calibre . Attach the file you are having trouble converting. Also, do a conversion and attach the conversion log. Bottom right of the window, where it says jobs, click it. select the job, click details. This way I can look into what's happening, it's easier to track issues using the bug tracker than here, and others can search for the issue if they run into it. |
||
![]() |
![]() |
Advert | |
|
![]() |
#3 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,939
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Can you see the parts of the book using the Notepad Editor? |
|
![]() |
![]() |
![]() |
#4 |
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
Thank you all for your reply.
Theducks had an interesting suggestion, so I played a little with the files. I used Open Office to convert one text file to .odt and tried to convert this with calibre to .epub. This went without a problem. In a next step I saved the .odt as .txt using OOo. The conversion of this .txt also failed to complete - but at a different point. The log said there is a error: "XMLSyntaxError: error parsing attribute name, line 2310, column 12" in the original .txt and an other error in the .txt converted over .odt: "XMLSyntaxError: Opening and ending tag mismatch: p line 1154 and li, line 1156, column 9" I would like to open an ticket, but unfortunately I an not free to publicize the files. Would it be enough if I copied the sentence around the unexpected end in a new text file and send these two? I believe these would not rise unwanted problems. I would also send the logs of the two convertions from .txt and the one from .odt too. Hammerwell |
![]() |
![]() |
![]() |
#5 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,939
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Does the file contain something that 'looks like' a tag but really isn't ![]() |
|
![]() |
![]() |
Advert | |
|
![]() |
#6 | ||
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
Quote:
Quote:
One ends as following in the epub: ###########snip####### on her thoughts... "... Manager?" "" "...="" pet="" then="">" ###########snip####### The original reads as follows: ###########snip####### played on her thoughts... "<... Manager?>" "<Of course. She is her manager.>" "<She isn't.... isn't a ... pet then?>" "<Um... no. She operates the colosseum. ###########snip####### Does this help? Hammerwell |
||
![]() |
![]() |
![]() |
#7 | |
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,939
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Opening start symbol tag < Closing start symbol tag > Confusion |
|
![]() |
![]() |
![]() |
#8 | ||
Sigil & calibre developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
Quote:
Quote:
|
||
![]() |
![]() |
![]() |
#9 | |
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
Quote:
![]() The automatics in the heuristics may be a problem since they have to be way more sophisticated. Could maybe the change from auto to plain in the formating txt input options solve this little problem automatically? Edit: It seems not. I can change the txt-formating options as I like, the conversion log always says "Auto detected formatting as markdown". Hm, this was apparently not the sole problem. In the log from the odt->txt-->epub conversion it says textile. ![]() Hammerwell Last edited by Hammerwell; 06-05-2011 at 05:20 AM. Reason: tried a little converting |
|
![]() |
![]() |
![]() |
#10 | |
Sigil & calibre developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
Quote:
Markdown and Textile formatting both allow for embedded html. When it detects the document as either of those it keeps the < and > as is so later they throw off the HTML parser. Plain and Heuristic formatting change < and > into entities so they are not interpreted as tags later. |
|
![]() |
![]() |
![]() |
#11 | ||
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
Sorry for the late answer, got other things to do.
Quote:
I will try this at the next occurrence. Since I now know about the solution, thanks to your help, this should be no problem. Quote:
Thank you both for this help. This problem is solved and the thread can be closed. [edit] Hm, no "SOLVED"-Button anywhere. Did I overlook it?[/edit] Last edited by Hammerwell; 06-13-2011 at 07:24 AM. |
||
![]() |
![]() |
![]() |
#12 |
Sigil & calibre developer
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 2,487
Karma: 1063785
Join Date: Jan 2009
Location: Florida, USA
Device: Nook STR
|
When you edit the post you should be able to edit the title. Put solved in the title.
|
![]() |
![]() |
![]() |
#13 | ||
Well trained by Cats
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 30,939
Karma: 60358908
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Changes made on the conversion screen, fine tune the overall preferences to what works for the specific book. Really $%^& the book conversion setting?: There is the button on the bottom that clears the Saved mess and restores it to the current preference setting. Quote:
|
||
![]() |
![]() |
![]() |
#14 | |
Junior Member
![]() Posts: 9
Karma: 10
Join Date: Jun 2011
Location: Germany
Device: PRS-505
|
Quote:
![]() Have a nice Pentecost. The WGT* was nice. * http://www.wave-gotik-treffen.de/english/programm.php |
|
![]() |
![]() |
![]() |
Tags |
angled brackets, large file, txt conversion |
|
![]() |
||||
Thread | Thread Starter | Forum | Replies | Last Post |
Large file convert | ejacevich | Calibre | 2 | 09-29-2010 08:51 PM |
all file conversions now failing | moransami | Calibre | 2 | 08-07-2010 06:23 PM |
How can i convert HTML or txt file to EPUB file ? | guguqiaqia | ePub | 7 | 05-28-2010 09:15 PM |
LARGE pdf file | taildragger-j3 | Sony Reader | 3 | 03-12-2010 08:48 AM |
No line breaks in TXT conversions - is it just me? | TMF | Calibre | 3 | 09-24-2009 02:46 PM |