Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Recipes

Notices

Reply
 
Thread Tools Search this Thread
Old 11-16-2010, 04:31 AM   #1
feodor
Junior Member
feodor began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Nov 2010
Location: Augsburg, Bavaria, Germany
Device: Kindle 3
German / Deutsch - Zeit Online broken

Hi,
does anyone happen to know why Zeit Online news download is broken?
Please see the file attached.

regards,
feodor
Attached Thumbnails
Click image for larger version

Name:	zeit_online_problem.JPG
Views:	142
Size:	21.7 KB
ID:	61279  
feodor is offline   Reply With Quote
Advert
Old 11-16-2010, 06:21 AM   #2
miwie
Connoisseur
miwie began at the beginning.
 
Posts: 76
Karma: 12
Join Date: Nov 2010
Device: Android, PB Pro 602
Try again. I managed to fetch "Zeit Online" just now w/o any problems.
miwie is offline   Reply With Quote
Old 11-16-2010, 06:45 AM   #3
feodor
Junior Member
feodor began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Nov 2010
Location: Augsburg, Bavaria, Germany
Device: Kindle 3
I tried serveral times. doesn't work for me.
feodor is offline   Reply With Quote
Old 11-16-2010, 06:57 AM   #4
EeeGrill
Junior Member
EeeGrill began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Nov 2010
Device: none
Its still broken.
In the log are many error messages like:
Quote:
Downloading
Fetching http://www.zeit.de/kultur/musik/2010...all&print=true
Failed to download article: Neu Delhi: Dutzende Menschen sterben in eingestürztem Haus from http://www.zeit.de/gesellschaft/zeit...all&print=true
Traceback (most recent call last):
File "site-packages/calibre/utils/threadpool.py", line 95, in run
File "site-packages/calibre/web/feeds/news.py", line 838, in fetch_article
File "site-packages/calibre/web/feeds/news.py", line 834, in _fetch_article
Exception: Konnte Artikel nicht abrufen. Mit -vv starten, um den Grund dafür zu sehen
EeeGrill is offline   Reply With Quote
Old 11-16-2010, 10:11 AM   #5
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by feodor View Post
Hi,
does anyone happen to know why Zeit Online news download is broken?
Please see the file attached.

regards,
feodor
1) The thumbnail image ("file attached") doesn't seem to show an error, it looks like a normal progress bar that isn't done yet.
2) The recipe downloads and completes for me.
3) The recipe doesn't seem to have much content, but I don't know how much content that site normally has.
4) I see errors during the download, but that does not necessarily mean there's a problem. Malformed pages can generate errors during download, or pages that have unsuitable content, such as video or audio or links to pdf files can also do this. It depends on how the recipe is written. It is also possible that the errors are a problem.
Starson17 is offline   Reply With Quote
Advert
Old 11-16-2010, 11:53 AM   #6
EeeGrill
Junior Member
EeeGrill began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Nov 2010
Device: none
Yes, the recipe downloads complete and result in a correct file,
but
in the file is only a table of contents with the names of the sections
and a short description of each section.
There are no articles at all.
The size of the file is now 90 KB and it was several MB.
EeeGrill is offline   Reply With Quote
Old 11-16-2010, 01:30 PM   #7
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by EeeGrill View Post
Yes, the recipe downloads complete and result in a correct file,
but
in the file is only a table of contents with the names of the sections
and a short description of each section.
There are no articles at all.
The size of the file is now 90 KB and it was several MB.
It sounds like the site has changed its format.
Starson17 is offline   Reply With Quote
Old 11-18-2010, 04:56 PM   #8
Artemis_A
Train reader
Artemis_A began at the beginning.
 
Posts: 10
Karma: 15
Join Date: Nov 2010
Device: Kindle3
I just tried to download the ZEIT again. Still with the same bad result as described by EeeGrill.
I checked the links in the recipe. They are all ok and obviously haven't changed on the ZEIT homepage. So that's not the cause. I also checked the div tags. They seem to be ok also. Anybody with more ideas??
Artemis_A is offline   Reply With Quote
Old 11-18-2010, 06:55 PM   #9
-Thomas-
Addict
-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.-Thomas- once ate a cherry pie in a record 7 seconds.
 
-Thomas-'s Avatar
 
Posts: 325
Karma: 1725
Join Date: Dec 2007
Location: Münster, Germany
Device: iRex iLiad v2
I had success by applying the following changes to the recipe:

Code:
--- /usr/share/calibre/recipes/zeitde.recipe    2010-11-12 21:33:30.000000000 +0100
+++ /tmp/zeitde.recipe  2010-11-19 00:58:10.000000000 +0100
@@ -11,7 +11,8 @@
 
     title = 'Zeit Online'
     description = 'Zeit Online'
-    language = 'de'
+    lang = 'de'
+    encoding = 'UTF-8'
 
     __author__ = 'Martin Pitt, Sujata Raman, Ingo Paschke and Marc Toensing'
The encoding is kind of hard-coded, but it works for me.

Last edited by -Thomas-; 11-18-2010 at 06:59 PM.
-Thomas- is offline   Reply With Quote
Old 11-18-2010, 07:52 PM   #10
Rod Laird
Junior Member
Rod Laird began at the beginning.
 
Posts: 7
Karma: 10
Join Date: Nov 2010
Device: Kindle
Die Zeit fix

Vielen Dank Thomas!

m f G aus Australien

Rod
Rod Laird is offline   Reply With Quote
Old 11-19-2010, 04:38 AM   #11
feodor
Junior Member
feodor began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Nov 2010
Location: Augsburg, Bavaria, Germany
Device: Kindle 3
First of all I want so say that I'm really surprised how supportive this community is. Thank you!

Your proposal didn't work for me, Thomas. I located the zeit recipe on my disk.
It's "C:\Program Files (x86)\Calibre2\resources\recipes\zeitde.recipe" for me.
I removed the line you marked with "-" and added the lines you marked with "+".
Was that correct?

I saved and tried again -> same failure.

Regards,
Andreas

PS: Is it possible that Zeit implemented some kind of "mass-query-prevention"? To keep us leechers away :-)

Last edited by feodor; 11-19-2010 at 04:42 AM.
feodor is offline   Reply With Quote
Old 11-19-2010, 05:01 AM   #12
miwie
Connoisseur
miwie began at the beginning.
 
Posts: 76
Karma: 12
Join Date: Nov 2010
Device: Android, PB Pro 602
I just successfully generated an epub using "zeitde.recipe".

Michael
miwie is offline   Reply With Quote
Old 11-19-2010, 07:47 AM   #13
EeeGrill
Junior Member
EeeGrill began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Nov 2010
Device: none
I applied the changes but the result is the same.
I still get errors like:
Quote:
Formel 1: Sebastian Vettel ist Weltmeister from Sport
http://www.zeit.de/sport/2010-11/vet...all&print=true
Traceback (most recent call last):
File "site-packages/calibre/utils/threadpool.py", line 95, in run
File "site-packages/calibre/web/feeds/news.py", line 838, in fetch_article
File "site-packages/calibre/web/feeds/news.py", line 834, in _fetch_article
Exception: Konnte Artikel nicht abrufen. Mit -vv starten, um den Grund dafür zu sehen

Parsing all content...
Parsing feed_1/index.html ...
Initial parse failed:
Traceback (most recent call last):
File "site-packages/calibre/ebooks/oeb/base.py", line 818, in first_pass
File "lxml.etree.pyx", line 2532, in lxml.etree.fromstring (src/lxml/lxml.etree.c:48270)
File "parser.pxi", line 1545, in lxml.etree._parseMemoryDocument (src/lxml/lxml.etree.c:71812)
File "parser.pxi", line 1417, in lxml.etree._parseDoc (src/lxml/lxml.etree.c:70608)
File "parser.pxi", line 898, in lxml.etree._BaseParser._parseUnicodeDoc (src/lxml/lxml.etree.c:67148)
File "parser.pxi", line 539, in lxml.etree._ParserContext._handleParseResultDoc (src/lxml/lxml.etree.c:63824)
File "parser.pxi", line 625, in lxml.etree._handleParseResult (src/lxml/lxml.etree.c:64745)
File "parser.pxi", line 565, in lxml.etree._raiseParseError (src/lxml/lxml.etree.c:64088)
XMLSyntaxError: Opening and ending tag mismatch: hr line 29 and div, line 30, column 7
EeeGrill is offline   Reply With Quote
Old 11-19-2010, 08:04 AM   #14
EeeGrill
Junior Member
EeeGrill began at the beginning.
 
Posts: 5
Karma: 10
Join Date: Nov 2010
Device: none
Hello miwie,

did you really get a complete epub?
How big is your epub?
My epub is approx. 100 KB.
This is only the table of contents.
EeeGrill is offline   Reply With Quote
Old 11-19-2010, 08:12 AM   #15
miwie
Connoisseur
miwie began at the beginning.
 
Posts: 76
Karma: 12
Join Date: Nov 2010
Device: Android, PB Pro 602
The resulting "zeit.epub" is approx. 5.7 MB in size. I did not check every article in it, but it looks like it is complete.

Michael
PS: I'm working with calibre 0.7.28 (WinXP)
miwie is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
"DIE ZEIT" im Online-Abo auch als ePub ewy Deutsches Forum 142 12-21-2011 07:41 AM
Deutsch-Deutsch (Bedeutungs-)wörterbuch bunique PocketBook 16 08-17-2010 05:06 PM
Biography Bronner, Franz Xaver: Ein Mönchsleben aus der empfindsamen Zeit German V1. 14-MAR-10 weatherwax ePub Books 0 03-14-2010 06:47 AM
Other Fiction Horváth, Ödön: Ein Kind unserer Zeit. v1 24 may 2009 german stahanovez Kindle Books 1 02-12-2010 07:05 AM
Other Fiction Horváth, Ödön: Ein Kind unserer Zeit. v1.1 24 may 2009 german stahanovez ePub Books 0 05-23-2009 07:33 PM


All times are GMT -4. The time now is 07:57 AM.


MobileRead.com is a privately owned, operated and funded community.