View Full Version : Problem with Catalog in Calibre


clockmaker
10-05-2010, 04:31 AM
I have tried about 5 times (with or without tags) to make a catalog of my rather large library; but without success. After about 2 hours of running I get the following error (only the last dozen lines given here):

Parsing file 'content/book_21442.html' as HTML
Forcing content/book_21442.html into XHTML namespace
Initial parse failed:
Parsing file 'content/book_21482.html' as HTML
Forcing content/book_21482.html into XHTML namespace
Initial parse failed:
Parsing file 'content/book_33595.html' as HTML
Forcing content/book_33595.html into XHTML namespace
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...

Exception <unknown>MemoryError: in <generator object gen at 0x75E2DF18> ignored
Exception <unknown>MemoryError: in <generator object __iter__ at 0x4E2F42B0> ignored
Python function terminated unexpectedly
(Error Code: 1)
sys.excepthook is missing
Traceback (most recent call last):
File "site-packages\calibre\ebooks\oeb\transforms\flatcss.py", line 122, in __call__
File "site-packages\calibre\ebooks\oeb\stylizer.py", line 191, in __init__
<unknown>MemoryError

This is without the tags and with the tags it is a similar error

Initial parse failed:
Parsing file 'content/book_21442.html' as HTML
Forcing content/book_21442.html into XHTML namespace
Initial parse failed:
Parsing file 'content/book_3311.html' as HTML
Forcing content/book_3311.html into XHTML namespace
Initial parse failed:
Parsing file 'content/book_23694.html' as HTML
Forcing content/book_23694.html into XHTML namespace
Initial parse failed:
Parsing file 'content/book_3727.html' as HTML
Forcing content/book_3727.html into XHTML namespace
Initial parse failed:
Parsing file 'content/Genre_sciencefiction.html' as HTML
Forcing content/Genre_sciencefiction.html into XHTML namespace
Merging user specified metadata...
Detecting structure...
Flattening CSS and remapping font sizes...
Python function terminated unexpectedly
(Error Code: 1)
Traceback (most recent call last):
File "site.py", line 103, in main
File "site.py", line 85, in run_entry_point
File "site-packages\calibre\utils\ipc\worker.py", line 107, in main
File "site-packages\calibre\gui2\convert\gui_conversion.py", line 59, in gui_catalog
File "site-packages\calibre\library\catalog.py", line 4683, in run
File "site-packages\calibre\ebooks\conversion\plumber.py", line 915, in run
File "site-packages\calibre\ebooks\oeb\transforms\flatcss.py", line 122, in __call__
File "site-packages\calibre\ebooks\oeb\transforms\flatcss.py", line 147, in stylize_spine
File "site-packages\calibre\ebooks\oeb\stylizer.py", line 203, in __init__
File "site-packages\calibre\ebooks\oeb\stylizer.py", line 103, in __init__
File "site-packages\lxml\cssselect.py", line 523, in css_to_xpath
File "site-packages\lxml\cssselect.py", line 645, in parse
File "site-packages\lxml\cssselect.py", line 961, in __init__
MemoryError
Error in sys.exitfunc:
MemoryError


The database file is about 14 MB. Is there a limit on the size of the library? It appears that Catalog data is all used within the Ram. I am running it on 4 core (Intel 9300) machine with 4 GB of Ram. OR am I doing something wrong.

Thanks for any help.

DoctorOhh
10-05-2010, 06:27 AM
I have tried about 5 times (with or without tags) to make a catalog of my rather large library; but without success.

What version of calibre are you using?

How big is a "rather large library" 600? 6000? 16000?

The database file is about 14 MB. Is there a limit on the size of the library? It appears that Catalog data is all used within the Ram. I am running it on 4 core (Intel 9300) machine with 4 GB of Ram. OR am I doing something wrong.

I just created a epub catalog using calibre 0.7.22, Windows XP sp3, 3.5 Gigs of memory with a Core 2 Duo processor.

My library has 4011 books, 959 Authors, 469 Series and 740 tags. It took about 30 minutes.

You might want to check out the first post (http://www.mobileread.com/forums/showthread.php?t=71143) of the Creating Catalogs sticky thread for information about catalogs and how to proceed from here.