|
|
#1 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Release: 9.1 [31 Jan, 2026] BUG
Release: 9.1 [31 Jan, 2026]
A major bug existed between versions 8.10 and 9.10, causing imported OPML recipes to include a lot of irrelevant content. The versions were messy and didn't crawl as cleanly as before. For example, version 8.10 worked fine, crawling only articles without any extra clutter. Additionally, there are issues with conversion. The new version encounters errors when converting multiple EPUBs to PDF simultaneously, but version 8.10 does not exhibit this problem. Please help fix it. Thank you. windows 11 |
|
|
|
|
|
#2 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,963
Karma: 29579516
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
There are no changes to recipe processing in calibre 9. As for conversion to PDF it will likely be an issue caused by Qt WebEngine being updated and not working well on your system. What is the actual error you get?
|
|
|
|
| Advert | |
|
|
|
|
#3 | |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Quote:
<opml version="1.0"> <head> <title>Feed Subscriptions</title> </head> <body> <outline title="Iceshi" text="Iceshi"> <outline text="Editorials, Editorial Opinions, Editorial News, The Hindu Opinion | The Hindu" title="Editorials, Editorial Opinions, Editorial News, The Hindu Opinion | The Hindu" type="rss" xmlUrl="https://www.thehindu.com/opinion/editorial/feeder/default.rss" htmlUrl="https://www.thehindu.com/opinion/editorial/"/> <outline text="National Review" title="National Review" type="rss" xmlUrl="https://www.nationalreview.com/feed/" htmlUrl="https://www.nationalreview.com"/> <outline text="The Atlantic" title="The Atlantic" type="rss" xmlUrl="https://feeds.feedburner.com/TheAtlantic" htmlUrl="https://www.theatlantic.com/"/> <outline text="Articles on Smashing Magazine — For Web Designers And Developers" title="Articles on Smashing Magazine — For Web Designers And Developers" type="rss" xmlUrl="https://www.smashingmagazine.com/feed/" htmlUrl="https://www.smashingmagazine.com/"/> <outline text="CBC - CBC | Top Stories News" title="CBC - CBC | Top Stories News" type="rss" xmlUrl="https://rss.cbc.ca/lineup/topstories.xml" htmlUrl="https://www.cbc.ca/news/?cmp=rss"/> </outline> </body> </opml> Hey boss, I've provided a test OPML file. Could you please give it a try? Thanks! A lot of the scraped content is messy and disorganized. Last edited by fengli; Yesterday at 01:32 AM. |
|
|
|
|
|
|
#4 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Version 8.10 excels at both epub Conversion PDF and OPML extraction, delivering clean articles without any extraneous elements.
Last edited by fengli; Yesterday at 01:42 AM. |
|
|
|
|
|
#5 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
[ATTACH]
Numerous disorganized elements, as shown in the figure, likely resulted from failure to remove other elements after scraping, leading to unclean and untidy article content. |
|
|
|
| Advert | |
|
|
|
|
#6 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
I suspect that the updated version of Qt WebEngine has a bug when batch processing EPUB to PDF.
|
|
|
|
|
|
#7 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Versions from 8.10 through 9.1 have consistently exhibited these two issues. I tested each update, but ultimately had to revert to version 8.10—though it lacks new features.
1. OPML article elements are captured in a messy, disorganized manner. 2. Batch conversion of EPUB to PDF frequently fails, while individual conversions work fine. Please thoroughly investigate these issues as they are highly disruptive. Thank you very much. |
|
|
|
|
|
#8 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,963
Karma: 29579516
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
batch conversion to PDF is working fine for me. Again, what actual error do you get and on what system? As for your OPML issues its likely a lxml update breaking readability. Neither of these have anything to do with calibre 9.
|
|
|
|
|
|
#9 |
|
creator of calibre
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 45,963
Karma: 29579516
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
|
And this fixes the OPML cleanup issue. As suspected it was because of an lxml change and happened in version 8.11 not version 9
https://github.com/kovidgoyal/calibr...f0ece7bd379a20 |
|
|
|
|
|
#10 |
|
want to learn what I want
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,850
Karma: 7945227
Join Date: Sep 2020
Device: none
|
calibre < 8.11
calibre current calibre < 8.11 calibre current Great find. What I found interesting is that calibre < 8.11 didn't fetch the first image of the example article: https://www.cbc.ca/radio/asithappens...068855?cmp=rss Thank you for the fix, Kovid. |
|
|
|
|
|
#11 | |
|
Resident Curmudgeon
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 81,914
Karma: 150266009
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Quote:
|
|
|
|
|
|
|
#12 | |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Quote:
|
|
|
|
|
|
|
#13 |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Is the problem not that the content cannot be captured。 It is that the captured content has many cluttered elements, such as various titles.
Last edited by fengli; Yesterday at 10:33 AM. |
|
|
|
|
|
#14 | |
|
Zealot
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 111
Karma: 39846
Join Date: Aug 2022
Device: PC
|
Quote:
|
|
|
|
|
|
|
#15 |
|
want to learn what I want
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() Posts: 1,850
Karma: 7945227
Join Date: Sep 2020
Device: none
|
|
|
|
|
![]() |
| Thread Tools | Search this Thread |
|
Similar Threads
|
||||
| Thread | Thread Starter | Forum | Replies | Last Post |
| Short Fiction Various: Past Masters 248, v1, 27 Jan 2026 | Pulpmeister | Other Books | 0 | 01-26-2026 09:01 PM |
| Short Fiction Various: Past Masters 247, v1, 18 Jan 2026 | Pulpmeister | ePub Books | 0 | 01-17-2026 07:47 PM |
| Short Fiction Various: Past Masters 246, v1, 5 Jan 2026 | Pulpmeister | Other Books | 0 | 01-05-2026 02:21 AM |
| Short Fiction Various: Past Masters 246, v1, 5 Jan 2026 | Pulpmeister | ePub Books | 0 | 01-05-2026 02:19 AM |
| Horror Various Authors: A Book of Ghosts 44. v1. 04 Jan. 2026 | GrannyGrump | Kindle Books | 0 | 01-05-2026 12:14 AM |