MobileRead Forums > E-Book Software > Calibre > Recipes

05-24-2025, 10:55 AM   #1
Villard
Need help with get_browser for La Croix recipe

Hello,
I'm building a recipe for the newspaper La Croix (https://www.la-croix.com/).
I have a subscription to this newspaper, but my recipe cannot fetch the articles reserved for subscribers.
I guess I need to use get_browser, but I haven't managed to get it working.

The articles are under the main URL https://www.la-croix.com, and the login URL looks like this (in plain text):
sso.la-croix.com/auth/realms/bayard/protocol/openid-connect/auth?scope=openid&state=4a8c2a5c6410a6a1cc85a3872615ed5c&response_type=code&approval_prompt=auto&redirect_uri=https%3A%2F%2Fwww.la-croix.com%2Fconnect%2Fkeycloak%2Fcheck&client_id=la-croix.com
The state parameter ("state=4a8c2a5c6410a6a1cc85a3872615ed5c") seems to be random and changes at every visit.
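
To make the structure of that login URL explicit, here is a small sketch that splits it into its parameters using only the Python standard library. The https:// scheme is an assumption (the URL above was pasted without one), and the state value is just the one from a single visit. Since state changes every time, this suggests the URL should not be hard-coded in the recipe; the recipe would have to follow the site's own redirect to obtain a fresh one.

```python
from urllib.parse import urlsplit, parse_qs

# Login URL as seen in the browser. ASSUMPTION: https scheme added;
# the state value comes from one visit and will differ on the next.
AUTH_URL = (
    'https://sso.la-croix.com/auth/realms/bayard/protocol/openid-connect/auth'
    '?scope=openid&state=4a8c2a5c6410a6a1cc85a3872615ed5c&response_type=code'
    '&approval_prompt=auto'
    '&redirect_uri=https%3A%2F%2Fwww.la-croix.com%2Fconnect%2Fkeycloak%2Fcheck'
    '&client_id=la-croix.com'
)

parts = urlsplit(AUTH_URL)
# parse_qs returns a list per key and decodes percent-encoding; keep the first value.
params = {key: values[0] for key, values in parse_qs(parts.query).items()}

print(parts.netloc)            # sso.la-croix.com
print(params['client_id'])     # la-croix.com
print(params['redirect_uri'])  # https://www.la-croix.com/connect/keycloak/check
print(params['state'])         # per-visit value, not reusable
```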

Thank you for any help you can give me.

Here is a sample of my current recipe:
Code:
from calibre.web.feeds.news import BasicNewsRecipe, classes
import re

class LaCroix(BasicNewsRecipe):
    title = 'La Croix'
    needs_subscription = True
    language = 'fr'
    remove_empty_feeds = True
    ignore_duplicate_articles = {'title', 'url'}
    reverse_article_order = True

    feeds = [
        ('International', 'https://www.la-croix.com/feeds/rss/international.xml'),
    ]

    keep_only_tags = [
        dict(name='div', class_='article-container article-container__columns'),
        dict(name='div', class_='article-content'),
    ]

    remove_tags = [
        dict(name='div', class_='read-also'),
        dict(name='div', section_='page-section'),
        dict(name='div', class_='tag-list'),
        dict(name='div', class_='list list--separator'),
        dict(name='div', class_='list-box'),
    ]

    calibre_most_common_ua = 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/80.0.3987.87 Safari/537.36'
05-25-2025, 02:52 PM   #2
Villard
As far as I can see, authentication on the site is done via OpenID Connect, which issues an access token. So I need to get this token.
I've tried getting it by downloading the cookies and loading them in get_browser(). It works, but the cookies expire very quickly.
So I need to find another solution.
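
One possible direction, as an untested sketch only: let the recipe's browser open the site's own login link, follow the redirect to the SSO page (which brings a fresh state with it), and submit the login form there, so that new cookies are obtained on every run. Everything specific in it is an assumption: LOGIN_URL is a hypothetical placeholder for whatever URL sits behind the site's "log in" link, the login form is assumed to be the first form on the SSO page, and the field names username and password are the Keycloak defaults, which the site may have changed. The try/except around the import is only there so the helper can be exercised outside calibre.

```python
try:
    from calibre.web.feeds.news import BasicNewsRecipe
except ImportError:
    # Outside calibre (e.g. when testing the helper alone), fall back to a stub.
    BasicNewsRecipe = object

# ASSUMPTION: hypothetical entry point; replace with the real URL behind
# the site's "log in" link.
LOGIN_URL = 'https://www.la-croix.com/connect/keycloak'


def keycloak_login(br, login_url, username, password):
    """Open the site's login link and submit the SSO form it redirects to."""
    # The site should redirect to sso.la-croix.com with a fresh state value.
    br.open(login_url)
    # ASSUMPTION: the login form is the first form on the page and uses
    # the default Keycloak field names.
    br.select_form(nr=0)
    br['username'] = username
    br['password'] = password
    # Submitting should redirect back to the redirect_uri and set session cookies.
    br.submit()
    return br


class LaCroix(BasicNewsRecipe):
    title = 'La Croix'
    needs_subscription = True
    language = 'fr'

    def get_browser(self, *args, **kwargs):
        br = BasicNewsRecipe.get_browser(self, *args, **kwargs)
        # username/password are the values entered in calibre for this recipe.
        if self.username and self.password:
            keycloak_login(br, LOGIN_URL, self.username, self.password)
        return br
```

If the SSO page builds or submits its form with JavaScript, this form-based approach would not work as written and the form fields would need to be inspected first.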
All times are GMT -4.