View Single Post
Old 04-20-2024, 05:28 PM   #9479
Wolgan
Enthusiast
Wolgan began at the beginning.
 
Wolgan's Avatar
 
Posts: 31
Karma: 18
Join Date: Jun 2017
Device: Kindle Fire
[storiesonline.net] Invalid URLs picked up from RSS/Email notification

This isn't a bug or issue with FFF (I don't think), more with my particular email notification setup. For context, storiesonline.net publishes rss feeds with updates to series or stories you've bookmarked. I use feedrabbit.com to automatically route those rss updates to my email, then use the Get Story URLs from Email option in FFF to update stories.

My issues that it's recently started grabbing an extra url from the rss text (actual example: https://storiesonline.net/library/st...o.php?id=25311, which is a valid site url) that it interprets into "https://storiesonline.net/library/25311" (not a valid site url) somewhere in the process (may be just an overly greedy regex somewhere).

I first looked at just filtering any urls that start with "https://storiesonline.net/library/" but as best I can tell, the Reject URL list doesn't support wildcards.

Wondering if anyone could suggest another workaround to skip those urls? It's not a huge deal, the update just fails on those URLs, but since that site is really sensitive to download timing, it nearly doubles the amount of time FFF takes to do updates for their stories, which is already considerably long.

For reference, below is the raw body of the email I received for that example story, minus headers and with identifying info replaced with "redacted".

Spoiler:
Code:
Content-Type: text/html; charset="utf-8"
Content-Transfer-Encoding: quoted-printable

<!DOCTYPE html>
<html>
<head>
<meta http-equiv=3D"Content-Type" content=3D"text/html; charset=3Dutf-8">
</head>

<table cellspacing=3D"0" cellpadding=3D"8" border=3D"0" style=3D"width:100%=
; font-family:Arial,Sans-Serif; border:1px solid #ccc">
  <tr>
    <td style=3D"background-color:#f6f6f6; color:#888; border-bottom:1px so=
lid #ccc; font-family:Arial,Sans-serif; font-size:11px">
      <h3 style=3D"margin:0; font-family:Arial,Sans-serif; font-size:16px; =
font-weight:bold; color:#222">
        <a href=3D"https://storiesonline.net/library/my.php?list=3Dser">SOL=
: My Active Serials</a>
      </h3>
      <div style=3D"padding-top:3px">SOL My Active Serials: Updates to seri=
als that you have in your library on Storiesonline</div>
    </td>
  </tr>
  <tr>
    <td>
      <div style=3D"padding:2px">
        <div style=3D"padding-bottom:6px">
          <h3 style=3D"padding:0; margin:0; font-family:Arial,Sans-serif; f=
ont-size:16px; font-weight:bold; color:#222">
            <a href=3D"https://storiesonline.net/s/25311/rosencrantz-and">R=
osencrantz and Guildenstern by lordshipmayhem</a>
          </h3>
          <div style=3D"padding-top:2px; color:#888; font-family:Arial,Sans=
-serif; font-size:11px">
            by lordshipmayhem on
            Friday 19 April 2024 08:36 PM UTC-07
            | Tags:
            science-fiction
          </div>
        </div>

<div>
<p>Two old friends join the Armed Forces of the Confederacy. Meanwhile, Wil=
liam Whitefeather has one final training course to teach...</p><p style=3D"=
background-color: yellow">Added Chapters 31-33 (final)</p><p>[<a href=3D"ht=
tps://storiesonline.net/library/storyInfo.php?id=3D25311">More Info</a>]</p=
>
</div>

        <div style=3D"padding-top: 3px;">
          <span>Tags:</span>
          <ul style=3D"display: inline; list-style: none; margin: 0; paddin=
g: 0;">
            <li style=3D"display: inline-block; margin: 0 4px 2px 0; line-h=
eight: 24px; background-color: #f5f5f5; color: #999; padding: 0 8px; border=
-radius: 2px;">science-fiction</li>
          </ul>
        </div>
      </div>
    </td>
  </tr>
  <tr>
    <td style=3D"background-color:#f6f6f6; color:#888; border-top:1px solid=
 #ccc; font-family:Arial,Sans-serif; font-size:11px">
      You received this email because you set up a subscription at <a href=
=3D"https://feedrabbit.com/">Feedrabbit</a>.
      This email was sent to you at <a href=3D"mailto:redacted@gmail.com=
" target=3D"_blank" rel=3D"noopener">redacted@gmail.com</a>.
      <a href=3D"https://feedrabbit.com/subscriptions/redacted" target=3D"_bla=
nk" rel=3D"noopener">Unsubscribe or change your subscription</a>.
    </td>
  </tr>
</table>

</html><img width=3D"1px" height=3D"1px" alt=3D"" src=3D"http://email.feedr=
abbit.com/o/eJwUyVEKwyAMANDT1M8SbWKTDw9jarIJdQVXGLv92PdrxTQ3D1biHjdiIonhWWj=
DQ4U4RQdoBi5CbGbRqnvCHHpJkBAwAWwkBOuRhWtWV1PeGfOC4GZtVtV-r8c1wiyjzvtrr_d5fR=
aEx6j9_MsvAAD__2QjJv4">

Last edited by Wolgan; 04-20-2024 at 05:30 PM. Reason: clarity
Wolgan is offline   Reply With Quote