View Single Post
Old 11-02-2011, 02:54 PM   #8
scissors
Addict
scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.scissors ought to be getting tired of karma fortunes by now.
 
Posts: 241
Karma: 1001369
Join Date: Sep 2010
Device: prs300, kindle keyboard 3g
Quote:
Originally Posted by Serpentine View Post
Are you sure that Google Reader is not breaking that section of text out of the header as it has a formatting element in it? I have no idea how the reader aggregation works - but I have a feeling it might be doing something funny there.

It seems that the <br/> is problematic, the regex run on the sun site itself works just fine.
Yeah, I figured it's the <br/> myself. The reason It's via google is I tried a standard feed recipe and for some reason they often fail.

@starson when you say print the soup - I assume the soup is the input html and the created intermediate xhtml. I kinda get the idea of B.soup but the whole syntax etc is beyond me. I study your examples etc but even the way variables are declared is hard for a dummy like myself to get to grips with.

(plus the mrs moans about the amount of time i'm spending on it)
scissors is offline   Reply With Quote