Register Guidelines E-Books Search Today's Posts Mark Forums Read

Go Back   MobileRead Forums > E-Book Software > Calibre > Recipes

Notices

Reply
 
Thread Tools Search this Thread
Old 05-26-2011, 10:33 PM   #1
BRGriff
Connoisseur
BRGriff began at the beginning.
 
Posts: 55
Karma: 12
Join Date: May 2011
Location: Deland, Florida
Device: Kindle 3
GoComics After Merger - New "Feeds" ListWith the ensuing merger of GoComics and Comic

With the ensuing merger of GoComics and Comics.com only days away (June 1), I have worked on a new "Feeds" list that can be copied and pasted into you current GoComics recipe. In the list, the following have been done:

1. The Comics.com URL has been changed to GoComics.com

2. The syntax has been modified to be consistent for use on GoComics:
examples: Andy_Capp has been changed to Andycapp
Arlo&Janis has been changed to ArloandJanis
KitNCaryle has been changed to KitandCarlye
(I give these examples in the event I have missed something and you need to edit the list)

3. Other formatting changes have been made to make both lists compatible.

4. The two site lists were then merged and sorted from A to Z.

5. General comics and Editorial comics were kept as separate groups.

To use, download the attached text file and then copy and paste it into your current GoComics recipe in place of the existing feeds list. I would highly recommend you rename the file as "GoComics Merged" so you do not overwrite you existing GoComics recipe. To do this change the name in the "Title" portion at the top of the recipe. This is just a safety precaution until enough testing is done to verify all is working properly. I have downloaded over 50 randomly selected comics with no problems. A few comics still have not been put on the GoComics server, but they will be there soon.

You will need to go through the list and remove the "#" sign before the name of any comic you desire to download. Be sure to "Add/Update" the changes you make.

If you have questions, please don't hesitate to ask. I would have posted a full reworked recipe but did not want to impinge upon Starson17's copyrights and other hard work he put in in writing the recipe. Maybe he will find my labors of use and post a new recipe incorporating my work.

Have fun, laugh loudly and never shed a tear except in joy.
Attached Files
File Type: txt GoComics New Feeds List.txt (25.4 KB, 343 views)
BRGriff is offline   Reply With Quote
Old 05-27-2011, 02:53 PM   #2
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
It looks like comics.com will be merged into gocomics.com. Is there any reason not to eliminate the comics.com recipe and just revise gocomics.com to include all available strips?

I thought I'd wait to let it all settle before doing anything. If you have a final list, I'd suggest posting the entire recipe, rather than parts of it.
Starson17 is offline   Reply With Quote
Old 05-27-2011, 03:15 PM   #3
BRGriff
Connoisseur
BRGriff began at the beginning.
 
Posts: 55
Karma: 12
Join Date: May 2011
Location: Deland, Florida
Device: Kindle 3
That is what the attached "feeds" list IS: a compilation of ALL comic and editorial strips from GoComics AND Comics.com. I agree there is no longer a need for a separate Comics.com recipe since all strips from that site are incorporated in the above list.

I did not want to post a recipe since I would be copying all of YOUR code from the previous GoComics recipe and did not feel I had the right to do so. I can tell you the new list is working on 98% of the comics and should be fully functioning on all comics as of June 1st.
BRGriff is offline   Reply With Quote
Old 05-27-2011, 03:29 PM   #4
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Quote:
Originally Posted by brgriff View Post
That is what the attached "feeds" list IS: a compilation of ALL comic and editorial strips from GoComics AND Comics.com. I agree there is no longer a need for a separate Comics.com recipe since all strips from that site are incorporated in the above list.

I did not want to post a recipe since I would be copying all of YOUR code from the previous GoComics recipe and did not feel I had the right to do so. I can tell you the new list is working on 98% of the comics and should be fully functioning on all comics as of June 1st.
You've done the update work - post the full recipe. It's open source GPL anyway. Feel free to add your name and update the version number, too.
It really is easier for people to deal with the complete recipe. If you're sure it's ready to change, we can ask Kovid to update it. If you're still not certain what will happen on June 1, let's wait and make sure no other changes are needed before he updates the recipe for all.
Starson17 is offline   Reply With Quote
Old 05-27-2011, 03:50 PM   #5
BRGriff
Connoisseur
BRGriff began at the beginning.
 
Posts: 55
Karma: 12
Join Date: May 2011
Location: Deland, Florida
Device: Kindle 3
Starson17,

I think you are correct in suggesting waiting until after June 1st to post a new recipe. Things could change. I'll be keeping an eye on it.

The list I prepared was for your use in hopes it might assist in the posting of a new recipe when the time came. I was simply trying to be helpful and do some of the mundane work while leaving the coding to you. I can not at present write code myself, other than tinkering to meet my individual whims, and so I did not and do not feel justified in stealing another person's hard work and putting my name on it.

I do feel the list is good to go and accept that you or Kovid should write the new recipe when either of you feel the time is right.
BRGriff is offline   Reply With Quote
Old 06-07-2011, 09:02 PM   #6
Starson17
Wizard
Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.Starson17 can program the VCR without an owner's manual.
 
Posts: 4,004
Karma: 177841
Join Date: Dec 2009
Device: WinMo: IPAQ; Android: HTC HD2, Archos 7o; Java:Gravity T
Kovid:
Here's an updated GoComics recipe:
Spoiler:
Code:
#!/usr/bin/env  python

__license__   = 'GPL v3'
__copyright__ = 'Copyright 2010 Starson17'
'''
www.gocomics.com
'''
from calibre.web.feeds.news import BasicNewsRecipe
import mechanize, re

class GoComics(BasicNewsRecipe):
    title               = 'GoComics'
    __author__          = 'Starson17'
    __version__         = '1.06'
    __date__            = '07 June 2011'
    description         = u'200+ Comics - Customize for more days/comics: Defaults to 7 days, 25 comics - 20 general, 5 editorial.'
    category            = 'news, comics'
    language            = 'en'
    use_embedded_content= False
    no_stylesheets      = True
    remove_javascript   = True
    cover_url           = 'http://paulbuckley14059.files.wordpress.com/2008/06/calvin-and-hobbes.jpg'
    remove_attributes = ['style']

    ####### USER PREFERENCES - COMICS, IMAGE SIZE AND NUMBER OF COMICS TO RETRIEVE ########
    # num_comics_to_get - I've tried up to 99 on Calvin&Hobbes
    num_comics_to_get = 7
    # comic_size 300 is small, 600 is medium, 900 is large, 1500 is extra-large
    comic_size = 900
    # CHOOSE COMIC STRIPS BELOW - REMOVE COMMENT '# ' FROM IN FRONT OF DESIRED STRIPS
    # Please do not overload their servers by selecting all comics and 1000 strips from each!

    conversion_options = {'linearize_tables'  : True
                        , 'comment'           : description
                        , 'tags'              : category
                        , 'language'          : language
                        }

    keep_only_tags     = [dict(name='div', attrs={'class':['feature','banner']}),
                          ]

    remove_tags = [dict(name='a', attrs={'class':['beginning','prev','cal','next','newest']}),
                   dict(name='div', attrs={'class':['tag-wrapper']}),
                   dict(name='a', attrs={'href':re.compile(r'.*mutable_[0-9]+', re.IGNORECASE)}),
                   dict(name='img', attrs={'src':re.compile(r'.*mutable_[0-9]+', re.IGNORECASE)}),
                   dict(name='ul', attrs={'class':['share-nav','feature-nav']}),
                   ]

    def get_browser(self):
        br = BasicNewsRecipe.get_browser(self)
        cookies = mechanize.CookieJar()
        br = mechanize.build_opener(mechanize.HTTPCookieProcessor(cookies))
        br.addheaders = [('Referer','http://www.gocomics.com/')]
        return br

    def parse_index(self):
        feeds = []
        for title, url in [
                       (u"2 Cows and a Chicken", u"http://www.gocomics.com/2cowsandachicken"),
                       #(u"9 Chickweed Lane", u"http://www.gocomics.com/9chickweedlane"),
                       (u"9 to 5", u"http://www.gocomics.com/9to5"),
                       #(u"Adam At Home", u"http://www.gocomics.com/adamathome"),
                       (u"Agnes", u"http://www.gocomics.com/agnes"),
                       #(u"Alley Oop", u"http://www.gocomics.com/alleyoop"),
                       #(u"Andy Capp", u"http://www.gocomics.com/andycapp"),
                       #(u"Animal Crackers", u"http://www.gocomics.com/animalcrackers"),
                       #(u"Annie", u"http://www.gocomics.com/annie"),
                       #(u"Arlo & Janis", u"http://www.gocomics.com/arloandjanis"),
                       #(u"Ask Shagg", u"http://www.gocomics.com/askshagg"),
                       (u"B.C.", u"http://www.gocomics.com/bc"),
                       #(u"Back in the Day", u"http://www.gocomics.com/backintheday"),
                       #(u"Bad Reporter", u"http://www.gocomics.com/badreporter"),
                       #(u"Baldo", u"http://www.gocomics.com/baldo"),
                       #(u"Ballard Street", u"http://www.gocomics.com/ballardstreet"),
                       #(u"Barkeater Lake", u"http://www.gocomics.com/barkeaterlake"),
                       #(u"Basic Instructions", u"http://www.gocomics.com/basicinstructions"),
                       #(u"Ben", u"http://www.gocomics.com/ben"),
                       #(u"Betty", u"http://www.gocomics.com/betty"),
                       #(u"Bewley", u"http://www.gocomics.com/bewley"),
                       #(u"Big Nate", u"http://www.gocomics.com/bignate"),
                       #(u"Big Top", u"http://www.gocomics.com/bigtop"),
                       #(u"Biographic", u"http://www.gocomics.com/biographic"),
                       #(u"Birdbrains", u"http://www.gocomics.com/birdbrains"),
                       #(u"Bleeker: The Rechargeable Dog", u"http://www.gocomics.com/bleeker"),
                       #(u"Bliss", u"http://www.gocomics.com/bliss"),
                       (u"Bloom County", u"http://www.gocomics.com/bloomcounty"),
                       #(u"Bo Nanas", u"http://www.gocomics.com/bonanas"),
                       #(u"Bob the Squirrel", u"http://www.gocomics.com/bobthesquirrel"),
                       #(u"Boomerangs", u"http://www.gocomics.com/boomerangs"),
                       #(u"Bottomliners", u"http://www.gocomics.com/bottomliners"),
                       #(u"Bound and Gagged", u"http://www.gocomics.com/boundandgagged"),
                       #(u"Brainwaves", u"http://www.gocomics.com/brainwaves"),
                       #(u"Brenda Starr", u"http://www.gocomics.com/brendastarr"),
                       #(u"Brevity", u"http://www.gocomics.com/brevity"),
                       #(u"Brewster Rockit", u"http://www.gocomics.com/brewsterrockit"),
                       #(u"Broom Hilda", u"http://www.gocomics.com/broomhilda"),
                       (u"Calvin and Hobbes", u"http://www.gocomics.com/calvinandhobbes"),
                       #(u"Candorville", u"http://www.gocomics.com/candorville"),
                       #(u"Cathy", u"http://www.gocomics.com/cathy"),
                       #(u"C'est la Vie", u"http://www.gocomics.com/cestlavie"),
                       #(u"Cheap Thrills", u"http://www.gocomics.com/cheapthrills"),
                       #(u"Chuckle Bros", u"http://www.gocomics.com/chucklebros"),
                       #(u"Citizen Dog", u"http://www.gocomics.com/citizendog"),
                       #(u"Cleats", u"http://www.gocomics.com/cleats"),
                       #(u"Close to Home", u"http://www.gocomics.com/closetohome"),
                       #(u"Committed", u"http://www.gocomics.com/committed"),
                       #(u"Compu-toon", u"http://www.gocomics.com/compu-toon"),
                       #(u"Cornered", u"http://www.gocomics.com/cornered"),
                       #(u"Cow & Boy", u"http://www.gocomics.com/cow&boy"),
                       #(u"Cul de Sac", u"http://www.gocomics.com/culdesac"),
                       #(u"Daddy's Home", u"http://www.gocomics.com/daddyshome"),
                       #(u"Deep Cover", u"http://www.gocomics.com/deepcover"),
                       #(u"Dick Tracy", u"http://www.gocomics.com/dicktracy"),
                       (u"Dog Eat Doug", u"http://www.gocomics.com/dogeatdoug"),
                       #(u"Domestic Abuse", u"http://www.gocomics.com/domesticabuse"),
                       (u"Doodles", u"http://www.gocomics.com/doodles"),
                       (u"Doonesbury", u"http://www.gocomics.com/doonesbury"),
                       #(u"Drabble", u"http://www.gocomics.com/drabble"),
                       #(u"Eek!", u"http://www.gocomics.com/eek"),
                       #(u"F Minus", u"http://www.gocomics.com/fminus"),
                       #(u"Family Tree", u"http://www.gocomics.com/familytree"),
                       #(u"Farcus", u"http://www.gocomics.com/farcus"),
                       (u"Fat Cats Classics", u"http://www.gocomics.com/fatcatsclassics"),
                       #(u"Ferd'nand", u"http://www.gocomics.com/ferdnand"),
                       #(u"Flight Deck", u"http://www.gocomics.com/flightdeck"),
                       (u"Flo and Friends", u"http://www.gocomics.com/floandfriends"),
                       #(u"For Better or For Worse", u"http://www.gocomics.com/forbetterorforworse"),
                       #(u"For Heaven's Sake", u"http://www.gocomics.com/forheavenssake"),
                       #(u"Fort Knox", u"http://www.gocomics.com/fortknox"),
                       #(u"FoxTrot Classics", u"http://www.gocomics.com/foxtrotclassics"),
                       (u"FoxTrot", u"http://www.gocomics.com/foxtrot"),
                       #(u"Frank & Ernest", u"http://www.gocomics.com/frankandernest"),
                       #(u"Frazz", u"http://www.gocomics.com/frazz"),
                       #(u"Fred Basset", u"http://www.gocomics.com/fredbasset"),
                       #(u"Free Range", u"http://www.gocomics.com/freerange"),
                       #(u"Frog Applause", u"http://www.gocomics.com/frogapplause"),
                       #(u"Garfield Minus Garfield", u"http://www.gocomics.com/garfieldminusgarfield"),
                       (u"Garfield", u"http://www.gocomics.com/garfield"),
                       #(u"Gasoline Alley", u"http://www.gocomics.com/gasolinealley"),
                       #(u"Geech Classics", u"http://www.gocomics.com/geechclassics"),
                       #(u"Get Fuzzy", u"http://www.gocomics.com/getfuzzy"),
                       #(u"Gil Thorp", u"http://www.gocomics.com/gilthorp"),
                       #(u"Ginger Meggs", u"http://www.gocomics.com/gingermeggs"),
                       #(u"Girls & Sports", u"http://www.gocomics.com/girlsandsports"),
                       #(u"Graffiti", u"http://www.gocomics.com/graffiti"),
                       #(u"Grand Avenue", u"http://www.gocomics.com/grandavenue"),
                       #(u"Haiku Ewe", u"http://www.gocomics.com/haikuewe"),
                       #(u"Heart of the City", u"http://www.gocomics.com/heartofthecity"),
                       (u"Heathcliff", u"http://www.gocomics.com/heathcliff"),
                       #(u"Herb and Jamaal", u"http://www.gocomics.com/herbandjamaal"),
                       #(u"Herman", u"http://www.gocomics.com/herman"),
                       #(u"Home and Away", u"http://www.gocomics.com/homeandaway"),
                       #(u"Housebroken", u"http://www.gocomics.com/housebroken"),
                       #(u"Hubert and Abby", u"http://www.gocomics.com/hubertandabby"),
                       #(u"Imagine This", u"http://www.gocomics.com/imaginethis"),
                       #(u"In the Bleachers", u"http://www.gocomics.com/inthebleachers"),
                       #(u"In the Sticks", u"http://www.gocomics.com/inthesticks"),
                       #(u"Ink Pen", u"http://www.gocomics.com/inkpen"),
                       #(u"It's All About You", u"http://www.gocomics.com/itsallaboutyou"),
                       #(u"Jane's World", u"http://www.gocomics.com/janesworld"),
                       #(u"Joe Vanilla", u"http://www.gocomics.com/joevanilla"),
                       #(u"Jump Start", u"http://www.gocomics.com/jumpstart"),
                       #(u"Kit 'N' Carlyle", u"http://www.gocomics.com/kitandcarlyle"),
                       #(u"La Cucaracha", u"http://www.gocomics.com/lacucaracha"),
                       #(u"Last Kiss", u"http://www.gocomics.com/lastkiss"),
                       #(u"Legend of Bill", u"http://www.gocomics.com/legendofbill"),
                       #(u"Liberty Meadows", u"http://www.gocomics.com/libertymeadows"),
                       #(u"Li'l Abner Classics", u"http://www.gocomics.com/lilabnerclassics"),
                       #(u"Lio", u"http://www.gocomics.com/lio"),
                       #(u"Little Dog Lost", u"http://www.gocomics.com/littledoglost"),
                       #(u"Little Otto", u"http://www.gocomics.com/littleotto"),
                       #(u"Lola", u"http://www.gocomics.com/lola"),
                       #(u"Loose Parts", u"http://www.gocomics.com/looseparts"),
                       #(u"Love Is...", u"http://www.gocomics.com/loveis"),
                       #(u"Luann", u"http://www.gocomics.com/luann"),
                       #(u"Maintaining", u"http://www.gocomics.com/maintaining"),
                       (u"Marmaduke", u"http://www.gocomics.com/marmaduke"),
                       #(u"Meg! Classics", u"http://www.gocomics.com/megclassics"),
                       #(u"Middle-Aged White Guy", u"http://www.gocomics.com/middleagedwhiteguy"),
                       #(u"Minimum Security", u"http://www.gocomics.com/minimumsecurity"),
                       #(u"Moderately Confused", u"http://www.gocomics.com/moderatelyconfused"),
                       (u"Momma", u"http://www.gocomics.com/momma"),
                       #(u"Monty", u"http://www.gocomics.com/monty"),
                       #(u"Motley Classics", u"http://www.gocomics.com/motleyclassics"),
                       (u"Mutt & Jeff", u"http://www.gocomics.com/muttandjeff"),
                       #(u"Mythtickle", u"http://www.gocomics.com/mythtickle"),
                       #(u"Nancy", u"http://www.gocomics.com/nancy"),
                       #(u"Natural Selection", u"http://www.gocomics.com/naturalselection"),
                       #(u"Nest Heads", u"http://www.gocomics.com/nestheads"),
                       #(u"NEUROTICA", u"http://www.gocomics.com/neurotica"),
                       #(u"New Adventures of Queen Victoria", u"http://www.gocomics.com/thenewadventuresofqueenvictoria"),
                       #(u"Non Sequitur", u"http://www.gocomics.com/nonsequitur"),
                       #(u"Off The Mark", u"http://www.gocomics.com/offthemark"),
                       #(u"On A Claire Day", u"http://www.gocomics.com/onaclaireday"),
                       #(u"One Big Happy Classics", u"http://www.gocomics.com/onebighappyclassics"),
                       #(u"One Big Happy", u"http://www.gocomics.com/onebighappy"),
                       #(u"Out of the Gene Pool Re-Runs", u"http://www.gocomics.com/outofthegenepool"),
                       #(u"Over the Hedge", u"http://www.gocomics.com/overthehedge"),
                       #(u"Overboard", u"http://www.gocomics.com/overboard"),
                       #(u"PC and Pixel", u"http://www.gocomics.com/pcandpixel"),
                       (u"Peanuts", u"http://www.gocomics.com/peanuts"),
                       #(u"Pearls Before Swine", u"http://www.gocomics.com/pearlsbeforeswine"),
                       #(u"Pibgorn Sketches", u"http://www.gocomics.com/pibgornsketches"),
                       #(u"Pibgorn", u"http://www.gocomics.com/pibgorn"),
                       (u"Pickles", u"http://www.gocomics.com/pickles"),
                       #(u"Pinkerton", u"http://www.gocomics.com/pinkerton"),
                       #(u"Pluggers", u"http://www.gocomics.com/pluggers"),
                       #(u"Pooch Cafe", u"http://www.gocomics.com/poochcafe"),
                       #(u"PreTeena", u"http://www.gocomics.com/preteena"),
                       #(u"Prickly City", u"http://www.gocomics.com/pricklycity"),
                       #(u"Rabbits Against Magic", u"http://www.gocomics.com/rabbitsagainstmagic"),
                       #(u"Raising Duncan Classics", u"http://www.gocomics.com/raisingduncanclassics"),
                       #(u"Real Life Adventures", u"http://www.gocomics.com/reallifeadventures"),
                       #(u"Reality Check", u"http://www.gocomics.com/realitycheck"),
                       #(u"Red and Rover", u"http://www.gocomics.com/redandrover"),
                       #(u"Red Meat", u"http://www.gocomics.com/redmeat"),
                       #(u"Reynolds Unwrapped", u"http://www.gocomics.com/reynoldsunwrapped"),
                       #(u"Rip Haywire", u"http://www.gocomics.com/riphaywire"),
                       #(u"Ripley's Believe It or Not!", u"http://www.gocomics.com/ripleysbelieveitornot"),
                       #(u"Ronaldinho Gaucho", u"http://www.gocomics.com/ronaldinhogaucho"),
                       #(u"Rose Is Rose", u"http://www.gocomics.com/roseisrose"),
                       #(u"Rubes", u"http://www.gocomics.com/rubes"),
                       #(u"Rudy Park", u"http://www.gocomics.com/rudypark"),
                       #(u"Scary Gary", u"http://www.gocomics.com/scarygary"),
                       #(u"Shirley and Son Classics", u"http://www.gocomics.com/shirleyandsonclassics"),
                       #(u"Shoe", u"http://www.gocomics.com/shoe"),
                       #(u"Shoecabbage", u"http://www.gocomics.com/shoecabbage"),
                       #(u"Skin Horse", u"http://www.gocomics.com/skinhorse"),
                       #(u"Slowpoke", u"http://www.gocomics.com/slowpoke"),
                       #(u"Soup To Nutz", u"http://www.gocomics.com/souptonutz"),
                       #(u"Speed Bump", u"http://www.gocomics.com/speedbump"),
                       #(u"Spot The Frog", u"http://www.gocomics.com/spotthefrog"),
                       #(u"State of the Union", u"http://www.gocomics.com/stateoftheunion"),
                       #(u"Stone Soup", u"http://www.gocomics.com/stonesoup"),
                       #(u"Strange Brew", u"http://www.gocomics.com/strangebrew"),
                       #(u"Sylvia", u"http://www.gocomics.com/sylvia"),
                       #(u"Tank McNamara", u"http://www.gocomics.com/tankmcnamara"),
                       #(u"Tarzan Classics", u"http://www.gocomics.com/tarzanclassics"),
                       #(u"That's Life", u"http://www.gocomics.com/thatslife"),
                       #(u"The Academia Waltz", u"http://www.gocomics.com/academiawaltz"),
                       #(u"The Argyle Sweater", u"http://www.gocomics.com/theargylesweater"),
                       #(u"The Barn", u"http://www.gocomics.com/thebarn"),
                       #(u"The Boiling Point", u"http://www.gocomics.com/theboilingpoint"),
                       #(u"The Boondocks", u"http://www.gocomics.com/boondocks"),
                       #(u"The Born Loser", u"http://www.gocomics.com/thebornloser"),
                       #(u"The Buckets", u"http://www.gocomics.com/thebuckets"),
                       #(u"The City", u"http://www.gocomics.com/thecity"),
                       #(u"The Dinette Set", u"http://www.gocomics.com/dinetteset"),
                       #(u"The Doozies", u"http://www.gocomics.com/thedoozies"),
                       #(u"The Duplex", u"http://www.gocomics.com/duplex"),
                       #(u"The Elderberries", u"http://www.gocomics.com/theelderberries"),
                       #(u"The Flying McCoys", u"http://www.gocomics.com/theflyingmccoys"),
                       #(u"The Fusco Brothers", u"http://www.gocomics.com/thefuscobrothers"),
                       #(u"The Grizzwells", u"http://www.gocomics.com/thegrizzwells"),
                       #(u"The Humble Stumble", u"http://www.gocomics.com/thehumblestumble"),
                       #(u"The Knight Life", u"http://www.gocomics.com/theknightlife"),
                       #(u"The Meaning of Lila", u"http://www.gocomics.com/meaningoflila"),
                       #(u"The Middletons", u"http://www.gocomics.com/themiddletons"),
                       #(u"The Norm", u"http://www.gocomics.com/thenorm"),
                       #(u"The Other Coast", u"http://www.gocomics.com/theothercoast"),
                       #(u"The Quigmans", u"http://www.gocomics.com/thequigmans"),
                       #(u"The Sunshine Club", u"http://www.gocomics.com/thesunshineclub"),
                       #(u"Tiny Sepuk", u"http://www.gocomics.com/tinysepuk"),
                       #(u"TOBY", u"http://www.gocomics.com/toby"),
                       #(u"Tom the Dancing Bug", u"http://www.gocomics.com/tomthedancingbug"),
                       #(u"Too Much Coffee Man", u"http://www.gocomics.com/toomuchcoffeeman"),
                       #(u"Unstrange Phenomena", u"http://www.gocomics.com/unstrangephenomena"),
                       #(u"W.T. Duck", u"http://www.gocomics.com/wtduck"),
                       #(u"Watch Your Head", u"http://www.gocomics.com/watchyourhead"),
                       #(u"Wee Pals", u"http://www.gocomics.com/weepals"),
                       #(u"Winnie the Pooh", u"http://www.gocomics.com/winniethepooh"),
                       #(u"Wizard of Id", u"http://www.gocomics.com/wizardofid"),
                       #(u"Working Daze", u"http://www.gocomics.com/workingdaze"),
                       #(u"Working It Out", u"http://www.gocomics.com/workingitout"),
                       #(u"Yenny", u"http://www.gocomics.com/yenny"),
                       #(u"Zack Hill", u"http://www.gocomics.com/zackhill"),
                       (u"Ziggy", u"http://www.gocomics.com/ziggy"),
                       #
                       ######## EDITORIAL CARTOONS #####################
                       (u"Adam Zyglis", u"http://www.gocomics.com/adamzyglis"),
                       #(u"Andy Singer", u"http://www.gocomics.com/andysinger"),
                       #(u"Ben Sargent",u"http://www.gocomics.com/bensargent"),
                       #(u"Bill Day", u"http://www.gocomics.com/billday"),
                       #(u"Bill Schorr", u"http://www.gocomics.com/billschorr"),
                       #(u"Bob Englehart", u"http://www.gocomics.com/bobenglehart"),
                       (u"Bob Gorrell",u"http://www.gocomics.com/bobgorrell"),
                       #(u"Brian Fairrington", u"http://www.gocomics.com/brianfairrington"),
                       #(u"Bruce Beattie", u"http://www.gocomics.com/brucebeattie"),
                       #(u"Cam Cardow", u"http://www.gocomics.com/camcardow"),
                       #(u"Chan Lowe",u"http://www.gocomics.com/chanlowe"),
                       #(u"Chip Bok",u"http://www.gocomics.com/chipbok"),
                       #(u"Chris Britt",u"http://www.gocomics.com/chrisbritt"),
                       #(u"Chuck Asay",u"http://www.gocomics.com/chuckasay"),
                       #(u"Clay Bennett",u"http://www.gocomics.com/claybennett"),
                       #(u"Clay Jones",u"http://www.gocomics.com/clayjones"),
                       #(u"Dan Wasserman",u"http://www.gocomics.com/danwasserman"),
                       #(u"Dana Summers",u"http://www.gocomics.com/danasummers"),
                       #(u"Daryl Cagle", u"http://www.gocomics.com/darylcagle"),
                       #(u"David Fitzsimmons", u"http://www.gocomics.com/davidfitzsimmons"),
                       (u"Dick Locher",u"http://www.gocomics.com/dicklocher"),
                       #(u"Don Wright",u"http://www.gocomics.com/donwright"),
                       #(u"Donna Barstow",u"http://www.gocomics.com/donnabarstow"),
                       #(u"Drew Litton", u"http://www.gocomics.com/drewlitton"),
                       #(u"Drew Sheneman",u"http://www.gocomics.com/drewsheneman"),
                       #(u"Ed Stein", u"http://www.gocomics.com/edstein"),
                       #(u"Eric Allie", u"http://www.gocomics.com/ericallie"),
                       #(u"Gary Markstein", u"http://www.gocomics.com/garymarkstein"),
                       #(u"Gary McCoy", u"http://www.gocomics.com/garymccoy"),
                       #(u"Gary Varvel", u"http://www.gocomics.com/garyvarvel"),
                       #(u"Glenn McCoy",u"http://www.gocomics.com/glennmccoy"),
                       #(u"Henry Payne", u"http://www.gocomics.com/henrypayne"),
                       #(u"Jack Ohman",u"http://www.gocomics.com/jackohman"),
                       #(u"JD Crowe", u"http://www.gocomics.com/jdcrowe"),
                       #(u"Jeff Danziger",u"http://www.gocomics.com/jeffdanziger"),
                       #(u"Jeff Parker", u"http://www.gocomics.com/jeffparker"),
                       #(u"Jeff Stahler", u"http://www.gocomics.com/jeffstahler"),
                       #(u"Jerry Holbert", u"http://www.gocomics.com/jerryholbert"),
                       #(u"Jim Morin",u"http://www.gocomics.com/jimmorin"),
                       #(u"Joel Pett",u"http://www.gocomics.com/joelpett"),
                       #(u"John Cole", u"http://www.gocomics.com/johncole"),
                       #(u"John Darkow", u"http://www.gocomics.com/johndarkow"),
                       #(u"John Deering",u"http://www.gocomics.com/johndeering"),
                       #(u"John Sherffius", u"http://www.gocomics.com/johnsherffius"),
                       #(u"Ken Catalino",u"http://www.gocomics.com/kencatalino"),
                       #(u"Kerry Waghorn",u"http://www.gocomics.com/facesinthenews"),
                       #(u"Kevin Kallaugher",u"http://www.gocomics.com/kevinkallaugher"),
                       #(u"Lalo Alcaraz",u"http://www.gocomics.com/laloalcaraz"),
                       #(u"Larry Wright", u"http://www.gocomics.com/larrywright"),
                       #(u"Lisa Benson", u"http://www.gocomics.com/lisabenson"),
                       #(u"Marshall Ramsey", u"http://www.gocomics.com/marshallramsey"),
                       #(u"Matt Bors", u"http://www.gocomics.com/mattbors"),
                       #(u"Matt Davies",u"http://www.gocomics.com/mattdavies"),
                       #(u"Michael Ramirez", u"http://www.gocomics.com/michaelramirez"),
                       #(u"Mike Keefe", u"http://www.gocomics.com/mikekeefe"),
                       #(u"Mike Luckovich", u"http://www.gocomics.com/mikeluckovich"),
                       #(u"MIke Thompson", u"http://www.gocomics.com/mikethompson"),
                       #(u"Monte Wolverton", u"http://www.gocomics.com/montewolverton"),
                       #(u"Mr. Fish", u"http://www.gocomics.com/mrfish"),
                       #(u"Nate Beeler", u"http://www.gocomics.com/natebeeler"),
                       #(u"Nick Anderson", u"http://www.gocomics.com/nickanderson"),
                       #(u"Pat Bagley", u"http://www.gocomics.com/patbagley"),
                       #(u"Pat Oliphant",u"http://www.gocomics.com/patoliphant"),
                       #(u"Paul Conrad",u"http://www.gocomics.com/paulconrad"),
                       #(u"Paul Szep", u"http://www.gocomics.com/paulszep"),
                       #(u"RJ Matson", u"http://www.gocomics.com/rjmatson"),
                       #(u"Rob Rogers", u"http://www.gocomics.com/robrogers"),
                       #(u"Robert Ariail", u"http://www.gocomics.com/robertariail"),
                       #(u"Scott Stantis", u"http://www.gocomics.com/scottstantis"),
                       #(u"Signe Wilkinson", u"http://www.gocomics.com/signewilkinson"),
                       #(u"Small World",u"http://www.gocomics.com/smallworld"),
                       #(u"Steve Benson", u"http://www.gocomics.com/stevebenson"),
                       #(u"Steve Breen", u"http://www.gocomics.com/stevebreen"),
                       #(u"Steve Kelley", u"http://www.gocomics.com/stevekelley"),
                       #(u"Steve Sack", u"http://www.gocomics.com/stevesack"),
                       #(u"Stuart Carlson",u"http://www.gocomics.com/stuartcarlson"),
                       #(u"Ted Rall",u"http://www.gocomics.com/tedrall"),
                       #(u"(Th)ink", u"http://www.gocomics.com/think"),
                       #(u"Tom Toles",u"http://www.gocomics.com/tomtoles"),
                       (u"Tony Auth",u"http://www.gocomics.com/tonyauth"),
                       #(u"Views of the World",u"http://www.gocomics.com/viewsoftheworld"),
                       #(u"ViewsAfrica",u"http://www.gocomics.com/viewsafrica"),
                       #(u"ViewsAmerica",u"http://www.gocomics.com/viewsamerica"),
                       #(u"ViewsAsia",u"http://www.gocomics.com/viewsasia"),
                       #(u"ViewsBusiness",u"http://www.gocomics.com/viewsbusiness"),
                       #(u"ViewsEurope",u"http://www.gocomics.com/viewseurope"),
                       #(u"ViewsLatinAmerica",u"http://www.gocomics.com/viewslatinamerica"),
                       #(u"ViewsMidEast",u"http://www.gocomics.com/viewsmideast"),
                       (u"Walt Handelsman",u"http://www.gocomics.com/walthandelsman"),
                       #(u"Wayne Stayskal",u"http://www.gocomics.com/waynestayskal"),
                       #(u"Wit of the World",u"http://www.gocomics.com/witoftheworld"),
                             ]:
            print 'Working on: ', title
            articles = self.make_links(url)
            if articles:
                feeds.append((title, articles))
        return feeds

    def make_links(self, url):
        title = 'Temp'
        current_articles = []
        pages = range(1, self.num_comics_to_get+1)
        for page in pages:
            page_soup = self.index_to_soup(url)
            if page_soup:
                try:
                  strip_title = page_soup.find(name='div', attrs={'class':'top'}).h1.a.string
                except:
                  strip_title = 'Error - no Title found'
                try:
                  date_title = page_soup.find('ul', attrs={'class': 'feature-nav'}).li.string
                  if not date_title:
                      date_title = page_soup.find('ul', attrs={'class': 'feature-nav'}).li.string
                except:
                  date_title = 'Error - no Date found'
                title = strip_title + ' - ' + date_title
                for i in range(2):
                  try:
                    strip_url_date = page_soup.find(name='div', attrs={'class':'top'}).h1.a['href']
                    break #success - this is normal exit
                  except:
                    strip_url_date = None
                    continue #try to get strip_url_date again
                for i in range(2):
                  try:
                    prev_strip_url_date = page_soup.find('a', attrs={'class': 'prev'})['href']
                    break #success - this is normal exit
                  except:
                    prev_strip_url_date = None
                    continue #try to get prev_strip_url_date again
                if strip_url_date:
                  page_url = 'http://www.gocomics.com' + strip_url_date
                else:
                  continue
                if prev_strip_url_date:
                  prev_page_url = 'http://www.gocomics.com' + prev_strip_url_date
                else:
                  continue
            current_articles.append({'title': title, 'url': page_url, 'description':'', 'date':''})
            url = prev_page_url
        current_articles.reverse()
        return current_articles

    def preprocess_html(self, soup):
        if soup.title:
            title_string = soup.title.string.strip()
            _cd = title_string.split(',',1)[1]
            comic_date = ' '.join(_cd.split(' ', 4)[0:-1])
        if soup.h1.span:
            artist = soup.h1.span.string
            soup.h1.span.string.replaceWith(comic_date + artist)
        feature_item = soup.find('p',attrs={'class':'feature_item'})
        if feature_item.a:
            a_tag = feature_item.a
            a_href = a_tag["href"]
            img_tag = a_tag.img
            img_tag["src"] = a_href
            img_tag["width"] = self.comic_size
            img_tag["height"] = None
        return self.adeify_images(soup)

    extra_css = '''
                    h1{font-family:Arial,Helvetica,sans-serif; font-weight:bold;font-size:large;}
                    h2{font-family:Arial,Helvetica,sans-serif; font-weight:normal;font-size:small;}
                    img {max-width:100%; min-width:100%;}
                    p{font-family:Arial,Helvetica,sans-serif;font-size:small;}
                    body{font-family:Helvetica,Arial,sans-serif;font-size:small;}
		'''
Starson17 is offline   Reply With Quote
Old 02-01-2012, 08:33 PM   #7
bbishopski
Junior Member
bbishopski began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Feb 2012
Device: kindle
Gocomics Recipe

I couldn't get the comics recipe to work even with the URL changes posted by BRGriff. Not sure if it's calibre version or gocomics changed since then.

I was able to use the following recipe to go to gocomics.com and get the URL of the .gif file (which is hosted somewhere else) and then go fetch just the .GIF comic. The only problem is that for some reason calibre will put the .gif into your book twice (in 2 different sizes). Not sure why calibre does that, it is not the recipe itself.

I pulled in BRGriff's URLs.

Works for me anyway.
Attached Files
File Type: txt gocomics-recipe.txt (27.9 KB, 108 views)
bbishopski is offline   Reply With Quote
Old 07-25-2012, 06:23 PM   #8
sdg3
Junior Member
sdg3 began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Jul 2012
Device: none
Is there any way to configure the gocomic recipe to show certain strips in landscape format instead of portrait? Some strips look really small and barely readable on my smaller Sony 350 screen ...
sdg3 is offline   Reply With Quote
Reply

Thread Tools Search this Thread
Search this Thread:

Advanced Search

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
GoComics/Comics.com Merger BRGriff News 5 06-05-2011 01:16 PM
gocomics recipe not longer working in full Andreas Recipes 8 05-20-2011 11:25 AM
Fixed: GoComics.com Starson17 Recipes 1 01-02-2011 02:21 PM
Note Merger help physics@war iRex 6 07-10-2009 09:04 AM
HTML Merger ProDigit Workshop 3 02-20-2009 06:51 AM


All times are GMT -4. The time now is 12:39 PM.


MobileRead.com is a privately owned, operated and funded community.