Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Recipes

Notices

Reply
 
Thread Tools Search this Thread
Old 03-02-2012, 05:48 PM   #1
myrkul999
Junior Member
myrkul999 began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Mar 2012
Device: kindle
Question Not getting the entire page

Hey all, I'm trying to create a recipe to get a feed which contains 2 comics, and one of the comics loads just fine, the other doesn't get the whole thing, even in the input phase. I don't get it.

Maybe bigger brains than I can figure out what Calibre is choking on, and how to fix it.
Here's the source of the page that works (these are all taken from the most recent comic, and will stay the same until Monday, in case someone wants to try it themselves):
Spoiler:

Code:
<!-- #### maxavailable = 2012-03-02, maxpage = 903 -->
<!-- #### maxavailable = 2012-03-02, maxpage = 903 -->
<!-- #### npage = 903 -->


<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html lang="en">
 <head>
  <title>Big Head Press - Thoughtful Stories, Graphic Novels Online And In Print - Escape From Terra - by Sandy Sandfort, Scott Bieser, Leila Del Duca and Lee Oaks!</title>
    <meta http-equiv="content-type" content="text/html; charset=iso-8859-1">
    <meta name="keywords" content="Big Head Press, The Probability Broach, A Drug War Carol, The Architect, The Hook, Roswell Texas, Mike Baron, L Neil Smith, Scott Bieser, Andie Tong, Gabe Eltaeb, comics, graphic novels, liberty, sci-fi, science fiction, horror, thriller, thoughtful stories, La Muse, Adi Tantimedh, Hugo Petrus">

    <LINK REL="SHORTCUT ICON" HREF="/favicon.ico" type="image/x-icon">
    <meta name="ROBOTS" content="INDEX, FOLLOW">
    <link rel="stylesheet" href="http://www.bigheadpress.com/main2008.css" type="text/css" media="screen">
    <link rel="alternate" type="application/rss+xml" title="Big Head Press Daily Updates" href="http://www.bigheadpress.com/rssupdates" />
 </head>

 <body style="background: #000000;">
   <div id="pagewrapper976"> <!-- start of page wrapper -->




<div id="topbar">
<table width="976px" border="0px" cellpadding="0" cellspacing="0">
	<!-- top row -->
	<tr>
	   <td rowspan="2" valign="top" background="/images/eft/bgcolor.gif"><a href="http://www.bigheadpress.com"><img src="images/eft/EFTpage_BHPlogo2.jpg" width="104px" height="124px" alt="Big Head Press" border="0px" /></a></td>

	   <td>
		<table width="872px" border="0px" cellpadding="0" cellspacing="0">

		<tr>
		<td colspan="1"><img src="/images/eft/EFTpage_EFTlogo1.gif" width="404px" height="60px" alt="" /></td>

		<!-- banner ad box -->
<!--
		<td colspan="5" width="468px" height="60px" align="center" valign="center"><a href='http://www.silvercirclemovie.com/?p=1&utm_source=BigHeadPress&utm_medium=Banner&utm_campaign=Web%2BComic' ><img src="/images/ads/silverCircleComicAd_468x60v2.jpg" width="468px" height="60px" border="0px" alt="Silver Circle Movie" /></a></td>
		<td colspan="5" width="468px" height="60px" align="center" valign="center"><a href='http://www.silvercirclemovie.com/comic-page-1/?utm_source=BigHeadPress&utm_medium=Banner&utm_campaign=WebComic' ><img src="/images/ads/silverCircleComicAd_468x60v2.jpg" width="468px" height="60px" border="0px" alt="Silver Circle Movie" /></a></td>
-->
		<td colspan="5" width="468px" height="60px" align="center" valign="center"><a href='http://www.silvercirclemovie.com/comic-page-1/?utm_source=BHP&utm_medium=Holiday&utm_campaign=Comic' ><img src="/images/ads/SilverCircleJanuary468x60.jpg" width="468px" height="60px" border="0px" alt="Silver Circle Movie" /></a></td>
		</tr>
		</table>
	   </td>
	</tr>

	<!-- 2nd row -->
	   <td>
		<table width="872px" border="0px" cellpadding="0" cellspacing="0">
		<tr>
		<td colspan="2"><img src="/images/eft/EFTpage_EFTlogo2.gif" width="523px" height="64px" alt="" /></td>

		<td><a href="http://forum.bigheadpress.com/index.php?board=13"><img src="images/eft/EFTpage_Forum.gif" width="82px" height="64px" alt="" border="0px"/></a></td>
<!--
		<td colspan="1" align="left"><a href="mailto:contact@bigheadpress.com"><img src="images/eft/EFTpage_e-mail.gif" width="87px" height="64px" alt="" border="0px"/></a></td>
-->
		<td colspan="1" align="left"><a href="/eft?page=0"><img src="images/eft/EFTpage2_about_link.gif" width="87px" height="64px" alt="" border="0px"/></a></td>

		<td colspan="1"><a href="/store"><img src="images/eft/EFTpage_shop.gif" width="67px" height="64px" alt="" border="0px"/></a></td>
		<td><a href="/contactus"><img src="images/eft/EFTpage2_Contact_link.gif" width="113px" height="64px" alt="" border="0px"/></a></td>
		</tr>
		</table>
	   </td>
	</tr>

	<!-- 3rd row - Navigation button row -->
	<tr>

         <td colspan="15">
           <table border="0px" cellspacing="0" cellpadding="0">
	    <tr>
		<td colspan="2" align="center" valign="bottom">
		  <a href="/eft?page=1" border="0px"><img src="/images/eft/EFTpage_first_strip2.gif" border="0px" width="110px" height="26px" /></a></td>

		<td align="center" valign="bottom">
		    <a href="/eft?page=844" border="0px">
		     <img src="/images/eft/EFTpage_prev_arc.gif" border="0px" width="94px" height="26px" /></a></td>

		<td align="center" valign="bottom">
		    <a href="/eft?page=902" border="0px">
		     <img src="/images/eft/EFTpage_prev_strip.gif" border="0px" width="105px" height="26px" /></a></td>

		<td align="center" valign="bottom">
		     <img src="/images/eft/EFTpage_next_stripDim.gif" border="0px" width="101px" height="26px" /></td>

		<td align="center" valign="bottom">
		     <img src="/images/eft/EFTpage_next_arcDim.gif" border="0px" width="94px" height="26px" /></td>

		<td colspan="2" align="center" valign="bottom">
		  <a href="/eft?page=869" border="0px"><img src="/images/eft/EFTpage_current_arc.gif" border="0px" widht="117px" height="26px" /></a></td>

		<td colspan="3">
		   <a href="/eft?page=899" border="0px"><img src="/images/eft/EFTpage_this_week.gif" width="107px" height="26px" alt="" border="0px"></a></td>

		<td colspan="4">
<a href="http://www.bigheadpress.com/eftrss">
<img src="/images/eft/EFTpage_RSS.gif" width="246px" height="26px" alt="RSS" / border="0px" /></a></td>
	      </tr>

            </table>
          </td>
	</tr>
</table>
</div>

<div style="width: 100%;  background-image: url(/images/eft/bgcolor.gif); background-repeat: repeat; padding-bottom: 0px; float: left; border: 2px; border-color: #ff0000;">

<table border="0px" cellspacing="0px" cellpadding="0px">
<tr> 

 <td valign="top" align="center">
<div style="width: 724px; background: /images/eft/bgcolor.gif; background-repeat: repeat-x; float: left; display: inline;">
  <div style="padding-top: 2px; padding-bottom: 2px;">

    <a href="http://www.bigheadpress.com/eftbook2">
     <img src="/images/main2010/ScreamerEFTv2OnSale.gif" width="724" height="36" border="0px" alt="Escape From Terra Vol 2 - On Sale Now!"></a> 
  </div>
  <img border="0px" alt="Strip 903 of Escape From Terra" src="/disppage2?story=eft&file=/simages/eft/EFT03-275.jpg">

<br>

</div>

<div style="text-align: center; font-size: 8pt; font-weight: normal; font-style: italic; color: #dfdf4f; float: block;" >
     <br>Strip 903     -- First Seen: 2012-03-02<br>
     Escape From Terra is updated with new pages every Monday through Friday.<br>

</div>

<div style="margin: 40px,0px,0px,40px; width: 100%;  background-image: url(/images/<?php echo ; ?>/bgcolor.gif); background-repeat: repeat; padding-bottom: 0px; float: left; border: 1px; border-color: #ffffff; text-align: left;">
<br><br>
<h2>
&nbsp; Most Recent Posts Regarding "Escape From Terra"
</h2>
<ol>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=686">What goes up... (Stealing the ISS)</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=680">Leon's Rocket</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=682">E.f.T. Sited in Public</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=678">Raid on Area 51! (1/13/12)</a></li>

<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=660">Typo</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=684">One thing missing</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=668">The New (12/12/2011) Arc on EFT</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=673">Brain bomb (1/9/12)</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=679">869: Art Style</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=677">anyone making book on whether or not Tobi has his trans-matter beam working?</a></li>
</ol><br></div>

<div style="margin: 10px 0px 0px 15px; padding-top: 40px; text-align: left;">
<h2><br>
The Transcript For This Page

</h2>

Panel 1
 <br>Terry picks up her glass from off the table and hands Leon his. Leon looks hurt, if we can see his face.
 <br>	Terry: Okay, lecture over. So what's the latest lurch of our zig-zag plan, Stan?
 <br>	Leon: Ouch!
 <br>Panel 2
 <br>Above-knee shot of Leon, still hurt.
 <br>	Leon: Come on, Terry. It's not as though we've changed our plan willy-nilly. 
 <br>	Leon: The government keeps changing the rules and moving the goal post. We have had 		to change with the changing times.
 <br>Panel 3
 <br>Small panel. View of Terry, rolling her eyes.
 <br>	Leon (OP): So anyway, I am almost totally sure...
 <br>Panel 4
 <br>Leon, a bit nervous at Terry's reaction from the previous panel.
 <br>	Leon: Uh... assuming nothing else comes up...
 <br>	Leon: ...a week from Friday a triple Highball Express launch will take you, me, the last 		of the crew and a boat-load of supplies and and equipment up to the ISS. 
 <br>Panel 5
 <br>Closer shot of Leon. He raises his glass to his lips to take a drink.
 <br>	Leon: Two of the three-member skeleton crew are in on the plan. The third, Svetlana 		Ruzayevka, will be given the chance to join us. 
 <br>	Leon: If she doesn't want to go, she will be sent home in the Soyuz, that's currently on 		station.
 <br>Panel 6
 <br>Looking down on Terry, who has sat down on the bed, taken Leon's hand, and is looking up at Leon.
 <br>	Terry: What's to keep her from reporting us if she doesn't want to go?
 <br>Panel 7
 <br>Leon smiles down at Terry in appreciation. Imply that they're still holding hands.
 <br>	Leon: Good question. If she doesn't want to go to Mars, we isolate her from all the comm 		gear and put her under 'house arrest' in the Russian habitat. 
 <br>Panel 8
 <br>Small panel. We see them still holding hands, but we focus on Terry's face. She looks dubious.
 <br>	Leon (OP): We've thought of everything.
 <br>   <br></div>

 </td>

 <td valign="top" background="/images/eft/bgcolor.gif">
<div style="width: 248px; margin: 0 0 0 0; padding: 0px; border: 0px; display: inline; float: left; background-image: url(/images/eft/bgcolor.gif); background-repeat: repeat;" >

 <table border="0px" width="248px" cellpadding="0px" cellspacing="0px">
 <tr>
  <!-- Left column -->
  <td valign="top" align="center" width="117px">
<!--
<script type="text/javascript">
var addthis_config =
{
   data_track_addressbar: true
}
</script>
-->

  <br style="font-size: 6px";>
<div style="margin: 0px; width: 117px; height: 21px; padding: 0px; background-image: url('/images/eft/bgcolor.gif');">
<form action="https://www.paypal.com/cgi-bin/webscr" method="post">
<input type="hidden" name="cmd" value="_s-xclick">
<input type="hidden" name="hosted_button_id" value="9WGNP9RNEQR44">
<input type="image" src="https://www.paypal.com/en_US/i/btn/btn_donate_SM.gif" border="0" name="submit" alt="PayPal - The safer, easier way to pay online!">
<img alt="" border="0" src="https://www.paypal.com/en_US/i/scr/pixel.gif" width="1" height="1">
</form>
</div>
  <br style="font-size: 6px";>
<!-- AddThis Button BEGIN -->
<a class="addthis_button" href="http://www.addthis.com/bookmark.php?v=250&amp;pubid=ra-4e6d1216052a0b43"><img src="http://s7.addthis.com/static/btn/sm-share-en.gif" width="83" height="16" alt="Bookmark and Share" style="border:0"/></a>
<script type="text/javascript">
   var addthis_config = {
      data_ga_property: 'UA-1657572-1',
      data_ga_social: true
   };
</script>
<script type="text/javascript" src="http://s7.addthis.com/js/250/addthis_widget.js#pubid=ra-4e6cdc6a3f6eb0ff"></script>

<!-- AddThis Button END -->
  <br style="font-size: 6px";>
  <br style="font-size: 6px";>
  <a href="http://www.bigheadpress.com/TheTimeSink"><img border="0px" src="/images/eft/EFTpage_Scotts_blog.gif"></a>
  <a href="http://www.leeoaks.blogspot.com/"><img border="0px" src="/images/eft/EFTpage_LEEs_blog.gif"></a>
  <a href="http://www.spaceentrepreneurs.org/"><img border="0px" src="/images/eft/EFTpage_IASE.gif"></a>
 
  <br>
  <br style="font-size: 6px";>
  <a href="http://topwebcomics.com/vote/8390/default.aspx" title="Vote for Escape From Terra on TopWebComics!"><img src="http://topwebcomics.com/rankimages/rankimage.aspx?ImageTemplate=dynamiclink3&SiteID=8390"></a>

  <br><br style="font-size: 6px";>
<!--
  <a href="http://www.buzzcomix.net/in.php?cid=12892"><img src="http://www.buzzcomix.net/vote/vote-big-rank.php?cid=12892" alt="Vote for Escape From Terra!" border="0"/></a>

  <br><br style="font-size: 6px";>
-->
<!-- Beginning of Project Wonderful ad code: -->
<!-- Ad box ID: 25748 -->
<script type="text/javascript">
<!--
var pw_d=document;
pw_d.projectwonderful_adbox_id = "25748";
pw_d.projectwonderful_adbox_type = "2";
//-->
</script>
<script type="text/javascript" src="http://www.projectwonderful.com/ad_display.js"></script>
<noscript><map name="admap25748" id="admap25748"><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=0&id=25748&type=2" shape="rect" coords="0,0,117,30" title="" alt="" target="_blank" rel="nofollow" /><area href="http://www.projectwonderful.com/out_nojs.php?r=1&c=0&id=25748&type=2" shape="rect" coords="0,30,117,60" title="" alt="" target="_blank" rel="nofollow" /><area href="http://www.projectwonderful.com/out_nojs.php?r=2&c=0&id=25748&type=2" shape="rect" coords="0,60,117,90" title="" alt="" target="_blank" rel="nofollow" /><area href="http://www.projectwonderful.com/out_nojs.php?r=3&c=0&id=25748&type=2" shape="rect" coords="0,90,117,120" title="" alt="" target="_blank" rel="nofollow" /></map>
<table cellpadding="0" border="0" cellspacing="0" width="117" bgcolor="#ffffff"><tr><td><img src="http://www.projectwonderful.com/nojs.php?id=25748&type=2" width="117" height="120" usemap="#admap25748" border="0" alt="" /></td></tr></table>
</noscript>
<!-- End of Project Wonderful ad code. -->
  <br style="font-size: 6px";>

 </td>

  <!-- Right Column -->
  <td valign="top" style="margin: 0px 0px 0px 0px; width: 125px; float: left; text-align: center; ">

  <!-- house box ad 124x150 -->
  <a href="/pk"><img border="0px" width="124px" src="/images/ads/PK124x150.gif"></a>

  <br><br style="font-size: 6px";>

  <!-- House ad tag line 124x180 -->
  <a href="http://www.quantumvibe.com"><img border="0px" width="124px" src="/images/ads/QV124x180.jpg"></a>

  <br><br style="font-size: 6px";>
<a href="http://www.thewebcomiclist.com/"><img src="http://www.thewebcomiclist.com/myranking.php?id=14166" alt="Vote For Escape From Terra on The Webcomic List" border="0" style="border:1px solid #000000"></a>
<!--
  <a href="http://www.onlinecomics.net"><img src="http://www.onlinecomics.net/images/banners/OC_88x31.gif" width="88px" height="31px" border="0"></a>
-->

  </td>

 </tr>

</table>
<table border="0px" width="248px" cellpadding="0px" cellspacing="0px">
<!--
 <tr>
  <td style="margin: 2px 0 2px 8px; width: 248px; float: left; text-align: left; ">
  <a href="http://www.freedomsphoenix.com/Promotion-Page.htm?ProNo=11"><img src="/images/ads/2010_Freedom_Summit_Banner_160_x_40_Flat.jpg" width="240px" height="60px" border="0px"></a>
 </td>

 </tr>
-->
 <!-- Skyscraper 160 x 600 -->

 <tr>
  <td style="margin: 2px 0 2px 87px; width: 248px; float: left; text-align: right; ">
<!-- Beginning of Project Wonderful ad code: -->
<!-- Ad box ID: 25753 -->
<script type="text/javascript">
<!--
var pw_d=document;
pw_d.projectwonderful_adbox_id = "25753";
pw_d.projectwonderful_adbox_type = "3";
//-->
</script>
<script type="text/javascript" src="http://www.projectwonderful.com/ad_display.js"></script>
<noscript><map name="admap25753" id="admap25753"><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=0&id=25753&type=3" shape="rect" coords="0,0,160,600" title="" alt="" target="_blank" rel="nofollow" /></map>
<table cellpadding="0" border="0" cellspacing="0" width="160" bgcolor="#ffffff"><tr><td><img src="http://www.projectwonderful.com/nojs.php?id=25753&type=3" width="160" height="600" usemap="#admap25753" border="0" alt="" /></td></tr></table>
</noscript>
<!-- End of Project Wonderful ad code. -->
 </td>

 </tr>

 </table>

</div>
 </td>

</tr>

</table>


</div>  <!-- end of body -->

<div id="copyright">Story Contents &copy 2008 - 2012 Sandy Sandfort, Scott Bieser, Leila Del Duca and Lee Oaks!<br>Framing Graphics &copy 2008 - 2012 Big Head Press</div>

</div> <!-- page wrapper -->

<script src="http://www.google-analytics.com/urchin.js" type="text/javascript">
</script>
<script type="text/javascript">
_uacct = "UA-1657572-1";
urchinTracker();
</script>

<script type="text/javascript">
var gaJsHost = (("https:" == document.location.protocol) ? "https://ssl." : "http://www.");
document.write(unescape("%3Cscript src='" + gaJsHost + "google-analytics.com/ga.js' type='text/javascript'%3E%3C/script%3E"));
</script>
<script type="text/javascript">
try {
var pageTracker = _gat._getTracker("UA-1657572-2");
pageTracker._trackPageview();
} catch(err) {}</script>

</body>
</html>


Here's the resulting page (input):
Spoiler:
Code:
<?xml version='1.0' encoding='utf-8'?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Big Head Press - Thoughtful Stories, Graphic Novels Online And In Print - Escape From Terra - by Sandy Sandfort, Scott Bieser, Leila Del Duca and Lee Oaks!</title>
    <meta content="http://www.w3.org/1999/xhtml; charset=utf-8" http-equiv="Content-Type"/>
  <link href="../../stylesheet.css" type="text/css" rel="stylesheet"/><style type="text/css">
		@page { margin-bottom: 5.000000pt; margin-top: 5.000000pt; }</style></head>
  <body class="calibre"><div class="calibrenavbar">| <a href="../article_1/index.html" class="calibre6">Next</a> | <a href="../index.html#article_0" class="calibre6">Section Menu</a> | <a href="../../index.html#feed_0" class="calibre6">Main Menu</a> | <hr class="calibre7"/>
</div><div class="calibre5"><tr class="calibre8"><td valign="top" class="calibre9">
<div class="calibre5">
<div class="calibre5">
<a href="http://www.bigheadpress.com/eftbook2" class="calibre6">
<img src="images/img1.jpg" border="0px" alt="Escape From Terra Vol 2 - On Sale Now!" class="calibre2"/></a>
</div>
<img border="0px" alt="Strip 903 of Escape From Terra" src="images/img2.jpg" class="calibre2"/></div>
<p class="calibre10">
<br class="calibre5"/>Strip 903     -- First Seen: 2012-03-02<br class="calibre5"/>
     Escape From Terra is updated with new pages every Monday through Friday.<br class="calibre5"/></p>
<p class="calibre10">
</p><h2 class="calibre11"><br class="calibre12"/>
The Transcript For This Page
</h2>

Panel 1
 <br class="calibre5"/>Terry picks up her glass from off the table and hands Leon his. Leon looks hurt, if we can see his face.
 <br class="calibre5"/>	Terry: Okay, lecture over. So what's the latest lurch of our zig-zag plan, Stan?
 <br class="calibre5"/>	Leon: Ouch!
 <br class="calibre5"/>Panel 2
 <br class="calibre5"/>Above-knee shot of Leon, still hurt.
 <br class="calibre5"/>	Leon: Come on, Terry. It's not as though we've changed our plan willy-nilly. 
 <br class="calibre5"/>	Leon: The government keeps changing the rules and moving the goal post. We have had 		to change with the changing times.
 <br class="calibre5"/>Panel 3
 <br class="calibre5"/>Small panel. View of Terry, rolling her eyes.
 <br class="calibre5"/>	Leon (OP): So anyway, I am almost totally sure...
 <br class="calibre5"/>Panel 4
 <br class="calibre5"/>Leon, a bit nervous at Terry's reaction from the previous panel.
 <br class="calibre5"/>	Leon: Uh... assuming nothing else comes up...
 <br class="calibre5"/>	Leon: ...a week from Friday a triple Highball Express launch will take you, me, the last 		of the crew and a boat-load of supplies and and equipment up to the ISS. 
 <br class="calibre5"/>Panel 5
 <br class="calibre5"/>Closer shot of Leon. He raises his glass to his lips to take a drink.
 <br class="calibre5"/>	Leon: Two of the three-member skeleton crew are in on the plan. The third, Svetlana 		Ruzayevka, will be given the chance to join us. 
 <br class="calibre5"/>	Leon: If she doesn't want to go, she will be sent home in the Soyuz, that's currently on 		station.
 <br class="calibre5"/>Panel 6
 <br class="calibre5"/>Looking down on Terry, who has sat down on the bed, taken Leon's hand, and is looking up at Leon.
 <br class="calibre5"/>	Terry: What's to keep her from reporting us if she doesn't want to go?
 <br class="calibre5"/>Panel 7
 <br class="calibre5"/>Leon smiles down at Terry in appreciation. Imply that they're still holding hands.
 <br class="calibre5"/>	Leon: Good question. If she doesn't want to go to Mars, we isolate her from all the comm 		gear and put her under 'house arrest' in the Russian habitat. 
 <br class="calibre5"/>Panel 8
 <br class="calibre5"/>Small panel. We see them still holding hands, but we focus on Terry's face. She looks dubious.
 <br class="calibre5"/>	Leon (OP): We've thought of everything.
 <br class="calibre5"/><br class="calibre5"/></td>
<td valign="top" class="calibre13">
<div class="calibre5">
<table border="0px" cellpadding="0px" cellspacing="0px" class="calibre14"><tr class="calibre15"><td class="calibre13">
</td>
</tr></table></div>
</td>
</tr></div><div class="calibrenavbar">
<hr class="calibre7"/>
<p class="calibre16">This article was downloaded by <strong class="calibre17">calibre</strong> from <a href="http://www.bigheadpress.com/eft?page=903" class="calibre6">http://www.bigheadpress.com/eft?page=903</a></p>
<br class="calibre5"/><br class="calibre5"/> | <a href="../index.html#article_0" class="calibre6">Section Menu</a> | <a href="../../index.html#feed_0" class="calibre6">Main Menu</a> | </div></body></html>


The source of the page that doesn't work:
Spoiler:
Code:
<!-- #### maxavailable = 2012-03-02, maxpage = 311 -->
<!-- #### maxavailable = 2012-03-02, maxpage = 311 -->
<!-- #### npage = 311 -->


<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html lang="en">
 <head>
  <title>Quantum Vibe by Scott Bieser, published by Big Head Press</title>
    <meta http-equiv="content-type" content="text/html; charset=iso-8859-1">
    <meta name="keywords" content="Quantum Vibe, Big Head Press, Scott Bieser, comics, graphic novels, liberty, sci-fi, science fiction, thoughtful stories">

    <LINK REL="SHORTCUT ICON" HREF="/favicon.ico" type="image/x-icon">
    <meta name="ROBOTS" content="INDEX, FOLLOW">
    <link rel="stylesheet" href="/qv.css" type="text/css" media="screen">
    <link rel="alternate" type="application/rss+xml" title="Big Head Press Daily Updates" href="http://www.bigheadpress.com/rssupdates" />

 </head>


<! -- ShareThis script -->
<script type="text/javascript" src="http://w.sharethis.com/button/buttons.js"></script><script type="text/javascript">stLight.options({publisher:'f3b6ed38-ef74-4bdf-b02d-87379e683263'});</script>



<body style="background: #ffffff; margin: 0px 0px 0px 0px;">
<script type="text/javascript">

  var _gaq = _gaq || [];
  _gaq.push(['_setAccount', 'UA-1657572-4']);
  _gaq.push(['_trackPageview']);

  (function() {
    var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true;
    ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js';
    var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s);
  })();

</script>

<!-- start of page wrapper -->
<div style="width: 977px; background-image: url('/images/qv/QV-Spacer1.gif'); background-repeat: repeat; color: #000000; display: block; font-weight: normal; margin: 0 auto; padding: 0; ">


<!-- Left Column -->
<div style="width: 814px;  float: left; display: block;  background-image: url('/images/qv/QV-Content.gif'); background-repeat: repeat; margin: 0 auto; padding: 0px; ">

 <!-- Top Bar -->
 <div style="width: 814px; float: left; display: block; background-color: #436588; margin: 0 auto; padding: 0px; ">

 <div style="width: 86px; height: 220px; float: left; display: block; margin: 0px 0 0px 0; padding: 0px; border: 0px;">
   
  <a href="http://www.bigheadpress.com">
   <div><img src="/images/qv/QV-BHPlogo.gif" border="0px" width="86px" height="100px"/></div> </a>

   <div><img src="/images/qv/QV-Thoughtful.gif" border="0px" width="86px" height="22px"/></div>
 
   <a href="http://forum.bigheadpress.com">
    <div><img src="/images/qv/QV-Forum.gif" border="0px" width="86px" height="25px"/></div> </a>

   <a href="http://www.bigheadpress.com/store">
    <div><img src="/images/qv/QV-Shop.gif" border="0px" width="86px" height="25px"/></div> </a>

   <a href="http://www.bigheadpress.com/comics">
    <div><img src="/images/qv/QV-About.gif" border="0px" width="86px" height="25px"/></div> </a>

   <a href="http://www.bigheadpress.com/contactus">
    <div><img src="/images/qv/QV-Contact.gif" border="0px" width="86px" height="25px"/></div> </a>

 </div>

 <div style="width: 728px; height: 102px; background-image: url('/images/qv/QV-LeaderBoard.gif'); float: left; display: inline; margin: 0 auto; padding: 0px;">
  <!-- Beginning of Project Wonderful ad code: -->
<!-- Ad box ID: 53636 -->
<script type="text/javascript">
<!--
var pw_d=document;
pw_d.projectwonderful_adbox_id = "53636";
pw_d.projectwonderful_adbox_type = "5";
//-->
</script>
<script type="text/javascript" src="http://www.projectwonderful.com/ad_display.js"></script>
<noscript><map name="admap53636" id="admap53636"><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=0&id=53636&type=5" shape="rect" coords="0,0,728,90" title="" alt="" target="_blank" /></map>
<table cellpadding="0" border="0" cellspacing="0" width="728" bgcolor="#ffffff"><tr><td bgcolor="#ffffff" colspan="1"><center><a style="font-size:10px;color:#0000ff;text-decoration:none;line-height:1.2;font-weight:bold;font-family:Tahoma, verdana,arial,helvetica,sans-serif;text-transform: none;letter-spacing:normal;text-shadow:none;white-space:normal;word-spacing:normal;" href="http://www.projectwonderful.com/advertisehere.php?id=53636&type=5" target="_blank">Ads by Project Wonderful!  Your ad could be here, right now.</a></center></td></tr><tr><td><img src="http://www.projectwonderful.com/nojs.php?id=53636&type=5" width="728" height="90" usemap="#admap53636" border="0" alt="" /></td></tr></table>
</noscript>
<!-- End of Project Wonderful ad code. -->
 </div>

<!--
 <div style="float: left; display: inline; margin: 0 auto; padding: 0px; width: 728px; height: 10px; border: 0px;">
  <div><img src="/images/qv/QV-Spacer1.gif" border="0px" width="728px" height="10px" /></div>
 </div>
-->

 <div style="width: 330px; float: left; display: inline; margin: 0 auto; padding: 0px;">
  <div><img src="/images/qv/QV-Logo2.jpg" border="0px"/></div>
 </div>

 <div style="width: 398px; float: left; display: inline; margin: 0 auto; padding: 0px;">
  <div style="float: left; display: block; margin: 0 auto; padding: 0px; width: 398px; height: 80px; border: 0px;">
  <div><img src="/images/qv/QV-Blurb.gif" border="0px" width="398px" height="80px"/></div>
  </div>

  <div style="float: left; display: block; margin: 0 auto; padding: 0px;">
    <div style="float: left; display: block; margin: 0 auto; padding: 0px;">
      <!-- First Page -->
     <a href="/strip?page=1" border="0px">
     <div><img src="/images/qv/QV-FirstStrip.gif" border="0px" width="52px" height="40px"/></div> </a>
    </div>
    <div style="float: left; display: inline; margin: 0 auto; padding: 0px;">
      <!-- Prev Arc -->

     <a href="/strip?page=121" border="0px">
      <div><img src="/images/qv/QV-PrevArc.gif" border="0px" width="55px" height="40px"/></div> </a>
    </div>
    <div style="float: left; display: inline; margin: 0 auto; padding: 0px;">
      <!-- Prev Page -->
     <a href="/strip?page=310" border="0px">
      <div><img src="/images/qv/QV-PrevStrip.gif" border="0px" width="55px" height="40px"/></div> </a>
    </div><div style="float: left; display: inline; margin: 0 auto; padding: 0px;">

      <!-- Next Page -->
     <div><img src="/images/qv/QV-NextStrip.gif" border="0px" width="54px" height="40px"/></div>
    </div><div style="float: left; display: inline; margin: 0 auto; padding: 0px;">
      <!-- Next Arc -->
     <div><img src="/images/qv/QV-NextArc.gif" border="0px" width="55px" height="40px"/></div>
    </div><div style="float: left; display: inline; margin: 0 auto; padding: 0px;">
      <!-- Next Current Arc -->
     <a href="/strip?page=201" border="0px">
      <div><img src="/images/qv/QV-CurrentArc.gif" border="0px" width="72px" height="40px"/></div> </a>

    </div><div style="float: left; display: inline; margin: 0 auto; padding: 0px;">
      <!-- This Week -->
     <a href="/strip?page=307" border="0px">
      <div><img src="/images/qv/QV-ThisWeek.gif" border="0px" width="55px" height="40px"/></div> </a>
    </div>
  </div>
 </div>
 </div>


<!-- Page Display Area -->
 <div style="width: 814px; float: left; display: block; padding-top: 5px;  background-color: #436588;">
  <div style="width: 808; padding-left: 6px; float: left; display: block;  background-color: #436588;">
<!--
   <div style="padding-left: 0px; padding-bottom: 5px;">
   <a href="http://www.libertopia.org">
     <img src="http://www.bigheadpress.com/images/ads/ScreamerQVLibertopia.gif" width="802" height="36" border="0px" alt=""> </a>
   </div>
-->

    <img border="0px" alt="Strip 311 of Quantum Vibe" src="/disppage2?story=qv&file=/simages/qv/QV01-305.jpg"/>
 
  <br>
  </div>


  <div style="width: 814px; text-align: center; font-size: 8pt; font-weight: normal; font-style: italic; color: #dff8f8; background-color: #436588; float: left; display: block;" >
     Strip 311      -- First Seen: 2012-03-02<br>
     Quantum Vibe is updated with new pages every Monday through Friday.<br><br>
  </div>

  <!-- PW ads -->
  <div style="float: left; display: block; width: 750px; margin: 0 0 20px 60px;">
     <!-- Beginning of Project Wonderful ad code: -->

<!-- Ad box ID: 53544 -->
<script type="text/javascript">
<!--
var pw_d=document;
pw_d.projectwonderful_adbox_id = "53544";
pw_d.projectwonderful_adbox_type = "2";
//-->
</script>
<script type="text/javascript" src="http://www.projectwonderful.com/ad_display.js"></script>
<noscript><map name="admap53544" id="admap53544"><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=0&id=53544&type=2" shape="rect" coords="0,0,117,30" title="" alt="" target="_blank" /><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=1&id=53544&type=2" shape="rect" coords="117,0,234,30" title="" alt="" target="_blank" /><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=2&id=53544&type=2" shape="rect" coords="234,0,351,30" title="" alt="" target="_blank" /><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=3&id=53544&type=2" shape="rect" coords="351,0,468,30" title="" alt="" target="_blank" /><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=4&id=53544&type=2" shape="rect" coords="468,0,585,30" title="" alt="" target="_blank" /><area href="http://www.projectwonderful.com/out_nojs.php?r=0&c=5&id=53544&type=2" shape="rect" coords="585,0,702,30" title="" alt="" target="_blank" /></map>
<table cellpadding="0" border="0" cellspacing="0" width="702" bgcolor="#ffffff"><tr><td><img src="http://www.projectwonderful.com/nojs.php?id=53544&type=2" width="702" height="30" usemap="#admap53544" border="0" alt="" /></td></tr><tr><td bgcolor="#ffffff" colspan="6"><center><a style="font-size:10px;color:#0000ff;text-decoration:none;line-height:1.2;font-weight:bold;font-family:Tahoma, verdana,arial,helvetica,sans-serif;text-transform: none;letter-spacing:normal;text-shadow:none;white-space:normal;word-spacing:normal;" href="http://www.projectwonderful.com/advertisehere.php?id=53544&type=2" target="_blank">Your ad could be here, right now.</a></center></td></tr></table>
</noscript>
<!-- End of Project Wonderful ad code. -->
  </div>

  <!-- Display Forum -->
  <div style="display: block;">
  <br>

<h2>
&nbsp; Most Recent Posts Regarding "Quantum Vibe"
</h2>
<ol>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=688">Burned</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=687">Inserts</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=588">A remarkable resemblance</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=670">For New Fans</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=669">Spider Jerusalem tribute</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=681">Plus Tax!</a></li>

<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=519">Shovels in L5?</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=685">S.A.</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=630">So it's Greg Egan's New Territories</a></li>
<li><a class="texteft" href="http://forum.bigheadpress.com/index.php?topic=674">Loonies in the bar</a></li>
</ol><br>
  </div>

  <!-- Display Script -->
  <div style="margin: 0px 5px 10px 15px; color: #ffffff; display: block;">
  

<h2>
The Transcript For This Page
</h2>

Panel 1
 <br>Nicole races away from Rando's room, down a hallway.
 <br>Nicole: holyshuckincrat
 <br>Panel 2
 <br>Nicole is standing anxiously by an elevator (we see a sign indicating 55th Floor) when she sees Rando stumbling towards her.
 <br>Rando (weakly): BITCH!
 <br>Nicole: gourdammit
 <br>Panel 3
 <br>Nicole races out onto a balcony that was near the elevator.
 <br>Rando: GOT U NAO, U FUKIN HOR!
 <br>Rando: DAR IZ NO WAY OFF DIS BALCONY!
 <br>Panel 4
 <br>Nicole launches herself  over the balcony's edge into the open air above the street. Rando looks on helplessly.
 <br>Rando: WAIT!
 <br>   <br>  </div>

 </div>

</div> <!-- End of Left Column -->

<!-- Right Column -->
<div style="width: 162px; background-image: inherit; float: left; display: inline; border: 0px;">

 <div style="width: 162px; float: left; display: block;">
  <a href="http://questioncopyright.org/creator_endorsed">
  <div><img src="/images/qv/QV-CElogo.gif" border="0px" width="162px" height="102px"/></div> </a>

 </div>

 <div style="width: 162px; float: left; display: inline; border: 0px;">
  <div><img src="/images/qv/QV-SharingText.gif" border="0px" width="162px" height="19px"/></div>
 </div>

 <div style="width: 162px; height: 71px; float: left; display: block; background-image: url('/images/qv/QV-SharingBtns.gif');">
   <span class="st_twitter_large" displayText="Tweet"></span><span class="st_facebook_large" displayText="Facebook"></span><span class="st_ybuzz_large" displayText="Yahoo! Buzz"></span><span class="st_gbuzz_large" displayText="Google Buzz"></span><span class="st_email_large" displayText="Email"></span><span class="st_sharethis_large" displayText="ShareThis"></span>
   <span style="float: left; display: inline; margin: 0px 3px 0px 3px; padding: 0px;"> 
     <a href="http://www.quantumvibe.com/qvrss">

      <img src="http://w.sharethis.com/images/rss_32.png" border="0px" width="32px" height="32px"/> </a>
   </span>
 </div>

 <div style="width: 162px; float: left; display: block; padding-top: 8px; padding-left: 42px;" > 
  <form action="https://www.paypal.com/cgi-bin/webscr" method="post">
<input type="hidden" name="cmd" value="_s-xclick">
<input type="hidden" name="hosted_button_id" value="46F2KARMUZSXA">
<input type="image" src="https://www.paypal.com/en_US/i/btn/btn_donate_SM.gif" border="0" name="submit" alt="PayPal - The safer, easier way to pay online!">
<img alt="" border="0" src="https://www.paypal.com/en_US/i/scr/pixel.gif" width="1" height="1">
</form>
</div>

 </div>

 <div style="width: 162px; background-image: inherit; float: left; display: block;">
  <a href="http://www.escapefromterra.com">
  <div style="padding-left: 18px; padding-bottom: 10px;"><img src="http://www.bigheadpress.com/images/ads/EFT124x180V2.jpg" border="0px" width="124px" height="180px"/></div> </a>
 </div>

 <div style="width: 162px; background-image: inherit; height: 602px; float: left; display: block;">
  <div style="margin: 0 0 0 21px; width: 120px; height: 600px; float: left; display: block;">

  <script type="text/javascript"><!--
google_ad_client = "ca-pub-6026936804066153";
/* Quantum Vibe 120x600 */
google_ad_slot = "5000875387";
google_ad_width = 120;
google_ad_height = 600;
//-->
</script>
<script type="text/javascript"
src="//pagead2.googlesyndication.com/pagead/show_ads.js">
</script>
  </div>
 </div>

<!--
 <div style="width: 162px; float: left; display: block;">
  <a href="http://www.twitter.com/ScottBieser">
  <div><img src="/images/qv/QV-ScottsTwitter.gif" border="0px" width="162px" height="38px"/></div> </a>
 </div>
-->

 <div style="width: 162px; float: left; display: block;">
  <a href="http://www.facebook.com/scott.bieser">
  <div><img src="/images/qv/QV-ScottsFacebook.gif" border="0px" width="162px" height="39px"/></div> </a>

 </div>
 <div style="width: 162px; float: left; display: block;">
<script src="http://widgets.twimg.com/j/2/widget.js"></script>
<script>
new TWTR.Widget({
  version: 2,
  type: 'profile',
  rpp: 5,
  interval: 30000,
  width: 'auto',
  height: 300,
  theme: {
    shell: {
      background: '#2c3e5c',
      color: '#ffffff'
    },
    tweets: {
      background: '#000000',
      color: '#ffffff',
      links: '#ebeb07'
    }
  },
  features: {
    scrollbar: true,
    loop: false,
    live: true,
    behavior: 'all'
  }
}).render().setUser('ScottBieser').start();
</script>
 </div>


</div> <!-- End of Right Column -->


<!-- Display Copyright -->

<div id="copyright"><br>Story Contents &copy 2012 Scott Bieser<br>Framing Graphics &copy 2012 Big Head Press</div>

</div> <!-- page wrapper -->

</body>

</html>


And again the resulting input page:
Spoiler:
Code:
<?xml version='1.0' encoding='utf-8'?>
<html xmlns="http://www.w3.org/1999/xhtml">
  <head>
    <title>Quantum Vibe by Scott Bieser, published by Big Head Press</title>
    <meta content="http://www.w3.org/1999/xhtml; charset=utf-8" http-equiv="Content-Type"/>
  <link href="../../stylesheet.css" type="text/css" rel="stylesheet"/><style type="text/css">
		@page { margin-bottom: 5.000000pt; margin-top: 5.000000pt; }</style></head>
  <body class="calibre"><div class="calibrenavbar">| <a href="../../feed_1/index.html" class="calibre6">Next</a> | <a href="../index.html#article_1" class="calibre6">Section Menu</a> | <a href="../../index.html#feed_0" class="calibre6">Main Menu</a> | <a href="../article_0/index.html" class="calibre6">Previous</a> | <hr class="calibre7"/>
</div><h2 class="calibre11">Quantum Vibe by Scott Bieser, published by Big Head Press</h2><div class="calibre5">
<p id="copyright" class="calibre10"><br class="calibre5"/>Story Contents &amp;copy 2012 Scott Bieser<br class="calibre5"/>Framing Graphics &amp;copy 2012 Big Head Press</p>
</div><div class="calibrenavbar">
<hr class="calibre7"/>
<p class="calibre16">This article was downloaded by <strong class="calibre17">calibre</strong> from <a href="http://www.quantumvibe.com/strip?page=311" class="calibre6">http://www.quantumvibe.com/strip?page=311</a></p>
<br class="calibre5"/><br class="calibre5"/> | <a href="../index.html#article_1" class="calibre6">Section Menu</a> | <a href="../../index.html#feed_0" class="calibre6">Main Menu</a> | </div></body></html>


As you can see, much shorter, and missing most of the insides of the body. It doesn't download any of the images from the page, it just seems to ignore everything after the title.

Just in case it makes a difference, here's the recipe, it's just a basic recipe with my first few (clearly unsuccessful) attempts at righting the problem added.
Spoiler:
Code:
class BasicUserRecipe1330721625(AutomaticNewsRecipe):
    title          = u'Bighead Press'
    oldest_article = 7
    max_articles_per_feed = 100
    auto_cleanup = True

    feeds          = [(u'Bighead Press story updates', u'http://bigheadpress.com/rssupdates')]

    no_stylesheets = True

    remove_javascript = True


Thanks in advance for any assistance.
myrkul999 is offline   Reply With Quote
Old 03-12-2012, 01:56 PM   #2
myrkul999
Junior Member
myrkul999 began at the beginning.
 
Posts: 2
Karma: 10
Join Date: Mar 2012
Device: kindle
Thanks for the bugfix, Kovid. That seems to have helped. It's not getting all of them still, but I'm not entirely sure why, and it seems (without having dissected the input) pretty much random. I'll look into it more later tonight. If there's another forum where this belongs more accurately, feel free to move it.

Last edited by myrkul999; 03-12-2012 at 01:58 PM.
myrkul999 is offline   Reply With Quote
Advert
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Is it possible to import an entire website? nbtrap Library Management 1 01-10-2012 10:07 PM
Only able to add one book out of entire library read4fun Library Management 1 11-16-2011 12:33 PM
Reconverted Entire Library DoctorOhh Conversion 5 09-13-2011 01:17 PM
Translating entire pages RickyMaveety Lounge 3 02-10-2009 12:31 PM


All times are GMT -4. The time now is 12:24 PM.


MobileRead.com is a privately owned, operated and funded community.