View Single Post
Old 01-19-2010, 08:07 PM   #267
nickredding
onlinenewsreader.net
nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'nickredding knows the difference between 'who' and 'whom'
 
Posts: 328
Karma: 10143
Join Date: Dec 2009
Location: Phoenix, AZ & Victoria, BC
Device: Kindle 3, Kindle Fire, IPad3, iPhone4, Playbook, HTC Inspire
There is no difference except for the photo. Here is the input html for the article that is not indexed properly
Code:
<div class="navbar calibre_rescale_70" style="text-align: center;">
	| <a href="../article_1/index.html">Next</a> |
	<a href="../index.html#article_0">Section menu</a> |
	<a href="../../index.html#feed_0">Main menu</a> | <hr /></div>
<div id="storyheader">
	<div class="headline">
		<h1>Symphony Splash seeks sponsor for Victoria's most popular public event</h1>
	</div>
	<div class="subheadline">
		<h2>$75,000 needed for free outdoor concert; players to take salary cut</h2>
	</div>
	<div class="byline">
		<span class="name">By Jim Gibson, Times Colonist</span><span class="timestamp">January 
		19, 2010</span></div>
</div>
<div id="storycontent" class="para18">
	<div id="imageBox">
		<div class="wrapper_0_10_0_0">
			<div class="storyimage" id="">
				<a href="javascript:void(0);" onclick="tabClick(' - Photos Tab',false,'storypage','story_photo_content',true,true);">
				<img id="storyphoto" class="thumbnail" border="0" alt="The 2009 event of Symphony Splash drew an estimated 40,000 people to the Inner Harbour on Aug. 2." src="images/img2.bin.jpg" /></a></div>
			<div class="imagetext">
				<h1 id="photocaption">The 2009 event of Symphony Splash drew an 
				estimated 40,000 people to the Inner Harbour on Aug. 2.</h1>
				<h2 id="photocredit"><b>Photograph by: </b>Adrian Lam, Times Colonist</h2>
			</div>
		</div>
	</div>
	<div id="page1">
		<p>Symphony Splash, Victoria's most popular public event, is looking for 
		a new sponsor.</p>
Here is the corresponding code in the MOBI output (transformed back into HTML by ebook-convert mobi->HTML), Notice the bookmark <a id="filepos970"></a> about half way down--after the headline, subhead and byline

Code:
      <hr class="calibre5"/>
      <p class="calibre6">
        <span class="calibre3">
          <span class="bold">Symphony Splash seeks sponsor for Victoria's most popular public event</span>
        </span>
      </p>
      <p class="calibre6">
        <span class="calibre3">
          <span class="bold">$75,000 needed for free outdoor concert; players to take salary cut</span>
        </span>
      </p>
      <p class="calibre6">By Jim Gibson, Times Colonist</p>
      <p class="calibre7">January 19, 2010</p>
      <a></a>
      <a id="filepos970"></a>
      <p class="calibre7">
        <img src="images/00006.jpg" class="calibre8"/>
      </p>
      <p class="calibre9">
        <span class="calibre3">
          <span class="bold">The 2009 event of Symphony Splash drew an estimated 40,000 people to the Inner Harbour on Aug. 2.</span>
        </span>
      </p>
      <a></a>
      <p class="calibre10">
        <span class="calibre3">
          <span class="bold">Photograph by: Adrian Lam, Times Colonist</span>
        </span>
      </p>
      <a></a>
      <p class="calibre11">Symphony Splash, Victoria's most popular public event, is looking for a new sponsor.</p>
      <p class="calibre11">The Victoria Symphony's free outdoor concert, which drew an estimated 40,000 people to the Inner Harbour last Aug. 2, needs a replacement for Bayview Residences, the title sponsor for the last three years.</p>
      <p class="calibre11">Bayview says it will continue to make "a significant contribution" to Splash, but not as title sponsor.</p>
Here is the input HTML for the next artyicle which is indexed properly
Code:
<div class="navbar calibre_rescale_70" style="text-align: center;">
	| <a href="../article_2/index.html">Next</a> |
	<a href="../index.html#article_1">Section menu</a> |
	<a href="../../index.html#feed_0">Main menu</a> |
	<a href="../article_0/index.html">Previous</a> | <hr /></div>
<div id="storyheader">
	<div class="headline">
		<h1>Handling of domestic violence overhauled</h1>
	</div>
	<div class="subheadline">
		<h2>B.C. pressured to act after gaps in services cited in murder-suicide</h2>
	</div>
	<div class="byline">
		<span class="name">By Rob Shaw and Lindsay Kines, Times Colonist</span><span class="timestamp">January 
		19, 2010</span></div>
</div>
<div id="storycontent" class="para18">
	<div id="page1">
		<p>The B.C. government unveiled changes yesterday to the way police and 
		Crown prosecutors handle domestic violence cases, but critics say it's not 
		enough to plug holes in the system.</p>
		<p>The province will help pay for a Greater Victoria regional domestic violence 
		unit, launch a B.C. Coroners Service panel to review domestic violence homicides 
		and try to better co-ordinate policies between Crown and police in the wake
and the corresponding code in the MOBI output. Notice the bookmark <div id="filepos6685" ...> at the beginning, where it should be.
Code:
    <div id="filepos6685" class="calibre1">
      <p class="calibre2">
        <span class="calibre3">
          <tt class="calibre4"> | </tt>
        </span>
        <a href="#filepos13231">
          <span class="calibre3">
            <tt class="calibre4">Next</tt>
          </span>
        </a>
        <span class="calibre3">
          <tt class="calibre4"> | </tt>
        </span>
        <a href="../index.html#article_1">
          <span class="calibre3">
            <tt class="calibre4">Section menu</tt>
          </span>
        </a>
        <span class="calibre3">
          <tt class="calibre4"> | </tt>
        </span>
        <a href="../../index.html#feed_0">
          <span class="calibre3">
            <tt class="calibre4">Main menu</tt>
          </span>
        </a>
        <span class="calibre3">
          <tt class="calibre4"> | </tt>
        </span>
        <a href="#filepos970">
          <span class="calibre3">
            <tt class="calibre4">Previous</tt>
          </span>
        </a>
        <span class="calibre3">
          <tt class="calibre4"> | </tt>
        </span>
      </p>
      <hr class="calibre5"/>
      <p class="calibre6">
        <span class="calibre3">
          <span class="bold">Handling of domestic violence overhauled</span>
        </span>
      </p>
      <p class="calibre6">
        <span class="calibre3">
          <span class="bold">B.C. pressured to act after gaps in services cited in murder-suicide</span>
        </span>
      </p>
      <p class="calibre6">By Rob Shaw and Lindsay Kines, Times Colonist</p>
      <p class="calibre7">January 19, 2010</p>
      <a></a>
      <p class="calibre11">The B.C. government unveiled changes yesterday to the way police and Crown prosecutors handle domestic violence cases, but critics say it's not enough to plug holes in the system.</p>
      <p class="calibre11">The province will help pay for a Greater Victoria regional domestic violence unit, launch a B.C. Coroners Service panel to review domestic violence homicides and try to better co-ordinate policies between Crown and police in the wake of a tragic 2007 murder-suicide in Oak Bay, said B.C. Solicitor General Kash Heed.</p>
There is no difference (except for the inclusion of the image file in the incorrectly index article) in the "processed directory either.
nickredding is offline   Reply With Quote