View Single Post
Old 08-12-2011, 04:20 AM   #1
pgmariotti
Junior Member
pgmariotti began at the beginning.
 
Posts: 4
Karma: 10
Join Date: Jul 2011
Device: Windows Phone 7
How to decompose content server metadata?

Hi Korvid.

I am downloading ebook data from the calibre library via the content server into an ebook reader I am developing.

I am using the new feature of Calibre to download the metadata.opf file, but the result I get is not easily understood in terms of fields and relative lengths.

What I get is as follows

[http://calibre-ebook.com]2010-02-16T00:00:00+00:00<div><p class="description">EDITORIAL REVIEW:</p>
<p class="description">The first interstellar starship, *John Glenn,* fled a Solar System populated by rogue AIs and machine/human hybrids, threatened by too much nanotechnology and rife with political dangers. The *John Glenn’*s crew intended to terraform the nearly pristine planet Ymir, in hopes of creating a utopian society that will limit intelligent technology. </p>
<p class="description">But by some miscalculation they have landed in another solar system, and extremely low on the antimatter needed to continue to Ymir, they must shape the nearby planet Harlequin’s moon, Selene, into a new, temporary home. Their only hope of ever reaching Ymir is to rebuild their store of antimatter through decades of terraforming the moon. </p>
<p class="description">Gabriel, the head terraformer, must lead this nearly impossible task, with all the wrong materials. His primary tools are the uneducated and nearly illiterate children of the original colonists, born and bred to build Harlequin’s moon into a virtual antimatter factory. With no concept of the future and with life defined as duty, one girl, Rachel Vanowen, begins to ask herself the question: what will become of the children of Selene once the terraforming is complete.</p>
<p class="description"> (20050916)</p></div>Tor Science Fiction9780765351296en-GBScience FictionFictionAdventureScience Fiction - GeneralFiction - Science FictionScience Fiction - High Tech


I have no problem understanding and extracting the Html for the editorial review, but I find it impossible to reconstruct the original Xml from

Tor Science Fiction9780765351296en-GBScience FictionFictionAdventureScience Fiction - GeneralFiction - Science FictionScience Fiction - High Tech

Can you please advise how I can obtain publisher, series, tags etc. from this output?

Paul G Mariotti
pgmariotti is offline   Reply With Quote