Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Formats > ePub

Notices

Reply
 
Thread Tools Search this Thread
Old 04-07-2019, 09:34 PM   #1
lumpynose
Wizard
lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.
 
Posts: 1,086
Karma: 6719822
Join Date: Jul 2012
Device: Palm Pilot M105
project gutenberg us formating style/method

Whenever I look at the html of an epub I've downloaded from the US Project Gutenberg I'm always rolling my eyes. Epubs from other free book sites aren't anywhere nearly as contorted and antediluvian. Even recent books are full of weird stuff (to me).

Anyhow, I was wondering if anyone knows why they do what they do.
lumpynose is offline   Reply With Quote
Old 04-08-2019, 03:38 AM   #2
HarryT
eBook Enthusiast
HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.
 
HarryT's Avatar
 
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
Moved to the ePub forum.
HarryT is offline   Reply With Quote
Advert
Old 04-08-2019, 06:12 AM   #3
HarryT
eBook Enthusiast
HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.
 
HarryT's Avatar
 
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
What did you have in mind? I've recently looked at quite a few PG ePubs and the formatting seemed fairly reasonable to me. The only slightly unusual feature I noticed is the way they add the labels for physical page numbers with, for example:

Code:
<span class="pagenum pncolor"><a id="page_10"></a>10</span>
using a media query:

Code:
@media screen {
    hr.ppg-pb {
        margin: 30px 0;
        width: 100%;
        border: none;
        border-top: thin dashed silver
        }
    .pagenum {
        display: inline;
        font-size: x-small;
        text-align: right;
        position: absolute;
        right: 2%;
        padding: 1px 3px;
        font-style: normal;
        font-variant: normal;
        font-weight: normal;
        text-decoration: none;
        background-color: inherit;
        border: 1px solid #eee
        }
    .pncolor {
        color: silver
        }
    }
I quite like the appearance of this, though: an inconspicuous boxed page number in light grey ("silver") in the right margin.
HarryT is offline   Reply With Quote
Old 04-08-2019, 03:52 PM   #4
lumpynose
Wizard
lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.
 
Posts: 1,086
Karma: 6719822
Join Date: Jul 2012
Device: Palm Pilot M105
Yeah, I think I'm just ranting, perhaps more about the older books. Don't ereaders typically start a new page at the start of each .htm file? A lot of the Gutenberg books have the front matter (e.g., the Gutenberg copyright notice in a fixed width font) in the first .htm file with table of contents and even the first chapter. And then they use a bunch of br tags, I'm guessing to try to push things to the next page.

I've backed myself into a corner and decided I must use Sigil's mend and prettify on all of the epub books I've downloaded and then converting them (again) to awz3. Then I see the first file stuff and roll my eyes.

Last edited by lumpynose; 04-08-2019 at 03:53 PM. Reason: changed start of the .htm file to start of each .htm file
lumpynose is offline   Reply With Quote
Old 04-10-2019, 04:06 AM   #5
HarryT
eBook Enthusiast
HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.HarryT ought to be getting tired of karma fortunes by now.
 
HarryT's Avatar
 
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
They use a “page-break-before: always” on the header styles in the CSS to get a page break before the various elements they put in the initial (and final) source file, which is a perfectly reasonable way to do it.

When I edit a PG ePub for my own purposes, I always split them up so each element of the introductory material gets its own source file. This is very easy to do in the Calibre editor.
HarryT is offline   Reply With Quote
Advert
Old 04-10-2019, 02:23 PM   #6
lumpynose
Wizard
lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.lumpynose ought to be getting tired of karma fortunes by now.
 
Posts: 1,086
Karma: 6719822
Join Date: Jul 2012
Device: Palm Pilot M105
Quote:
Originally Posted by HarryT View Post
They use a “page-break-before: always” on the header styles in the CSS to get a page break before the various elements they put in the initial (and final) source file, which is a perfectly reasonable way to do it.

When I edit a PG ePub for my own purposes, I always split them up so each element of the introductory material gets its own source file. This is very easy to do in the Calibre editor.
Great, thanks. I've been splitting them like you do as well.
lumpynose is offline   Reply With Quote
Old 04-12-2019, 12:47 PM   #7
BobC
Guru
BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.BobC ought to be getting tired of karma fortunes by now.
 
Posts: 691
Karma: 3026110
Join Date: Dec 2008
Location: Lancashire, U.K.
Device: BeBook 1, BeBook Pure, Kobo Glo, (and HD),Energy Sistem EReader Pro +
Quote:
Originally Posted by HarryT View Post
They use a “page-break-before: always” on the header styles in the CSS to get a page break before the various elements they put in the initial (and final) source file, which is a perfectly reasonable way to do it.
But they also split chapters between files in mid-chapter even when there is no need from a size point of view to do it. That means on most readers that you get a bogus page break in mid chapter.

For my own use I usually merge everything from Gutenberg into a single big file then get Calibre to split at proper chapter breaks. I also rename all the files into something much shorter and more meaningful.

BobC
BobC is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
There are several different Project Gutenberg owly General Discussions 2 12-22-2012 10:48 PM
Project Gutenberg... carpetmojo General Discussions 37 12-06-2011 09:33 AM
Project Gutenberg crutledge Sigil 4 01-22-2011 01:49 PM
Project Gutenberg levi_john Workshop 17 07-26-2010 06:02 PM
Project Gutenberg interview ardeegee News 4 11-09-2009 07:02 PM


All times are GMT -4. The time now is 07:38 AM.


MobileRead.com is a privately owned, operated and funded community.