#61 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
my guess is that, of the 7 million volumes google will scan at umichigan, just as one example, 99.98% of them will have pagenumbers in them...

those are the books that will form the cyberlibrary of the future, and thus those are the books that we need to find a way to make _pointers_ into...

as pagenumbers have been the pointer-system used on them up until now, we'll need to create digital means so that we can continue to support that, and that infrastructure will allow us to continue using pagenumber pointers.

yes, we'll have other means too, but we'll need to make pagenumbers work. luckily, as i believe i've shown in the examples i've posted, it's not too hard.

-bowerbird
#62 | |
eBook Enthusiast
Posts: 85,544
Karma: 93383099
Join Date: Nov 2006
Location: UK
Device: Kindle Oasis 2, iPad Pro 10.5", iPhone 6
|
Quote:
|
|
#63 | |
Gizmologist
Posts: 11,615
Karma: 929550
Join Date: Jan 2006
Location: Republic of Texas Embassy at Jackson, TN
Device: Pocketbook Touch HD3
|
Quote:
This morning, however, some of the overnight comments have got me thinking maybe we're making this too complicated. We're talking about computers here, and computers do boring, repetitive functions fast and without complaint. Why not have the reading application generate some sort of text index?

It could be as simple as a straight character count (which would get ... rather large), or it could be some sort of graduated count by chapter and then paragraph and then character. For instance, 10.3.400-475 would be chapter 10, paragraph 3, starting at character 400 and running to character 475.

I'm not pushing for that specifically, just making a "top of my head" example. The important bit is that it be an agreed-upon standard, and that it be repeatable. The reading app can generate the reference and locate the point in the text from the reference. Of course, those needs will have to be met whatever the eventual system ends up being.
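A reference of the 10.3.400-475 kind can be sketched in a few lines. This is only an illustration, not an agreed standard: the names `TextRef`, `parse_ref`, `make_ref` and `resolve` are made up here, and it assumes chapters, paragraphs and characters are all counted from 1 with an inclusive character range, which the post does not specify.

```python
import re
from typing import List, NamedTuple


class TextRef(NamedTuple):
    """Hypothetical chapter.paragraph.start-end reference (all 1-based)."""
    chapter: int
    paragraph: int
    start: int
    end: int


_REF = re.compile(r"^(\d+)\.(\d+)\.(\d+)-(\d+)$")


def parse_ref(ref: str) -> TextRef:
    """Turn a string such as '10.3.400-475' into a TextRef."""
    m = _REF.match(ref.strip())
    if m is None:
        raise ValueError(f"not a chapter.paragraph.start-end reference: {ref!r}")
    chapter, paragraph, start, end = map(int, m.groups())
    if min(chapter, paragraph, start) < 1 or end < start:
        raise ValueError(f"reference out of order or not 1-based: {ref!r}")
    return TextRef(chapter, paragraph, start, end)


def make_ref(chapter: int, paragraph: int, start: int, end: int) -> str:
    """Generate the reference string the reading app would show the user."""
    return f"{chapter}.{paragraph}.{start}-{end}"


def resolve(book: List[List[str]], ref: TextRef) -> str:
    """Look up the referenced characters.

    Assumption: `book` is a list of chapters, each a list of paragraph strings.
    """
    para = book[ref.chapter - 1][ref.paragraph - 1]
    if ref.end > len(para):
        raise ValueError("reference runs past the end of the paragraph")
    # Convert the 1-based inclusive range to a Python slice.
    return para[ref.start - 1 : ref.end]
```

Because the reference counts paragraphs and characters rather than screen pages, it stays the same however the text is reflowed. It only stays valid, though, as long as every copy of the text splits chapters and paragraphs identically.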
|
#64 | ||||||
Gizmologist
Posts: 11,615
Karma: 929550
Join Date: Jan 2006
Location: Republic of Texas Embassy at Jackson, TN
Device: Pocketbook Touch HD3
|
Quote:
But then, I don't regard discussing and exploring solutions in a respectful manner to be "blowing sunshine up someone's behind" either. I guess I gave up the personal illusion that I could give the Final, Infallible, and Only Answer on sweeping matters some time ago.

One of the side-effects of discussing things politely and respectfully, even when the discussers disagree, is that people continue to consider what's being said, and don't skip, blow off, or otherwise Ignore comments by people who discuss things in such a fashion. Having the best point in the world, or being absolutely right, is pretty worthless if no one will listen. And it is really rather sad if no one listens because they're tired of the tone the commenter takes with those who disagree with him.
Quote:
Quote:
Quote:
Quote:
Quote:
In any case, now that the posts in question are here where the discussion is continuing, others may find in them points worth responding to. |
#65 |
fruminous edugeek
Posts: 6,745
Karma: 551260
Join Date: Oct 2006
Location: Northeast US
Device: iPad, eBw 1150
|
As a doctoral student, I'm pretty much stuck with having to reference printed page numbers, but I'd like to see a transition to paragraph numbers in the future, to better support electronic reflowable documents. I think we'll have to support both for the foreseeable future, to allow references to pre-electronic documents, even those that have been converted to digital form. Some kind of embedded semantic tagging for each of these methods of identifying text location that can be shown or hidden at will would be great.
|
#66 |
Connoisseur
Posts: 54
Karma: 29
Join Date: Oct 2006
|
Natch, thanks for bringing over the info from the other thread (I kind of gave up reading that thread after the 'debates' started). After reading through the post above I have a question.

Bowerbird, can I ask the reasoning behind splitting the document into individual pages? Couldn't you point to the page content using http://z-m-l.com/go/myant/myantp.html#189 as opposed to http://z-m-l.com/go/myant/myantp189.html? That way the whole content of the book is in one file and conversion to other formats would be easier. For example, how do you recognize when a paragraph splits across two pages, and how do you join them back together when converting? You might have a good reason that I haven't considered, so I would like to hear your take on it.
#67 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
natch said:
> When the comment is little more than distilled sarcasm,
> with no actual content, it's not criticism, in my book,
> It comes closer to sniping.

"with no actual content"? did you not get the content in that post of mine? if so, then let me explain it to you a little bit more directly...

_lots_ of people have already spent _lots_ of time and energy thinking about these questions, running up solutions, and actually putting _even_more_ of their own time and energy to code experimental solutions so that they could be tested.

the results have largely confirmed what most of us suspected, namely that there is no reliable way to point to a piece of info if someone (else) has the ability to change that info any time, up to and including the option of completely _removing_ it... because, hey, it's hard to point to something that ain't there.

a fact which -- in retrospect -- seems to be fairly "obvious", and which might have been a tip-off from the very beginning that maybe this was one of those problems with no solution...

because, realistically, that _is_ the situation which we're in. someone (else) _is_ going to have control over the info that we want to point to. it's called copyright, and it's our burden.

furthermore, when someone here suggests that the people over at i.p.d.f. should pay some attention to this question, that implies that i.p.d.f. has _not_ paid any attention to it... when the fact of the matter is that they _have_. they've paid more attention than you know, including enough attention to understand (which y'all here don't seem to have grasped) that this is one of those problems with no solution, or at least no "really good solution". so to imply that they "need to consider this" is _stupid_...

so here's my choice.

i can either use a little bit of sarcasm, which -- last i checked -- is considered a form of _humor_ (albeit not as happy-go-lucky and feel-good as slapstick), or i can instead go for the "explain everything to them like they were a bunch of second-graders, and let the fact that they've ignored some basic reality give the solid impression that they're not just second-graders, but kinda stupid ones, even though that ain't the impression i _want_ to leave...".

i went for the form of humor. was that a mistake?

-bowerbird
#68 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
sartori said:
> Bowerbird, can i ask the reasoning behind
> splitting the document into individual pages?

first of all, my e-books _can_ exist in several forms. the individual-pages form is just one of those forms. but i can (and do) spin out "whole-book" forms too... (plus chapter-by-chapter forms, for some purposes.)

i pointed to the page-by-page form because this topic -- page-based referencing within scholarly situations -- is one whose basic requirements call out for that form...

to see the "master file" for "my antonia", look here:
> http://z-m-l.com/go/myant/myant.zml

(as you see, the master itself is in whole-book form.)

that "master" _generated_ the page-by-page form...

the page-by-page form has many intended purposes.

its first major purpose is to facilitate _proofreading_... you want to do proofreading on a page-by-page basis; you want the page-scan to be shown alongside the text; and you want the text to contain the original linebreaks. this format is geared toward those proofreading needs...

(this is a "final-stage" proofing interface, where errors are "reported", because there are very few. for earlier stages of proofreading, where there might be many more errors, we'll want an interface that lets us fix them more directly.)

the next major purpose of it is for _confirming_accuracy_. we want to give people an ability to confirm our digitization, to satisfy themselves we did that conversion job correctly... to do so, we show them our text and the original page-scan, so they can do a direct comparison and see for themselves...

the third major purpose is the one we're discussing here -- the ability for people to make a pointer to a specific page... and -- as i have said -- the reason we need to facilitate that is because our cultural heritage is full of page-based pointers.

and again, we _could_ point them to a place with just the text, but everyone knows that text can be easily "edited", so we also put the original page-scan up so as to increase the trust factor. (of course, scans could _also_ be doctored, but at some point, there's only so much you can do.)

> Couldn't you point to the page content using
> http://z-m-l.com/go/myant/myantp.html#189
> as opposed to
> http://z-m-l.com/go/myant/myantp189.html.

sure. and sometimes that's what you'll want to do instead.

but let me show you something. stopwatch this link:
> http://z-m-l.com/go/myant/myantp189.html

now check the length of time it takes to go to this one:
> http://www.openreader.org/myantonia/...a.html#page189

unless that second page was already in your cache or you have a superfast connection, it took _lots_ longer to load, because you're loading in some 500k of text -- the whole book -- instead of 1k of text and a scan. (for the dialup users, the second file will be _painful_.)

so it depends on what you need your readers to load... if you only need them to load one page of text, do that. if you need them to load the whole book, then do _that_.

you'll notice that the second link doesn't include the scans in-line in the file; you have to click a link to view each one. (the scans run to 30 megs, so it'd be suicide to load 'em all.)

so it depends on what you need. if you wanted to point to one page in each of 50 books, you wouldn't want to force your reader to load each of the 50 books in full just to see that one page. and this is often the essence of a scholarly reference section.

so it depends. this is why we need the flexibility to quickly and easily auto-generate whatever format is needed at the time...

> That way the whole content of the book is in one file
> and conversion to other formats would be easier.

in sum, i pointed to a page-based form because of this discussion... i can also create book-based forms when _that_ is more appropriate. (such flexibility is one reason i invented my z.m.l. format, which is a sidetrack topic in that other thread from which this one came...)

> For example how do you recognize when a paragraph splits across
> two pages and how do you join them back together when converting?

good question. but easy answer.

in a "master" file which has pagebreaks marked, like this one:
> http://z-m-l.com/go/myant/myant.zml

the formula for generating a version _without_ the pagebreak info is to:

1. delete the _one_ blank line _above_ the [[doublebracketed]] pagenumber, and delete the _one_ blank line _below_ the {{doublebraced}} scan-filename...
2. if there were _two_ blank lines above and below, respectively, then that was a paragraph break, so you should insert a blank line in the output file.

if you follow that rule, you'll find that paragraphs which cross pagebreaks get joined together, while the ones that ended on the pagebreak still do...

for instance, in the .zml master, compare the breaks between these pages:
> http://z-m-l.com/go/myant/myantp040.html
> http://z-m-l.com/go/myant/myantp041.html

versus:
> http://z-m-l.com/go/myant/myantp061.html
> http://z-m-l.com/go/myant/myantp062.html

see how easy it was for me to point you to those pages specifically? and also the _usefulness_ of being able to see both text _and_ scan?

-bowerbird
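The two-step rule for dropping pagebreak markers can be sketched roughly as follows. This is one reading of the description in the post, not the actual z.m.l. tooling: the function name `strip_pagebreaks` is hypothetical, the sketch assumes each `[[...]]` pagenumber line is immediately followed by a `{{...}}` scan-filename line, and it treats the break as a paragraph break only when there are at least two blank lines on both sides.

```python
import re
from typing import List

# Assumed marker shapes: a whole line in [[...]] followed by a whole line in {{...}}.
PAGE_MARK = re.compile(r"^\[\[.*\]\]$")
SCAN_MARK = re.compile(r"^\{\{.*\}\}$")


def strip_pagebreaks(master: str) -> str:
    """Return the text with pagebreak markers removed.

    Paragraphs that cross a page get rejoined; paragraphs that ended
    on the pagebreak stay separated by one blank line.
    """
    lines = master.split("\n")
    out: List[str] = []
    i = 0
    while i < len(lines):
        is_marker = (
            PAGE_MARK.match(lines[i].strip())
            and i + 1 < len(lines)
            and SCAN_MARK.match(lines[i + 1].strip())
        )
        if not is_marker:
            out.append(lines[i])
            i += 1
            continue

        # Count and drop the blank lines already emitted above the marker.
        above = 0
        while out and out[-1].strip() == "":
            out.pop()
            above += 1

        # Skip the two marker lines, then count and skip the blank lines below.
        i += 2
        below = 0
        while i < len(lines) and lines[i].strip() == "":
            i += 1
            below += 1

        # Two blank lines on each side meant the paragraph ended at the pagebreak.
        if above >= 2 and below >= 2 and out and i < len(lines):
            out.append("")
    return "\n".join(out)
```

The sketch leaves the original linebreaks inside each paragraph untouched, so it only answers the page-joining question. Unwrapping lines and end-line hyphens would be a separate step.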
#69 | |
Gizmologist
Posts: 11,615
Karma: 929550
Join Date: Jan 2006
Location: Republic of Texas Embassy at Jackson, TN
Device: Pocketbook Touch HD3
|
Quote:
Sarcasm is indeed used humorously on the forum a great deal, but it has to be well telegraphed as humor, because things like tone of voice don't come through in text without a good deal of effort, and they can easily be taken the wrong way. Because of that, it also requires a willingness to step back from it and clarify what was meant when it doesn't come across as funny, even to the point of apologizing for giving offense that was never intended.

You come across as seeming to consider anyone who doesn't see things your way to be an imbecile, and people are starting to assume that you mean to be abrasive even when you don't. I've noticed this, but if you have, you have given no sign of it. You have managed to get more folks to put you on ignore in a week than I've seen happen in the preceding almost two years that I've been around MR. These are the results of the absence of the respect for which you have expressed such scorn: you are driving folks away even as you claim to wish to persuade them.

I and several others have put significant amounts of effort into attempting to communicate this to you, but you seem to regard those efforts as aimed at getting you to shut up. If the moderators here wanted to stifle you, as you seem to believe we do, we wouldn't have resorted to talking to you to do so. The fact that we have ought to tell you something all by itself.

I've reached the point where I simply don't know what else to say to you.
|
#70 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
natch said:
> Talk to people like they're actually functionally intelligent

i do! whenever people strike me as being "functionally intelligent". in addition, if they strike me as being stupid, i talk to them like that. but people who want me to talk to them as "functionally intelligent" when they're holding up their end of the conversation with stupidity, they get _sarcasm_ from me. because that's the best they _deserve_.

> even to the point of apologizing
> for giving offense that was never intended.

did you _intend_ to offend me with this sentence? or with your post as a whole? more importantly, are you ready to _apologize_ for doing so?

> you are driving folks away even as you claim to wish to persuade them.

hold it there. i never said i "wish to persuade" _anyone_ of _anything_. in fact, i expressly disclaim that as an intention, wholly and completely. frankly, i don't care what anyone thinks, if they disagree or agree with me. i speak my mind, and you can make of it whatever you wish, fine by me...

> You have managed to get more folks to put you on ignore in a week than
> I've seen happen in the preceding almost two years that I've been around MR.

some people don't want to hear anyone else speak frankly. so what? others take offense much too easily, especially the insecure. so what?

i too ignore a lot of what i read here, because it has very little truth value. it doesn't make sense. when i weigh it as evidence, it registers no mass...

i don't bother to filter out what people say, because i've found that it's not generally a good idea to stick my head in the sand, but if other people want to stick their head in the sand, i'm totally fine with that. indeed, i would prefer that people put me on "ignore" than try to chastise me for speaking my truth.

i'm not "rude". i'm a gentle soul who believes in truth, and has enough respect for my fellow human beings to be honest with them when they're being stupid, honest enough to tell them directly.

if you think that's a bad thing, i suggest that you too put me on "ignore", so my words will magically be turned into white space and you live in ignorant bliss. sincerely, i want you to be happy.

-bowerbird

Last edited by bowerbird; 11-07-2007 at 06:36 PM. Reason: because somebody messed with my formatting...
#71 |
Connoisseur
Posts: 54
Karma: 29
Join Date: Oct 2006
|
Bowerbird,

Your reasoning makes sense to me (in response to my question). So some more questions, if you don't mind.

When you receive an error notification, do you just update the master file and then regenerate the paged version? Or vice versa? Or for small updates do you just make the change in both versions?

On page 61 (http://z-m-l.com/go/myant/myantp061.html) I noticed that a few words are hyphenated across lines. On your master view the words are correctly joined (tea-kettle & followed). Were these manually corrected or automated? If automated, did it correctly catch that tea-kettle should keep its hyphen?

I'm not sure if z.m.l. is the way I want to go with my formatting, but I'm still at the early stages of formatting so I'm just checking out options (googling for ebook markup languages is hopeless, as you just get a ton of responses that are actual ebooks).

Thanks,
rob
#72 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
sartori said:
> So some more questions if you don't mind

i don't mind a bit. that's why i'm here, to discuss...

> When you receive an error notification do you just
> update the master file then regenerate the paged version?
> or vice-versa? or for small updates
> do you just make the change in both versions?

if you go to the directory now, you'll see a bunch of files:
> http://z-m-l.com/go/myant/

including all of the .html files to which i've been linking... the .html files were generated in a batch from the master.

but eventually, all the separate .html files will disappear. they'll be replaced by a script which intercepts links like this:
> http://z-m-l.com/go/myant/myantp061.html

and creates that .html file on-the-fly... so yes, any correction will be made to the master, after which the script will include it when it builds the .html file next time.

> On page 61 (http://z-m-l.com/go/myant/myantp061.html)
> I noticed that a few words are hyphenated across lines.
> On your master view the words are correctly joined
> (tea-kettle & followed).

um, as far as i can tell, you're mistaken. here's the master:
> http://z-m-l.com/go/myant/myant.zml

what i see there, in the master, is this:
> Peter shuffled to his feet, caught up the tea-
> kettle and mixed him some hot water and
> whiskey. The sharp smell of spirits went
> through the room.
>
> Pavel snatched the cup and drank, then
> made Peter give him the bottle and slipped
> it under his pillow, grinning disagreeably,
> as if he had outwitted some one. His eyes fol-
> lowed Peter about the room with a contempt-
> uous, unfriendly expression. It seemed to
> me that he despised him for being so simple
> and docile.

do you really see something different? if so, that's a mystery...

> Were these manually corrected or automated?
> If automated did it correctly catch tea-kettle should keep its hyphen?

not all of the example-files that i have up are correct on this point yet, but they'll be marked as to whether an end-line hyphen is kept or not...

so, if "tea-kettle" -- with the dash -- is the form used in this book (when the word is mid-sentence), then the master will look like this:
> Peter shuffled to his feet, caught up the tea-@
> kettle and mixed him some hot water and

(i haven't decided if we'll use the at-sign, but you get the idea.)

on the other hand, if this book uses "teakettle", the master will say:
> Peter shuffled to his feet, caught up the tea-
> kettle and mixed him some hot water and

(for the record, this book does indeed use "tea-kettle" in the one other instance where the word occurs. in the cases where there is no other use of an end-line hyphenate, we consult the dictionary. when there is inconsistency within a book, we edit to consistency.)

> I'm not sure if z.m.l. is the way I want to go with my formatting but
> I'm still at the early stages of formatting so I'm just checking out options

i definitely suggest light-markup. "markdown" is the current favorite, if you want broad support. my tool-chain is approaching coherence, so you could get the job done, but markdown gives you more reliability. google "showdown" and "markdown" for an interesting real-time demo:
> http://www.attacklab.net/showdown-gui.html

-bowerbird
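The end-line hyphen convention described above could be applied when unwrapping a paragraph along these lines. It is only a sketch under stated assumptions: `unwrap_paragraph` is a made-up name, the `-@` marker is the tentative one from the post (which says the at-sign is not decided), and the guard that leaves a trailing `--` dash alone is an addition that the post does not mention.

```python
from typing import List

KEEP_MARK = "-@"  # tentative "keep this hyphen" marker, per the post


def unwrap_paragraph(lines: List[str]) -> str:
    """Join the hard-wrapped lines of one paragraph into a single line."""
    pieces: List[str] = []
    for raw in lines:
        line = raw.strip()
        if line.endswith(KEEP_MARK):
            # "tea-@" + "kettle" -> "tea-kettle": drop the "@", keep the hyphen.
            pieces.append(line[:-1])
        elif line.endswith("-") and not line.endswith("--"):
            # "fol-" + "lowed" -> "followed": the hyphen only marked the linebreak.
            pieces.append(line[:-1])
        else:
            # Ordinary line end (or a "--" dash): rejoin with a space.
            pieces.append(line + " ")
    return "".join(pieces).rstrip()
```

With the decision recorded in the master, the unwrapping step needs no dictionary lookup of its own. The dictionary and consistency checks the post describes happen once, when the marker is added.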
#73 |
Connoisseur
Posts: 54
Karma: 29
Join Date: Oct 2006
|
Bowerbird - sorry, I didn't mean the master; I meant the html view that you listed.

I like the showdown stuff - it seems a little limited as far as layout goes, but it looks really easy to use. Thanks.
#74 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
sartori said:
> I like the showdown stuff - seems a little limited as far as layout

depends on what you want to do with a book, and what platforms you want to put it out to... you can often exercise tight control in _one_ setting, but then it blows up on you when you try to move it...

a good rule of thumb is that if you cannot do it with light-markup, then you shouldn't be doing it anyway, because it's not gonna convert well to other settings. so living with some "limitations" from the beginning can save you a truckload of heartburn down the road.

but, you know, your demo showed you've got chops... so i'd encourage you to let your mind experiment fully.

-bowerbird
#75 |
Banned
Posts: 269
Karma: -273
Join Date: Sep 2006
Location: los angeles
|
sartori said:
> sorry didn't mean the master I meant the html view that you listed.

except i still don't follow. the individual-page .html file shows end-line hyphenates just like the scan:
> http://z-m-l.com/go/myant/myantp061.html

-bowerbird