#16
Wizard
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook

Could you PM me a link to the PDF+DOCX, and I'll see what I can do. If the file is as you say, and all the footnotes have "smaller superscript numbers" in their proper locations, I have a method which should be able to convert the majority of footnotes.

Finereader usually does a pretty good job of recognizing what's a footnote and actually treating it as such. It has more intelligent algorithms for sensing differences between header/body/footnote/footer. When it exports to other formats (DOCX, HTML, EPUB), it tries to properly mark the footnotes as actual footnotes.

A superscript number being the first thing in a paragraph "is most likely a footnote". This allows you to take something like:
Code:
<p><sup>123</sup> Example footnote.</p>
and mark it up as:
Code:
<p class="footnote"><sup>123</sup> Example footnote.</p>
Footnotes that interrupt a sentence split across pages:
Code:
<p>This is a sent-</p>
<p class="footnote"><sup>123</sup> Example footnote.</p>
<p class="footnote"><sup>124</sup> Another example footnote.</p>
<p>ence that gets split across pages.</p>
can then be moved to the end of the file:
Code:
<p>This is a sent-</p>
<p>ence that gets split across pages.</p>
[...]
<p class="footnote"><sup>123</sup> Example footnote.</p>
<p class="footnote"><sup>124</sup> Another example footnote.</p>
</body>
</html>
I've tested this method across tons of books, and it works, but it requires some initial massaging of the HTML.

* Note: If the multi-paragraph footnotes are also smaller text, and nothing else in the book is, this can also be used to mark paragraphs as "footnote" class. From what droopy said in Post #7, it seems like this may be the case for this specific book.

Last edited by Tex2002ans; 04-15-2020 at 09:20 PM.
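For anyone who wants to try the idea from post #16, here is a rough Python sketch of the two steps: tag every paragraph that opens with a superscript number as `class="footnote"`, then move the tagged paragraphs to just before `</body>`. This is not the tool or the regexes Tex2002ans actually uses. The function names `mark_footnotes` and `move_footnotes_to_end` are made up for illustration, and the code assumes bare `<p>...</p>` paragraphs with no attributes or nesting, as in the examples in that post. Real OCR output will most likely need the "initial massaging" mentioned there first.

```python
import re

# Assumption: a footnote paragraph is a bare <p> whose first content is <sup>digits</sup>.
FOOTNOTE_P = re.compile(r'<p>(\s*<sup>\d+</sup>.*?)</p>', re.DOTALL)

# Matches paragraphs that have already been tagged, plus one trailing newline.
MARKED_P = re.compile(r'[ \t]*<p class="footnote">.*?</p>\n?', re.DOTALL)


def mark_footnotes(html):
    """Add class="footnote" to each paragraph that opens with a superscript number."""
    return FOOTNOTE_P.sub(r'<p class="footnote">\1</p>', html)


def move_footnotes_to_end(html):
    """Pull tagged footnote paragraphs out of the flow and re-insert them before </body>."""
    notes = [m.group(0).strip() for m in MARKED_P.finditer(html)]
    if not notes:
        return html
    body = MARKED_P.sub('', html)
    block = '\n'.join(notes) + '\n'
    if '</body>' in body:
        # Insert once, directly in front of the closing body tag.
        return body.replace('</body>', block + '</body>', 1)
    # No </body> found (e.g. an HTML fragment): append at the very end instead.
    return body + block


if __name__ == '__main__':
    sample = (
        '<body>\n'
        '<p>This is a sent-</p>\n'
        '<p><sup>123</sup> Example footnote.</p>\n'
        '<p><sup>124</sup> Another example footnote.</p>\n'
        '<p>ence that gets split across pages.</p>\n'
        '</body>\n'
    )
    print(move_footnotes_to_end(mark_footnotes(sample)))
```

Once the footnotes are out of the way, the two halves of the split sentence sit next to each other, so rejoining "sent-" and "ence" becomes an ordinary broken-paragraph cleanup. A regex pass like this is only a heuristic: any paragraph that merely happens to start with a superscript number will be tagged too, so the result still needs a manual look.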
#17
Guru
Posts: 834
Karma: 2912460
Join Date: Apr 2009
Device: Kobo Forma

Hi Tex,
PM sent.
#18
Bookmaker & Cat Slave
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2
#19
Wizard
Posts: 2,306
Karma: 13057279
Join Date: Jul 2012
Device: Kobo Forma, Nook

Just seeing if the theory plays out in other materials.

Could take a teensy weensy break from the madness to write up another beast. You know I'm stuck over here in my little Economics/History/Non-Fiction bubble, and I love my footnotes! Once I see that word, I begin foaming at the mouth!

Thanks. I quickly scanned through droopy's 3 PDFs. The PDFs don't actually have superscript footnotes. The actual text uses the form:
Code:
Example sentence.<sup>1</sup>
and the footnotes use the form:
Code:
1. Example footnote.
And like I said earlier, Finereader does an okay job at detecting differences between body-text/footnotes. In this specific case, it detected most footnotes okay (definitely looks better than Word's PDF Import in that regard).

* * *

And here is roughly the rest of the PM I sent droopy:

I generated 3 types of files:
1. [Finereader] - This is a DOCX generated straight from Finereader.
2. [Toxaris] - This is the [Finereader] DOCX, which I ran through Toxaris's fantastic "EPUB Tools".
Note: It tries its best to clean up a bunch of Finereader's hidden junk, and does some basic cleanup like combining broken paragraphs. The paragraphs highlighted in red are ones that could have been broken/merged incorrectly, so you can look at them more closely and fix them manually if needed.
3. EPUB - This was generated straight from EPUB Tools using the [Toxaris] DOCX.

Because this was all OCRed (and PDF sucks + the source files weren't the greatest), there ARE going to be the usual OCR issues creeping in there:
So it's up to you... you could:
But as has been discussed on MobileRead many, many times... PDFs are awful as input formats. If you want perfectly clean ebooks, you would have to get in there and do all the manual corrections; there just ain't no way around it.

Last edited by Tex2002ans; 04-16-2020 at 09:54 PM.
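Since the PDFs in post #19 use `<sup>1</sup>` for the reference in the body and `1. Example footnote.` for the note itself, one cheap sanity check after OCR is to see whether every reference number has a matching note paragraph and vice versa. The sketch below is only an illustration of that idea, not something from the PM or from EPUB Tools. The function name `check_footnotes` is made up, and it assumes each note is a bare `<p>` that starts with a number, a period, and whitespace.

```python
import re

# Reference in the running text: Example sentence.<sup>1</sup>
REF = re.compile(r'<sup>(\d+)</sup>')

# Note paragraph in the assumed exported form: <p>1. Example footnote.</p>
NOTE = re.compile(r'<p>\s*(\d+)\.\s')


def check_footnotes(html):
    """Return (reference numbers with no note, note numbers with no reference)."""
    refs = {int(n) for n in REF.findall(html)}
    notes = {int(n) for n in NOTE.findall(html)}
    return refs - notes, notes - refs


if __name__ == '__main__':
    sample = (
        '<p>Example sentence.<sup>1</sup> Another sentence.<sup>2</sup></p>\n'
        '<p>1. Example footnote.</p>\n'
        '<p>3. A note whose reference got lost.</p>\n'
    )
    missing_notes, orphan_notes = check_footnotes(sample)
    print('references without a note:', sorted(missing_notes))
    print('notes without a reference:', sorted(orphan_notes))
```

Numbered list items look exactly like notes to this pattern, and books that restart numbering in every chapter need the check run per chapter, so treat the output as a list of places to look rather than a verdict.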
#20
mostly an observer
Posts: 1,518
Karma: 987654
Join Date: Dec 2012
Device: Kindle

I now see "wanna" used so frequently that I am beginning to suspect that some people actually believe this is standard English usage. Please tell me that it doesn't appear in Webster's Collegiate 11th edition? Must I pay $18.33 to find that the language has degraded so much since the 10th edition?
#21
Guru
Posts: 834
Karma: 2912460
Join Date: Apr 2009
Device: Kobo Forma

wanna = 5 characters
want to = 7 characters
#22
Resident Curmudgeon
Posts: 78,958
Karma: 144284074
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3

You type in the topic as you want it to be and only if it doesn't fit do you shorten it. But please when you do shorten a topic, don't do it grammatically incorrectly.
#23
null operator (he/him)
Posts: 21,605
Karma: 29710338
Join Date: Mar 2012
Location: Sydney Australia
Device: none

Spelling errors are a different matter, especially missing apostrophes.

BR
#24
Guru
Posts: 834
Karma: 2912460
Join Date: Apr 2009
Device: Kobo Forma

Hi BR. LEMME fix YER title FER YA:
Quote:

Last edited by droopy; 04-27-2020 at 06:26 PM.
#25
Bookmaker & Cat Slave
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2

You've seen me use it dozens of times, when I'm typing obvious slang. Everybody here that has ever read more than 2 posts from me knows full well I can deploy the Queen's English at will. Using Wanna, woulda, coulda...nobody will die and nobody's bits and pieces will fall off of their you-knows.

Good God, man, you act as though we're using Textspeak. Rnt U gld that wRt not?

Hitch
#26
mostly an observer
Posts: 1,518
Karma: 987654
Join Date: Dec 2012
Device: Kindle

Yer rite, Hitch. Less all spel the way we wanna.
#27
Well trained by Cats
Posts: 30,884
Karma: 59840450
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A

There is a difference between deliberate (dialect?) spelling and careless spelling / word use errors.
Would Jon correct all those great music lyrics?
Code:
Whacha gonna do when the man comes for you?
#28
Bookmaker & Cat Slave
Posts: 11,503
Karma: 158448243
Join Date: Apr 2010
Location: Phoenix, AZ
Device: K2, iPad, KFire, PPW, Voyage, NookColor. 2 Droid, Oasis, Boox Note2

I mean, they're fine, sprinkled throughout for "flavor," right? But when a character's entire dialogue is phonetically rendered, line after line after line...AGGGGH!! It doesn't seem to matter if it's Low Country, "Texan" (don't get me started), Irish, Scots, Russian...it's just grinding.

Yes, I know, there are good-selling books that have this in them; but there are bestsellers with idiotic crap like sparkly vampires and "heroines" with the character depth of a piece of paper, too. No accounting for tastes.

But once a character has been drawn, and we've "heard" his voice in our heads, OMG, give it a REST! To me, doing every single line of dialogue phonetically is the writing equivalent of exposition--telling, not showing. If your writing is so pathetic that you have to remind me, line after line, of how your character sounds, then maybe you need to go back to the drawing board and create a character that comes alive for us, ya know?

Hitch
#29
Klak
Posts: 174
Karma: 150374
Join Date: Sep 2011
Location: Belgrade, Serbia
Device: many
#30
Grand Sorcerer
Posts: 5,730
Karma: 103020299
Join Date: Apr 2011
Device: pb360