02-07-2014, 10:04 PM | #91 | |
Well trained by Cats
Posts: 29,781
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
Quote:
Later I went with Compuserve (remember CIM?) |
|
02-07-2014, 11:04 PM | #92 |
Grand Sorcerer
Posts: 12,157
Karma: 73448616
Join Date: Nov 2007
Location: Toronto
Device: Nexus 7, Clara, Touch, Tolino EPOS
|
|
02-07-2014, 11:21 PM | #93 |
Well trained by Cats
Posts: 29,781
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
02-07-2014, 11:59 PM | #94 |
null operator (he/him)
Posts: 20,553
Karma: 26954694
Join Date: Mar 2012
Location: Sydney Australia
Device: none
|
I had a timeshare account with GE in the early 70s - wrote a yachting regatta scoring program in Dartmouth Basic when my club hosted the (I forget) class world championships - i/o device was a Decwriter on a 300 baud acoustic coupler.
|
02-08-2014, 07:35 AM | #95 |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Irrelevant content deleted.
Last edited by unboggling; 01-02-2023 at 02:45 AM. Reason: Irrelevant content |
|
02-08-2014, 08:09 AM | #96 |
Almost legible
Posts: 1,457
Karma: 4611110
Join Date: Dec 2013
Location: In a high desert, CA
Device: Galaxy Note 9, Galaxy Tab A (2017), Likebook P78
|
I think what was even tougher was trying to produce a presentable report on a good old-fashioned manual typewriter... one mistake and you had to retype the entire sheet again.
|
02-08-2014, 08:58 AM | #97 | |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
http://en.wikipedia.org/wiki/Liquid_Paper
Last edited by unboggling; 02-09-2014 at 02:19 PM. |
|
02-08-2014, 10:41 AM | #98 | |
Grand Sorcerer
Posts: 12,157
Karma: 73448616
Join Date: Nov 2007
Location: Toronto
Device: Nexus 7, Clara, Touch, Tolino EPOS
|
Quote:
|
|
02-09-2014, 07:50 AM | #99 | |
Fanatic
Posts: 515
Karma: 1470724
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
|
Quote:
That was in the early 60s in Ontario, Canada, and only 30 miles from the nation's capital lol. Hard to believe someone who started out in a school like that would end up working in computers, isn't it? |
|
02-09-2014, 07:55 AM | #100 |
Fanatic
Posts: 515
Karma: 1470724
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
|
One way someone could edit a document in a word processor for publication as an ebook is to put a plain-text marker around the bold and italics manually, make sure they don't use hard breaks for anything other than paragraphs, and, when the file looks good, save it as text (that's txt, not rtf).
Next step: reload the document under a test name. Search for the bold and italic markers and change each one to the corresponding HTML bold or italic tag. Now try saving the document as HTML. This should remove a great deal of the excess formatting. |
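The marker-swapping step above can also be scripted. What follows is only a minimal sketch: the post does not say which markers were used, so the `[b]...[/b]` and `[i]...[/i]` markers and the function name `markers_to_html` are assumptions made for illustration.

```python
import html
import re

# Hypothetical plain-text markers; the post does not name the ones actually used.
MARKERS = {
    "b": re.compile(r"\[b\](.+?)\[/b\]", re.DOTALL),
    "i": re.compile(r"\[i\](.+?)\[/i\]", re.DOTALL),
}


def markers_to_html(text: str) -> str:
    """Escape the plain text, then turn each marker pair into an HTML tag pair."""
    # Escaping first keeps stray <, > and & in the text from becoming markup.
    result = html.escape(text, quote=False)
    for tag, pattern in MARKERS.items():
        result = pattern.sub(rf"<{tag}>\1</{tag}>", result)
    return result


if __name__ == "__main__":
    print(markers_to_html("A [b]bold[/b] word & an [i]italic[/i] one."))
```

Square brackets are not touched by `html.escape`, which is why the markers survive the escaping pass and can be replaced afterwards.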
02-09-2014, 12:44 PM | #101 | |||
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
Quote:
Quote:
http://daringfireball.net/projects/markdown/ (mentioned in Prefs > Input Options > TXT Input > Markdown)
Last edited by unboggling; 02-10-2014 at 07:51 AM. |
|||
02-09-2014, 02:55 PM | #102 | |
Fanatic
Posts: 515
Karma: 1470724
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
|
Quote:
The freebie tool HTML Book Fixer (no longer available anywhere I could find, but it still works well) strips the excess spans, BUT it also removes the italics if they are in a span. Most irritating. With excess nested spans it is darn near impossible to find the matching open/close tags for italics using regex, and a royal pain to "eyeball" the italics in the original.
I don't know why modern word processors don't offer an option to clean up the underlying code they use to create PDF and HTML files. The main reason I find PDF files so hard to clean up is that most were created in a WYSIWYG program. Judging from the underlying code I get in the HTML, it is usually Word or a Word clone that uses the horrid "<p class=MsoNormal><span style='mso-fareast-font-family:"MS Mincho"'>", often skipping the quotes around the class name. (Note: the font family/name is whatever font the doc used.)
I think all that excess code can lead to problems in conversions when nested too deep. I once had a problem caused by not cleaning up a file: I had not noticed that one of the nested div tags was class="chapter" and wrapped the entire chapter, while another was class="chapterHead" and wrapped the "Chapter whatever" heading. |
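Matching nested open/close span tags is hard with regex but straightforward with a parser that keeps a stack. The sketch below is not how HTML Book Fixer works; it is one possible approach using Python's standard `html.parser`, and it assumes italics are marked with an inline `font-style: italic` style (class-based italics would need an extra rule). The names `SpanFlattener` and `flatten_spans` are made up for this example.

```python
from html.parser import HTMLParser


class SpanFlattener(HTMLParser):
    """Drop every <span>, but emit <i>...</i> for spans styled as italic."""

    def __init__(self):
        # Keep entities such as &amp; untouched instead of decoding them.
        super().__init__(convert_charrefs=False)
        self.out = []
        self.span_stack = []  # one bool per open span: True if it became <i>

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            style = (dict(attrs).get("style") or "").lower().replace(" ", "")
            italic = "font-style:italic" in style
            self.span_stack.append(italic)
            if italic:
                self.out.append("<i>")
        else:
            self.out.append(self.get_starttag_text())  # copy other tags verbatim

    def handle_startendtag(self, tag, attrs):
        self.out.append(self.get_starttag_text())  # e.g. <br/>

    def handle_endtag(self, tag):
        if tag == "span":
            # The stack tells us which kind of span this close tag belongs to.
            if self.span_stack and self.span_stack.pop():
                self.out.append("</i>")
        else:
            self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")


def flatten_spans(markup: str) -> str:
    parser = SpanFlattener()
    parser.feed(markup)
    parser.close()
    return "".join(parser.out)
```

Comments and doctype declarations are not copied through in this sketch, so it suits body fragments better than whole files.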
|
02-09-2014, 04:25 PM | #103 | ||
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
Quote:
This is what I do, using the "ignore underlying code" approach. If bold or italic formatting is there to begin with in the original format, that formatting is preserved all the way through to the end (except in cases where I fix inappropriate use of bold or italic).
Now, instead of starting with AZW3, let's say I start with PDF. I would do the same sequence: convert to EPUB and assess it, convert to RTF, fix in Word (get rid of headers/footers, deal with Word's presentation of markup, fix other annoying problems) and save as DOCX, convert to EPUB, assess it in Viewer. Ignore the underlying code. Read the book. Admittedly this works best for simply formatted text-based books, starting from EPUB, AZW3, or MOBI. PDF conversions usually have more problems, so for me they're more trouble than they're worth.
Last edited by unboggling; 02-10-2014 at 09:20 PM. |
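For anyone who prefers the command line, the conversion steps in this sequence map onto calibre's `ebook-convert` tool, which takes an input file and an output file. The sketch below only builds the command lines and runs nothing. Several things are assumptions: the post does not say whether the RTF is made from the source or from the first EPUB (the EPUB is assumed here), and the output file names and the function name `conversion_commands` are invented. The Word clean-up stays a manual step.

```python
from pathlib import Path


def conversion_commands(source: str) -> list[list[str]]:
    """Build ebook-convert command lines for the round trip; nothing is executed."""
    src = Path(source)
    first_epub = src.with_suffix(".epub")  # for the initial assessment
    rtf = src.with_suffix(".rtf")          # to be opened and fixed by hand in Word
    docx = src.with_suffix(".docx")        # assumed name of Word's "Save As" result
    final_epub = src.with_name(src.stem + "-final.epub")

    commands = []
    if src.suffix.lower() != ".epub":
        # Skip this step when the source is already an EPUB.
        commands.append(["ebook-convert", str(src), str(first_epub)])
    commands.append(["ebook-convert", str(first_epub), str(rtf)])
    # Manual step happens here: fix the RTF in Word and save it as DOCX.
    commands.append(["ebook-convert", str(docx), str(final_epub)])
    return commands


if __name__ == "__main__":
    for command in conversion_commands("book.pdf"):
        print(" ".join(command))
```

Each list could be passed to `subprocess.run(command, check=True)` once calibre's command-line tools are installed and on the PATH, with a pause for the Word step before the last command.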
||
02-09-2014, 09:28 PM | #104 | |
Fanatic
Posts: 515
Karma: 1470724
Join Date: Jul 2013
Location: Quebec CA
Device: android 4 (samsung tablet and asus tablet)
|
Quote:
HTML can be obtained in several ways: opening an ePub or a Mobi file from Calibre; saving an rtf, doc or docx file as HTML in some kind of editor that handles it; or converting a pdf file to HTM or HTML using Acrobat Pro (I only have version 7 lol, I don't use it enough to buy a newer version), a word processor that can translate to HTML, or Mobipocket Creator, which generates an HTML file as part of the process of translating the prc. In other words, I translate my original document to HTML using any method I can find, perhaps even taking an old text file and going through and adding tags to it. (I can't find the PHP files I had that used a bunch of rules for creating paragraphs out of a flat txt file. It took me quite a while to write them and figure out the regex for finding all the characters found in a paragraph.)
Sometimes, if the file is horrid with nested spans and garbage, I will strip all coding from it and, using the original file, do a search for italics and bold or strong. Using two editors, I search the original file for the tag, copy enough of the text in or around the tag to find the same text in the "clean" copy, and put in clean tags.
Argh, I must apologize. This has become much too rambling and has probably bored everyone. |
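The lost PHP rules cannot be reconstructed from the post, but the general idea of building paragraphs from a flat text file can be sketched with a much simpler rule. The example below assumes that paragraphs are separated by blank lines and that single line breaks are just hard wraps; real files often need more rules than that. The function name `text_to_paragraphs` is made up for this illustration.

```python
import html
import re


def text_to_paragraphs(text: str) -> str:
    """Wrap blank-line-separated blocks of a flat text file in <p> tags."""
    # A blank line (possibly containing spaces or a \r) ends a paragraph.
    blocks = re.split(r"\n\s*\n", text.strip())
    paragraphs = []
    for block in blocks:
        # Join hard-wrapped lines inside one paragraph with single spaces.
        joined = " ".join(line.strip() for line in block.splitlines() if line.strip())
        if joined:
            paragraphs.append(f"<p>{html.escape(joined, quote=False)}</p>")
    return "\n".join(paragraphs)


if __name__ == "__main__":
    sample = "First line\nwraps here.\n\nSecond & last paragraph.\n"
    print(text_to_paragraphs(sample))
```

Bold and italic tags would still have to be added afterwards, for example by the search-and-copy method described in the post.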
|
02-10-2014, 03:40 AM | #105 | |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
btw, take a look at Toxaris' Word macro for clean HTML code: https://www.mobileread.com/forums/showthread.php?t=142530 (for Word on Windows or OS X)
Last edited by unboggling; 02-13-2014 at 08:56 PM. Reason: clarify. |
|
Tags |
calibre workflow, ebook management strategy, ebook management workflow |