09-03-2011, 01:36 PM | #196 |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
My basic approach to strategy and workflow is fundamentally back to what I started with in KISS Post #1 plus some refinements, tweaks, and expansion of detail and discussion throughout. (Edit: re strategy back to start, exception: going slow is a major change.)
I'll be cleaning up any minor formatting and link glitches and whatever other small problems I find for a few hours if necessary. I considered all feedback, input, and discussion in this thread from start to present, and incorporated what I could into my own work habits. The rule I'm using for inclusion of content changes is: if I haven't tested it myself and worked with it for awhile, it doesn't go in except where I note it as a recommendation from people at MobileRead. If you think I missed including something important, please let me know - I went through 5 pages of scrawled poor handwriting to add/revise content based on responses in this thread, and I probably missed things. As usual, any feedback, input, and discussion from whoever cares to participate will be helpful to me for the next revision. Last edited by unboggling; 09-05-2011 at 02:39 AM. Reason: Link to KISS v0.60, exception |
09-04-2011, 04:30 AM | #197 | |
Wizard
Posts: 3,450
Karma: 10484861
Join Date: May 2006
Device: PocketBook 360, before it was Sony Reader, cassiopeia A-20
|
Quote:
I have created a new user column, called "Clear". I go through my "Kitchen sink and everything else mess imported from God-Knows-Where" Calibre Library and I am going Author by author, and correcting metadata, gathering covers, descriptions, series info (Series are extremely important metadata), merging duplicates, examining what version to ditch during merge (*). Then I mark the book[s] "Clear". I use your plugins heavily in the process. (*) I never delete formats I find "not useful" at the moment. Experience taught me that sooner or later you are going to get a device that supports different bunch of formats, and disk space is *SO* cheap nowadays (well, not if you collect HD movies, but 500kB *.lit file it costs very little to store, Just-In-Case) |
|
Advert | |
|
09-04-2011, 04:58 AM | #198 |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
@kacir, so you use just one library for everything?, do a pass (requiring months maybe?) to work by author from A to Z, meanwhile adding whatever you download, then start over with A-Z iteratively forever? Or do you eventually move the "clear" ones to a Clear Library?
I keep all downloaded formats, even when they are many or bad, in a Raw Books folder on a 2TB hard drive and try to use only the best of each lot inside calibre. For example, after trashing everything and restarting I downloaded all the free Baen CDs, put them in expanded folders into Raw Books/Pending/Baen CDs/CDnameFolder and started adding only the best EPUB or in a few cases a MOBI if no EPUB, one CD at a time which has usually mostly the same author. So far I've done 4 CDs. Using just one library. I also keep the original compressed downloads of important things in a different folder that also has subfolders for things like application installers, including old versions of those. Because disk space is cheap and just seems to keep getting cheaper. Last edited by unboggling; 09-04-2011 at 12:09 PM. Reason: inserted "/Pending/" |
09-04-2011, 05:56 AM | #199 |
Calibre Plugins Developer
Posts: 4,637
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
@kacir - trying to clean up the "kitchen sink" an author at a time in the same library was also my initial approach. I can't recall all the things I experimented in terms of a tag or a custom column to make a book as processed but I also used a user category to mark an author as processed amongst them.
However the issue I found was twofold. Primarily that the Calibre performance was pretty dire - taking many seconds or sometimes minutes to perform simple operations. A lot of work has been done on Calibre performance since that time to improve things, but it made for intolerably slow performance at the time. Also back then I had not yet written a number of the plugins like Find Duplicates to assist. The second reason was more the sense of lack of progress, every time I opened the library I was faced with the bulk of it being unprocessed, and the books I had done kind of got lost in the morass. Putting just the cleaned ones into a new library gave me a sense of achievement when I opened that up to know that all the books contained within were ready for reading. And also importantly the books I most wanted to be reading, rather than including many others that I had just collected for the sake of collecting. Of course you can use a search restriction to similar effect, but with the performance issues I had I chose a clean start instead. As for the formats thing, I don't disagree that space is cheap but the more files you have you will pay a penalty somewhere. Whether it is at the time you do a search in Calibre, the time you do a search via the file system or doing a backup. As for the formats themselves, well it depends on whether you own a legacy reader (I don't) or not. Formats like lit, lrf, etc are dead as a dodo. Txt, rtf, doc are a poor man's ePub. HTML is a hop skip to an ePub. PRC, AZW - just use MOBI (which in itself is easily converted to/from ePub). These are my opinions based on the devices I own. Of course there will likely be some other format come out one day to obsolete ePub - but I am hedging my bets that given its dominance a product like Calibre will ensure you can convert from ePub to it. And quite apart from any of that - keeping say an ePub and a LIT format of a book will always leave a question in your mind as to which version is the "better" one if they were not converted from the same source. Which you can only resolve by inspecting. Better imho to do that job once, make your decision (converting the LIT to overwrite the ePub if it wins) and move on. But we all have our own ways, that is part of the fun |
09-04-2011, 06:49 AM | #200 | |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
I think I want to start deleting everything except the format I put in calibre and sometimes cleanup. But that raises the question of when to delete those other formats. If I don't clean it right away, I might want to refer to one of those other formats in the future when I do clean the one in calibre or even use it instead if I find a big problem while cleaning. So then there's the mechanical question of how do I automatically remember things need to be deleted if I don't delete them right away at Add time? It would have to be at Clean time. But Clean time doesn't apply to all books. btw, a couple hours ago I put reference and links to your workflow and metadata bigposts into 0.60 after posting, using an ((Edit date)) marker. I'd integrated some of your ideas already and am still thinking about other stuff in those posts that's not integrated into my workflow yet. Last edited by unboggling; 09-04-2011 at 07:01 AM. |
|
Advert | |
|
09-04-2011, 08:00 AM | #201 |
Calibre Plugins Developer
Posts: 4,637
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
@Unboggling - re timing of deletion. If I understand you properly your question concerns the situation where you dont clean the book straight away. A situation I have for much of my library given the earlier previous attempt to put it in my old library. What I do is a combination of things. Like Kacir mentioned, I have a #done custom column on each book to indicate I have cleaned/processed it. I also have an "Unprocessed" user category that I will add an author to if at the time I add a book I don't clean it right away (or they have so many books I just want to get them in and clean when I have more time). Strictly speaking the latter is kind of redundant since I could do a search for #done:false, but I like to see a count and list of the authors that are outstanding in my library, and a simple click on an author in that list in the tag browser gives me the books just for that author.
In terms of timing, as soon as I have processed (cleaned) a book any other formats in Calibre for it are deleted. As to when I delete from my source folders, I certainly do that when I have processed all books for that author, but sometimes for a specific book as I go. It depends on how I am processing the author, how many books they have and how many different versions of the book I have. Sometimes I end up doing a windows explorer search for a specific book rather than just the author, so I can more easily see and find my best format. In that situation it is just as easy to delete the files in the search results window when I have processed the book. That can help when you have an author who has short stories as well (which don't tend to be listed on Fantastic Fiction) and you are trying to figure out what books for that author you have not yet put in Calibre. Note that a valid alternative that some people do instead of deleting is to move the books that they added into Calibre from their raw source folders into some other kind of "processed" folder. Just so they have them available should they do something bad with their conversions or find they picked the wrong format in thinking it was their best one. However I would certainly *not* attempt to store those in a Calibre library in any way - just use a filesystem search to find a book from there in the extremely rare circumstance you need to do so. Re the 0.6 version - I'll be honest, I haven't read it in detail . I saw spoilers within spoilers, and lots of text, and as I am not the target user for it anyway I moved on. I will say what I saw kind of read more as a "diary" than a quick summary? If I was a user looking to get a quick and simple recommended setup, I don't really care about reading a whole paragraph or two of the "why" too much as a new user (if at all for many users I would suggest) - I just want to know the "what" to do and "when" to do it. The problem with explaining the why is that a why can be very verbose - after all, look at my own post here taking three paragraphs to explain some opinions on deletion. I don't have the answers for you, as I saw someone else post writing documentation is freaking hard, and you have to do a fine balance between just enough to get them going without turning it into a whole user manual in its own right. Like I said, I am not your target user though, and others may find what you have done to be exactly what they were after so don't let me discourage you too much! |
09-04-2011, 08:35 AM | #202 | |
Evangelist
Posts: 480
Karma: 270594
Join Date: Aug 2010
Device: palm tx, Windows7, Galaxy A5
|
Quote:
Very rarely I may try new conversion from another format, usually it's just massage the html > create prc to check if last improvement took. Also, during reading I can find tendencies like for example </i><i> instead of space. If such things look like continuing throughout the whole book, I search and replace them, make new prc and continue reading that. After reading I keep original pdf/doc/html I converted from, opf+edited html+picture(s) and final prc. If converted from epub, I may keep one nice-looking pdf also. Everything else goes to the trash. Sometimes during that process I add it to the Calibre. Original format usually before I start to read, with metadata search (have to be sure about series and reading order), final prc when I finish reading it. I can't really imagine more naked format than html. If I can get my claws into that, I can tweak anything in the book (or learn to tweak). And I believe it will convert less painlessly into some future format (more likely, noname readers will accept several formats). Now I try to download only epub or mobi. I do not have good experience with lrf/lit conversion. And if I can't get acceptable html now, then I doubt there will be better results converting into some future super-duper format. Of course could have been lousy lit/lrf. @kiwidude - I must disagree, I remember being very interested in 'why'. Especially new user needs to know why somebody does things, in my opinion it makes a lot easier to decide 'no, not for me, don't need to know how'. Last edited by travger; 09-04-2011 at 08:52 AM. Reason: @kiwidude |
|
09-04-2011, 09:33 AM | #203 | |
Calibre Plugins Developer
Posts: 4,637
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
Quote:
If I take the case of my parents as an example Calibre user, they really don't care about the intimate details of a "why" for most things - they would just want to know a "what" that is built from the experiences of other users to save them the hours of figuring it out for themselves. However I completely understand (as I am the same) that there are users like yourself who want to know the "why" for a recommendation so they can choose to use it or not, particularly when there are alternatives. I think it is valid to have the information available, I just suggest it is pretty verbose burying the "what" in amongst the "why". Perhaps an alternative approach would be to have the "Why" within the spoilers - so people can see at a glance the recommendations, and if they are curious they can drill into the detail. Last edited by kiwidude; 09-04-2011 at 09:35 AM. |
|
09-04-2011, 09:47 AM | #204 |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
I'm a lousy editor. And worse self-editor. It was even worse before I cut a lot of conversational background/why chat out, reducing size considerably. But I did intentionally leave in some why's. Without them, it would be here's how to set up calibre for new users step by step and follow my instructions and don't ask why each thing is important. Which I wanted to avoid - I'm really not trying to provide a manual. Just a map of what I'm doing with some examples. Examples in a structure of some reasoning help me learn better than examples not in some kind of bigger structure that makes sense.
Re nested spoilers, not to sound defensive, but I thought a flat 3 - 6 pages of text per high-level topic would have been potentially more intimidating to new user than another few buttons to press. (I'm aware of the python "philosophy" flat is better than nested.) I like pressing buttons, it's fun. But I probably should go find a professional editor (edit: a human one I mean) - there are some on MR - and ask for a slash job. Any volunteers? Last edited by unboggling; 09-04-2011 at 10:45 AM. |
09-04-2011, 09:50 AM | #205 |
Calibre Plugins Developer
Posts: 4,637
Karma: 2162064
Join Date: Oct 2010
Location: Australia
Device: Kindle Oasis
|
@unboggling - as I said above - it is just my opinion and what you have done may be exactly what others are wanting. For myself I wanted to just skim through the recommendations to see "where you were at" in certain areas, but instead found myself with lots of reading to do. Which I personally decided I didn't want to do, so I stopped. I totally agree that if the post was three pages long I would have skipped it as well. Can't please everyone can you
|
09-04-2011, 09:55 AM | #206 |
Resident Curmudgeon
Posts: 73,983
Karma: 128903378
Join Date: Nov 2006
Location: Roslindale, Massachusetts
Device: Kobo Libra 2, Kobo Aura H2O, PRS-650, PRS-T1, nook STR, PW3
|
Once you have your ePub, don't convert it to ePub. That will change it and in most cases, it will change it for the worse, not better. Calibre doesn't know how to just add a cover, just change metadata or change just the margins. It know how to deconstruct what you have and rebuild it to make your changes. Basically, if you have your ePub, open it up, make the fixes and then rebuild it or use Sigil.
|
09-04-2011, 10:22 AM | #207 | |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
About HTML and learning HtML being the primary keys to the lock of most clean-up and conversion problems, I fully agree. Maybe after I learn enough HTML, my core library will hold onlyHTML formats. Edit them, generate formats for devices on the fly, read, immediately delete any other format. It'd be even nicer if I could edit that HTML directly on reading device as I read. Raw HTML is probably the truest "lossless" format currently available in the state of current technology, in my admittedly new and limited understanding of the eBook world. @kiwidude, I like the idea of "why" inside spoilers. That's a great idea, and I'll work on implementing something like that for next version. Have a feeling that won't be easy for me, leaving the do-this's and do-that's naked with the why's hiding under the bed button. I'm the kind of person who likes to know why's first too. |
|
09-04-2011, 10:28 AM | #208 | |
Wizard
Posts: 1,065
Karma: 858115
Join Date: Jan 2011
Device: Kobo Clara, Kindle Paperwhite 10
|
Quote:
|
|
09-04-2011, 10:31 AM | #209 | |
Enthusiast
Posts: 31
Karma: 130
Join Date: Apr 2010
Location: USA
Device: Kindle 2
|
Learning Calibre
Quote:
Thanks so much for your help... that got me over a big problem, but I want to learn all I can about Calibre conversions. |
|
09-04-2011, 10:48 AM | #210 |
Well trained by Cats
Posts: 29,803
Karma: 54830978
Join Date: Aug 2009
Location: The Central Coast of California
Device: Kobo Libra2,Kobo Aura2v1, K4NT(Fixed: New Bat.), Galaxy Tab A
|
|
|
Similar Threads | ||||
Thread | Thread Starter | Forum | Replies | Last Post |
Two Users--One Calibre Library? | jerrypettit | Devices | 3 | 06-24-2011 01:01 PM |
Hi, Calibre users | zaphod234 | Introduce Yourself | 6 | 07-22-2010 08:16 AM |
The Success of the iPad and KISS | ColdSun | General Discussions | 68 | 07-10-2010 10:33 PM |
How do Calibre users.... | Lanyon | Calibre | 24 | 01-01-2009 01:29 PM |