Old 11-05-2017, 03:49 AM   #1
karta
Junior Member
Posts: 3
Karma: 1000000
Join Date: Oct 2017
Device: RCA
Right Forum??

I am in the process of trying to catalog a rather large etext collection, and am looking for methods to learn and to share.

However....

I only read etexts with desktops and a laptop. I have a reader, but find it of little use.

My primary interest is NOT in using any library software, but rather in scripts to sort etexts into category directories. I do use calibre, but only to get titles and metadata.

Is this the right forum?

Other etext forums, at least from what I have seen, appear moribund. Any pointers would be appreciated...
Old 11-05-2017, 03:55 AM   #2
pdurrant
The Grand Mouse 高貴的老鼠
Posts: 73,946
Karma: 315160596
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
You're making your task much harder by not using calibre to store your etexts. You can define custom metadata, and using calibre is so much less error-prone than moving texts between directories.

What are your objections to using calibre as a store for the texts and metadata?
Old 11-05-2017, 10:32 AM   #3
Dr. Drib
Grand Sorcerer
Posts: 45,441
Karma: 59592133
Join Date: Jan 2007
Location: Peru
Device: KINDLE: Oasis 3, Scribe (1st), Matcha; KOBO: Libra 2, Libra Colour
^ I completely agree with Paul.

Calibre would be a great choice, and you can manipulate the Metadata in just about any way you may wish. Why do you prefer to use a script, rather than solving this with a software solution?

One of your other questions (asked twice) on whether this is the 'right' forum: Probably not.
Old 11-07-2017, 02:27 AM   #4
frahse
occasional author
Posts: 2,315
Karma: 2064403292
Join Date: Sep 2011
Location: Wandering God's glorious hills, valleys and plains.
Device: A Franklin BI (before Internet) was the first. I still have it.
Quote:
Originally Posted by Dr. Drib View Post
^ I completely agree with Paul.

Calibre would be a great choice, and you can manipulate the Metadata in just about any way you may wish. Why do you prefer to use a script, rather than solving this with a software solution?

One of your other questions (asked twice) on whether this is the 'right' forum: Probably not.
Well, maybe it is... The people here have information and skills galore!

Still, the question and objectives need to be worked out to give an idea as to what is really needed. I think I am hearing a predetermined preference of method as opposed to just a desired starting point and end result.
Old 11-08-2017, 02:17 AM   #5
karta
Junior Member
Posts: 3
Karma: 1000000
Join Date: Oct 2017
Device: RCA
Though I like Calibre, I really dislike the idea of requiring a GUI to access my ever-growing and shifting collections.

Currently, I use text files to store data in a large directory, and grep portions of titles to tell me where they are. The bulk of my non-fiction is on optical media.
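
A minimal sketch of that grep-style lookup, written in Python since it runs on both Windows and Linux. The index layout (one `title<TAB>location` pair per line in `*.txt` files) and the name `find_titles` are assumptions made for illustration, not the actual format of the text files described above.

```python
from pathlib import Path


def find_titles(index_dir, fragment):
    """Return (title, location) pairs whose title contains `fragment`.

    Assumed index format: one "title<TAB>location" entry per line.
    Matching is case-insensitive, like `grep -i`.
    """
    needle = fragment.lower()
    hits = []
    for index_file in sorted(Path(index_dir).glob("*.txt")):
        with index_file.open(encoding="utf-8", errors="replace") as fh:
            for line in fh:
                title, sep, location = line.rstrip("\n").partition("\t")
                # Skip lines that do not follow the assumed two-column layout.
                if sep and needle in title.lower():
                    hits.append((title, location))
    return hits
```

Calling `find_titles("indexes", "moby")` would list every indexed title containing "moby" together with the disc or path recorded next to it.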

Fiction is mostly across a couple hard drives, and is probably around 200,000 titles if not more. I am consolidating that now, and my ultimate nightmare would be loss of data due to software failure or deprecation.

Currently I use calibre on books with ambiguous or damaged titles, and export the etext, jpg, and opf files it generates to a category directory.
Flush, rinse, repeat.

Currently each author gets a single category, but in time I intend to fix that with symlinks.

The final step would be a database with all relevant info on the etexts, but *not* the etexts themselves. Just a location pointer. MySQL or MariaDB come to mind.
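
A rough sketch of such a pointer-only catalog. It uses Python's built-in `sqlite3` as a stand-in so the example is self-contained; the post names MySQL or MariaDB, and the table name, column names, and helper functions here are all hypothetical.

```python
import sqlite3

# Pointer-only catalog: the database stores where a book lives, never the book.
SCHEMA = """
CREATE TABLE IF NOT EXISTS etexts (
    id       INTEGER PRIMARY KEY,
    title    TEXT NOT NULL,
    author   TEXT,
    format   TEXT,
    location TEXT NOT NULL UNIQUE
)
"""


def open_catalog(db_path=":memory:"):
    """Open (or create) the catalog database and make sure the table exists."""
    conn = sqlite3.connect(db_path)
    conn.execute(SCHEMA)
    return conn


def add_etext(conn, title, author, fmt, location):
    """Record one etext; a location that is already catalogued is ignored."""
    with conn:
        conn.execute(
            "INSERT OR IGNORE INTO etexts (title, author, format, location) "
            "VALUES (?, ?, ?, ?)",
            (title, author, fmt, location),
        )


def locate(conn, fragment):
    """Return (title, location) rows whose title contains `fragment`."""
    rows = conn.execute(
        "SELECT title, location FROM etexts WHERE title LIKE ? ORDER BY title",
        (f"%{fragment}%",),
    )
    return rows.fetchall()
```

The `location` column can hold a file path or a disc label plus path, so titles on optical media and on hard drives share one table.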

The reason for the scripts is to automate as much as possible the processes for categorization and elimination of duplicates (for fiction I archive only .epub, and will eliminate by script any extra copy whose title matches the .epub title, for example). Another script will then fix title formats, roughly categorize the etexts, and eliminate some garbage authors.
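
One way the duplicate pass could look, as a dry-run sketch: it only lists candidates and deletes nothing. The set of "other" ebook suffixes and the rule of matching on the file's base name are assumptions; covers and .opf files are left alone on purpose because exported ones share the book's base name.

```python
from pathlib import Path

# Assumed list of non-EPUB ebook formats worth pruning; adjust to taste.
OTHER_EBOOK_SUFFIXES = {".mobi", ".azw3", ".pdf", ".txt", ".rtf", ".lit"}


def redundant_copies(root):
    """List non-EPUB ebook files whose base name matches an EPUB under `root`.

    Comparison is case-insensitive on the file name without its extension.
    Nothing is deleted; the caller decides what to do with the result.
    """
    files = [p for p in Path(root).rglob("*") if p.is_file()]
    epub_titles = {p.stem.lower() for p in files if p.suffix.lower() == ".epub"}
    return sorted(
        p
        for p in files
        if p.suffix.lower() in OTHER_EBOOK_SUFFIXES
        and p.stem.lower() in epub_titles
    )
```

Reviewing the returned list before removing anything keeps a bad match from costing a title.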

I am particularly interested in mining the metadata out of the etexts themselves. With an epub it should be fairly simple to batch process them and yank out the .opf files to get the metadata. I can do that by hand quicker than with Calibre. Other file formats are a bit more opaque, and I have had minimal success with scraping PDFs for info using scripts.
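
For the epub case, a sketch using only the Python standard library. An EPUB is a zip archive whose `META-INF/container.xml` names the .opf package file, so the batch step reduces to reading two archive members. The function names are hypothetical, and only the Dublin Core title, creator, and identifier elements are pulled out.

```python
import zipfile
import xml.etree.ElementTree as ET

CONTAINER_NS = {"c": "urn:oasis:names:tc:opendocument:xmlns:container"}
DC = "{http://purl.org/dc/elements/1.1/}"


def read_opf(epub_path):
    """Return (opf_name, opf_bytes) for the package document inside an EPUB."""
    with zipfile.ZipFile(epub_path) as zf:
        container = ET.fromstring(zf.read("META-INF/container.xml"))
        rootfile = container.find("c:rootfiles/c:rootfile", CONTAINER_NS)
        if rootfile is None or not rootfile.get("full-path"):
            raise ValueError(f"no rootfile entry in {epub_path}")
        opf_name = rootfile.get("full-path")
        return opf_name, zf.read(opf_name)


def basic_metadata(opf_bytes):
    """Extract title, authors, and identifiers from OPF bytes."""
    root = ET.fromstring(opf_bytes)

    def texts(tag):
        # Collect the non-empty text of every Dublin Core element named `tag`.
        return [
            el.text.strip()
            for el in root.iter(DC + tag)
            if el.text and el.text.strip()
        ]

    titles = texts("title")
    return {
        "title": titles[0] if titles else None,
        "authors": texts("creator"),
        "identifiers": texts("identifier"),
    }
```

Writing the returned `opf_bytes` next to the book gives the same kind of sidecar .opf file mentioned above, without opening a GUI. Damaged archives will raise `zipfile.BadZipFile`, which a batch loop would want to catch and log.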

Most of all, the system needs to be portable across Windows and Linux, so I use Perl, and may switch to Ruby in the near future. With those languages I can build the database interface, perhaps into something like the Perl-based GCstar.

This is a long-term project, and for now I am still building on the basics, like looking for a good author->category list to assist in my processing.
And perhaps a script for grabbing titles like the one Calibre uses.

I can get access to the AbeBooks API, which is particularly useful, as a large portion of my collection is pre-ISBN.

PLUS, there is an additional requirement I will have when I finally get around to cataloguing my modest personal collection of *real* books (around 5,000).

Or, to put it much more simply: as we are dealing with terabytes of data, I do not see calibre or any typical reader software as being more than a tool in a larger chain. And there, GUI software just gets in the way, unless Calibre can be run from the command line with personalized scripts.
Old 11-08-2017, 02:50 AM   #6
pdurrant
The Grand Mouse 高貴的老鼠
Posts: 73,946
Karma: 315160596
Join Date: Jul 2007
Location: Norfolk, England
Device: Kindle Oasis
Quote:
Originally Posted by karta View Post
unless Calibre can be run from the command line with personalized scripts.
calibre can be run from the command line. It's also open source.

For what you describe (since you seem to be happy to learn different tools/languages), you'd probably be better off working from the calibre base. Adapting it to easily work with offline references to books should be trivial. You could, of course, do that without any programming with just the use of custom columns (which can be created and manipulated from the command line).
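
As a hedged sketch of the custom-column route, the snippet below only builds `calibredb` argument lists and leaves running them to a separate helper. The column label `location` and the helper names are made up for illustration, and the argument order for `add_custom_column` and `set_custom` should be checked against `calibredb --help` for the installed calibre version.

```python
import subprocess


def calibredb_cmd(library, *args):
    """Build a calibredb argument list that targets a specific library folder."""
    return ["calibredb", *args, "--with-library", str(library)]


def add_location_column(library):
    """Command to create a text column for offline locations (label, name, datatype)."""
    return calibredb_cmd(
        library, "add_custom_column", "location", "Offline location", "text"
    )


def set_location(library, book_id, location):
    """Command to store a location pointer on one book (column, book id, value)."""
    return calibredb_cmd(library, "set_custom", "location", str(book_id), location)


def run(cmd):
    """Run a prepared command; needs calibre's command-line tools on PATH."""
    return subprocess.run(cmd, check=True, capture_output=True, text=True).stdout
```

Something like `run(set_location("/path/to/library", 42, "DVD-014/melville/Moby Dick.epub"))` would then record where book 42 is stored without the text itself being kept in the library.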


And I hate to ask a silly question, but you do have backups of your texts, don't you?