Register Guidelines E-Books Today's Posts Search

Go Back   MobileRead Forums > E-Book Software > Calibre > Development

Notices

Reply
 
Thread Tools Search this Thread
Old 09-04-2021, 06:47 PM   #1
Trenchant Edges
Junior Member
Trenchant Edges began at the beginning.
 
Posts: 1
Karma: 10
Join Date: Sep 2021
Device: Kindle 10
Post [New Plugin Development Plan] Extracting footnotes/endnotes and Indexing dates

I've got a research project I'd like to be able to pull specific information for and I think the shortest path to getting that is building two simple plugins.

In short, I've got a pile of pdfs, epubs, and mobis that all have footnotes/endnotes I'd like extracted and put into a text file.

I have another pile of the same type I'd like to create an index for any mention of a date or day of the week. Ideally, output to a .csv.

I think python can do this through Calibre.

If I'm wrong about that, I'd appreciate someone letting me know before I sink too much time into it.

Now, I'm returning to programming after about 15 years of not doing any, so this is going to be a bumpy janky mess but I don't really need anything better. So it'll be a fun project.

I want to start with the hyperlink/footnote/endnote extractor first because it's simpler.


I'm spending this week catching up on the basics, but I want to make sure I'm also building a plan for how this program will work. It looks like python and Calibre already have libraries for most of what I want to do, so it'll just be a question of reading the documentation and looking at extant plugin code.

I'm considering updating a couple of the python 2 plugins still not updated to python 3 just so I see what less inept people are doing, but I've been unable to find anything quite like what I want.

A friend suggested this library for the date parsing.

My questions for anyone who's made it this far are:
  1. Is what I want to do possible?
  2. What do I need to learn to make this work?
  3. Is there any existing project that might make this easier?

Anyway, thanks.

Trenchant Edges is offline   Reply With Quote
Old 09-04-2021, 06:57 PM   #2
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,715
Karma: 29711016
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Moderator Notice
Moved from Plugins forum to Development forum

BR
BetterRed is offline   Reply With Quote
Advert
Old 09-04-2021, 10:08 PM   #3
kovidgoyal
creator of calibre
kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.kovidgoyal ought to be getting tired of karma fortunes by now.
 
kovidgoyal's Avatar
 
Posts: 45,331
Karma: 27182818
Join Date: Oct 2006
Location: Mumbai, India
Device: Various
Extracting dates should be trivial, look at the extract ISBN plugin for a template to get you started. As for footnotes/endotes, thats not so easy, especially from PDF.
kovidgoyal is offline   Reply With Quote
Reply


Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Help with endnotes/footnotes? r_avital ePub 4 10-28-2019 03:11 PM
Footnotes, endnotes and links Sissinghurst Calibre 0 07-17-2019 04:30 PM
Footnotes and Endnotes Ken Irving Writer2ePub 2 01-02-2011 09:05 AM
Footnotes/Endnotes crutledge Sigil 17 07-17-2010 11:56 AM
How do endnotes/footnotes work? JSWolf ePub 4 04-22-2009 05:54 PM


All times are GMT -4. The time now is 02:57 PM.


MobileRead.com is a privately owned, operated and funded community.