View Single Post
Old 10-21-2013, 07:05 PM   #12
BetterRed
null operator (he/him)
BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.BetterRed ought to be getting tired of karma fortunes by now.
 
Posts: 21,820
Karma: 30277270
Join Date: Mar 2012
Location: Sydney Australia
Device: none
Quote:
Originally Posted by Sabardeyn View Post
No, I cannot recall exactly what I did to index epubs. My guess, and it really is just a guess, is that I handled it as an alternate extension for ZIP files. Only the text portions of the archive are relevant to Indexing, so ignoring the other stuff in an EPUB (ToC, Spine, OPF, etc) would be fine. Images might have been ignored as well, which wouldn't really mater as I wouldn't be searching on them anyway.
I've tried do the same with the zip iFilter from the iFiltershop. In fact it still there see attachment, but all it does is to index contents of the zip in terms of file names, but not the contents thereof. Citeknet have an iFilter zip, I think I tried it with similar results - but it could be that there's another.

I know my library folders are being indexed because docx, rtf, pdf, chms etc show up in search, and its that list of file paths that I massage with notepad++ macros to create a csv that I give to the ImportList PI to create a Reading List.

I just found this http://www.sigmasd.com/product/Zip-IFilter/322038.

It refers to the IFiltershop product. Sigma appears to be a UK based systems integrator - I'll send them a mail and ask how to make it work for zips dressed up as ePUBs. But I wont hold my breath expecting response.

Quote:
Originally Posted by Sabardeyn View Post
PS: If you can figure out how to generate an EPUB iFilter, it appears as though you might have a lucrative niche product, with a growing market, without any competitors in sight.
10-20 years ago I would have done it by now, but these days whilst the back brain knows what must done, the front brain says it has better things to do with its time, disk space is cheap, and not all PDFs can be readily converted to ePUB etc, etc. :lol:

What did you mean by 'I almost maxed out Indexing at 1.97 Gb after going through my entire library'? I didn't think there was a limit on its size other than available drive space. I index much more than what's in the calibre library folders.

BR

PS : thanks for your interest in this, it may lead to getting the zip Ifilter to do on epubs what sigma claims it can do on zips, then I might be tempted to clone the Recoll plugin for WDS.
Attached Thumbnails
Click image for larger version

Name:	Capture.JPG
Views:	197
Size:	187.7 KB
ID:	113814  

Last edited by BetterRed; 10-21-2013 at 07:09 PM.
BetterRed is offline   Reply With Quote