I’ve a couple of hundred scientific papers I’ve collected in PDF format – some of which are named in a fashion that one can figure out what they are about, but many (downloaded from Journal websites) have names that are in some Hogwarts code.
I’d like a piece fo software that could scan multiple PDF’s and make an index from those articles – one document indexing to articles (even, optimially, with links, but that’s a bit of a dreamj). An intelligent indexing algorithm that would ignore ‘a’ and ‘the’ would be nice but that’s not imperative…
Anybody know of such a utility? I can find things that index single articles but none that will index a group.\
Cheers
Richard