PDFx - Extract references and metadata from PDF documents, and download all referenced PDFs

1 · Chris Hager · Jan. 1, 2001, midnight
Reading over this paper and its references recently, I thought it would be great to be able to download all the references at once… This inspired me to write a little tool to do just that, and now it’s done and released under the Apache open source license: https://github.com/metachris/pdfx Features Extract references and metadata from a given PDF Detects pdf, url, arxiv and doi references Fast, parallel download of all referenced PDFs Find broken hyperlinks (using the -c flag) (more) Output a...