MOVED TO https://gitlab.com/crossref/pdfextract
☆510Jul 26, 2017Updated 8 years ago
Alternatives and similar repositories for pdfextract
Users that are interested in pdfextract are comparing it to the libraries listed below
Sorting:
- Extract text, metadata and references (pdf, url, doi, arxiv) from PDF. Optionally download all referenced PDFs.☆1,071Jun 15, 2023Updated 2 years ago
- An open-source CRF Reference String Parsing Package☆161May 6, 2020Updated 5 years ago
- Extract citations from PDFs.☆28Feb 26, 2014Updated 12 years ago
- Content ExtRactor and MINEr☆513Jun 30, 2022Updated 3 years ago
- High-level build project for all LAPDF-Text submodules☆103Jul 2, 2015Updated 10 years ago
- A machine learning software for extracting information from scholarly documents☆4,707Mar 13, 2026Updated last week
- MOVED TO https://gitlab.com/crossref/pdfmark☆34Nov 22, 2018Updated 7 years ago
- The repository of Icecite, a research paper management system.☆15Mar 29, 2018Updated 7 years ago
- Extract bibliographic references from (High-Energy Physics) articles.☆142Mar 11, 2026Updated last week
- A parser for Google Scholar, written in Python☆2,168Sep 10, 2022Updated 3 years ago
- Documentation for Crossref's REST API. For questions or suggestions, see https://community.crossref.org/☆792Sep 25, 2024Updated last year
- Statistical mixed effects models in Ruby☆21Jul 8, 2016Updated 9 years ago
- Performs a search and uses the resulting DOI to create a new bibtex entry. Uses the crossref API.☆63Aug 2, 2023Updated 2 years ago
- Android application for the Bodytrack project☆13Sep 8, 2011Updated 14 years ago
- Track the impact of research software.☆207Jul 6, 2022Updated 3 years ago
- The One True Open Access Button - cross-compatible extension for research papers and data.☆49Oct 8, 2024Updated last year
- Fast citation reference parsing☆1,218May 11, 2025Updated 10 months ago
- ☆41Feb 25, 2018Updated 8 years ago
- Neuralized version of the Reference String Parser component of the ParsCit package.☆81May 27, 2022Updated 3 years ago
- A python library to deal with scientific papers.☆17Apr 2, 2016Updated 9 years ago
- Generalized Linear Models extension for Statsample☆24Jan 24, 2019Updated 7 years ago
- The Dat in the Lab project☆32Jun 20, 2019Updated 6 years ago
- A script to synchronise PDF files in Mendeley across multiple machines☆24Jul 22, 2018Updated 7 years ago
- Multi-Entity Extraction Framework for Academic Documents (with default extraction tools)☆31Oct 3, 2023Updated 2 years ago
- Python client for Gogs server☆13Oct 5, 2020Updated 5 years ago
- A Node.js-based server to run Zotero translators☆146Mar 3, 2026Updated 2 weeks ago
- A library for extracting tables from PDF files☆89Sep 27, 2013Updated 12 years ago
- Yet Another Indent Finder, Almost...☆21Apr 10, 2020Updated 5 years ago
- Regular expression for matching DOIs☆31Jun 14, 2024Updated last year
- my take at a PDF text extraction utility☆25Jun 15, 2015Updated 10 years ago
- Adds Pandoc-style BibTeX citation key autocompletion to autocomplete+ for Atom.☆44Mar 26, 2022Updated 3 years ago
- Extract tables from PDF files☆359May 17, 2016Updated 9 years ago
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- A FUSE filesystem for browsing the xkcd webcomic☆14Jun 14, 2023Updated 2 years ago
- stoplists for African languages generated from the ASP corpus☆14Jan 16, 2016Updated 10 years ago
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 14 years ago
- A python script that looks for special lines in a markdown file and uses those lines to convert, clean up, and insert content from URLs i…☆16Dec 9, 2012Updated 13 years ago
- A complete agency API program.☆12Apr 27, 2017Updated 8 years ago
- An Atom package for creating a zettelkasten style wiki. Should be used with my Academic-Markdown syntax file☆12Jun 3, 2021Updated 4 years ago