Open Access PDF harvester, metadata aggregator and full-text ingester
☆62May 3, 2024Updated last year
Alternatives and similar repositories for article_dataset_builder
Users that are interested in article_dataset_builder are comparing it to the libraries listed below
Sorting:
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io☆148Jun 19, 2025Updated 8 months ago
- Citation Extraction and Classifier☆16Jan 15, 2026Updated last month
- Open database of scholarly journals☆10Oct 26, 2022Updated 3 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆30Jun 14, 2025Updated 8 months ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆82Sep 30, 2025Updated 5 months ago
- DataSeer machine-learning service☆28Sep 4, 2025Updated 5 months ago
- Search comments and highlights annotations in PDF documents.☆12May 4, 2023Updated 2 years ago
- Some examples of usage of Grobid in a third party java project.☆20Jun 14, 2023Updated 2 years ago
- Command-line tools to support meta-analysis using a library managed in Zotero☆11Feb 9, 2023Updated 3 years ago
- A template to create your own literature survey engine☆11Feb 23, 2026Updated last week
- Load, build and explore Patstat using the Google Cloud Platform☆10Jan 19, 2019Updated 7 years ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆1,016Apr 26, 2024Updated last year
- Knowtate is a sophisticated platform designed to elevate your academic research experience. Seamlessly blend reading, note-taking with ma…☆11Sep 19, 2024Updated last year
- Analytic platform for the HAL research archive (in development)☆13Oct 2, 2020Updated 5 years ago
- A machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- Hacking the TGAM1 Neurosky EEG chip with an Arduino.☆12Feb 2, 2018Updated 8 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 4 years ago
- Tag macOS Finder items via keyboard shortcut☆20Oct 14, 2017Updated 8 years ago
- ☆35Sep 16, 2022Updated 3 years ago
- A machine learning software for extracting information from scholarly documents☆4,670Updated this week
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆32May 25, 2022Updated 3 years ago
- Collection of scripts that enhance Zotero☆16Mar 22, 2025Updated 11 months ago
- Jupyter kernel for Stata based on pystata☆14Feb 29, 2024Updated 2 years ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆60Sep 14, 2024Updated last year
- Making Patent Citations Uncool Again☆112Jun 11, 2023Updated 2 years ago
- Scientific Document Insight Q/A☆34Sep 1, 2025Updated 6 months ago
- Repository for NAACL 2019 paper on Citation Intent prediction☆129Dec 1, 2019Updated 6 years ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆457Apr 11, 2024Updated last year
- ☆21Oct 7, 2022Updated 3 years ago
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆53Mar 17, 2025Updated 11 months ago
- Simple story sifting and social simulation engine☆22Aug 14, 2020Updated 5 years ago
- Get the scholarly citation for any research product: software, preprint, paper, or dataset☆84May 23, 2023Updated 2 years ago
- OpenAlex Networks is a helper library to process and obtain data from the OpenAlex dataset via API. It also provides functionality to gen…☆26Apr 5, 2023Updated 2 years ago
- proposition: general-purpose AppleScript libraries that ought to be included in OS X☆19Jun 21, 2016Updated 9 years ago
- A large-scale open data lake for the science of science research.☆110Jun 2, 2025Updated 9 months ago
- Python PDF parser for scientific publications: content and figures☆451Mar 21, 2024Updated last year
- Pipeline for assessing the tractability of potential targets (starting from Gene IDs)☆29Feb 9, 2025Updated last year
- 🌟BuroTonic: Revolutionizing Sales and Marketing with AI-Driven Business Intelligence - An adaptive team of virtual agents for precise cl…☆23Jul 19, 2025Updated 7 months ago