Open Access PDF harvester, metadata aggregator and full-text ingester
☆62May 3, 2024Updated last year
Alternatives and similar repositories for article_dataset_builder
Users that are interested in article_dataset_builder are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Poor man's simple harvester for arXiv resources☆13Jul 14, 2023Updated 2 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆17May 14, 2023Updated 2 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆30Jun 14, 2025Updated 9 months ago
- Some examples of usage of Grobid in a third party java project.☆20Jun 14, 2023Updated 2 years ago
- Softcite software mention recognizer, finding mentions and citations to software from within the academic literature☆82Sep 30, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- All the OpenAlex API endpoints that are backed by Elasticsearch☆40Updated this week
- DataSeer machine-learning service☆28Sep 4, 2025Updated 7 months ago
- Analytic platform for the HAL research archive (in development)☆13Oct 2, 2020Updated 5 years ago
- Load, build and explore Patstat using the Google Cloud Platform☆10Jan 19, 2019Updated 7 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 4 years ago
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- Open database of scholarly journals☆11Oct 26, 2022Updated 3 years ago
- Streaming responses with Streamlit, ChatGPT and Langchain.☆11Apr 7, 2023Updated 3 years ago
- ☆35Sep 16, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A machine learning software for extracting information from scholarly documents☆4,776Updated this week
- A gold-standard dataset of software mentions in research publications.☆38Jul 27, 2023Updated 2 years ago
- Pip-installable Python package to automate handsearching and citation searching for systematic reviews.☆17Jul 13, 2024Updated last year
- Scientific Document Insight Q/A☆35Sep 1, 2025Updated 7 months ago
- Command-line tools to support meta-analysis using a library managed in Zotero☆11Feb 9, 2023Updated 3 years ago
- quarto filter extension for simple search-replace macros☆27Dec 29, 2025Updated 3 months ago
- ☆21Oct 7, 2022Updated 3 years ago
- A Named-Entity Recogniser based on Grobid.☆54May 14, 2025Updated 10 months ago
- Introduction to Reproducible Publications with Quarto☆11Jan 28, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Making Patent Citations Uncool Again☆112Jun 11, 2023Updated 2 years ago
- a Deep Learning Framework for Text https://delft.readthedocs.io/☆415Apr 3, 2026Updated last week
- Python PDF parser for scientific publications: content and figures☆452Mar 21, 2024Updated 2 years ago
- Perpetual Access To The Scholarly Record☆121Jul 31, 2024Updated last year
- Retrieve and normalize records from ICTRP☆10May 26, 2020Updated 5 years ago
- High-performance R package server☆27Mar 25, 2026Updated 2 weeks ago
- Knowtate is a sophisticated platform designed to elevate your academic research experience. Seamlessly blend reading, note-taking with ma…☆11Sep 19, 2024Updated last year
- Download metadata for all DOIs using the Crossref API☆66Sep 25, 2018Updated 7 years ago
- For extracting measurements and related entities from text☆58May 6, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Inter-annotator agreement for Doccano☆28May 3, 2020Updated 5 years ago
- PDF to XML ALTO file converter☆268Apr 1, 2026Updated last week
- A Keras version of Google's BERT model☆35Nov 4, 2019Updated 6 years ago
- GROBID extension for identifying and normalizing physical quantities.☆83Updated this week
- Content ExtRactor and MINEr☆512Jun 30, 2022Updated 3 years ago
- Sisyphe is a modulable NodeJS BIG-DATA analyser & transformer☆12Oct 16, 2023Updated 2 years ago
- Scripts used to make and evaluate OpenAlex's concept tagging model☆52Aug 17, 2023Updated 2 years ago