Poor man's simple harvester for arXiv resources
☆13Jul 14, 2023Updated 2 years ago
Alternatives and similar repositories for arxiv_harvester
Users that are interested in arxiv_harvester are comparing it to the libraries listed below.
- A browser extension providing Open Access bibliographical services☆18Dec 9, 2022Updated 3 years ago
- Open Access PDF harvester☆42May 3, 2024Updated last year
- Machine learning software for extracting astronomical entities from scholarly documents☆10Oct 31, 2022Updated 3 years ago
- Open Access PDF harvester, metadata aggregator and full-text ingester☆62May 3, 2024Updated last year
- ☆17Apr 6, 2021Updated 4 years ago
- Some examples of using Grobid in a third-party Java project.☆20Jun 14, 2023Updated 2 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources☆17May 14, 2023Updated 2 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature☆30Jun 14, 2025Updated 9 months ago
- A neural dependency parser that does its best☆17Mar 6, 2026Updated 3 weeks ago
- A thorough tutorial on using and programming with the Unix shell.☆11Feb 27, 2020Updated 6 years ago
- GMap: Graph-to-Map visualization tool☆22Jun 11, 2021Updated 4 years ago
- Learning JavaScript in Public☆11Apr 16, 2021Updated 4 years ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 11 months ago
- Service for converting and enhancing heterogeneous publisher XML formats into TEI☆62Sep 14, 2024Updated last year
- Train transformer-based models.☆28Jan 23, 2026Updated 2 months ago
- Grobid module for superconductor material and properties extraction☆22May 17, 2025Updated 10 months ago
- WiNER-fr is a free named entity corpus using French Wikinews texts.☆17Feb 12, 2021Updated 5 years ago
- Discover a handpicked compilation of Git configuration settings and time-saving aliases. Enhance your productivity and simplify your work…☆18Feb 18, 2026Updated last month
- This repository contains a number of scripts that I have written or refactored to enhance their performance. All the scripts are meant to m…☆22Mar 24, 2025Updated last year
- Knowledge Base stuff☆23Mar 1, 2026Updated 3 weeks ago
- Material parsers and other tools and scripts, initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- The Flutrack platform gathers flu-related tweets from the entire world, with searching tag, words that are influenza synonyms and flu symptom…☆13Apr 22, 2019Updated 6 years ago
- Obsidian SSOV (Student Start Obsidian Vault) is a project created for Obsidian October 2022 event with the theme "Back to School"☆18Oct 27, 2022Updated 3 years ago
- Course project for CS410. Drug Molecular Toxicity Prediction with GCN + Cloud ML Infra.☆10Apr 6, 2021Updated 4 years ago
- Repository hosting the common code for the entity-fishing clients☆10Jun 10, 2025Updated 9 months ago
- A Named-Entity Recogniser based on Grobid.☆54May 14, 2025Updated 10 months ago
- The EHRI project's portal interface.☆15Mar 9, 2026Updated 2 weeks ago
- Terminal tool that converts files encoding to UTF-8☆10Oct 5, 2019Updated 6 years ago
- The grobidmonkey package is an open-source package designed for postprocessing GROBID outputs.☆12Mar 27, 2024Updated 2 years ago
- Analytic platform for the HAL research archive (in development)☆13Oct 2, 2020Updated 5 years ago
- A high performance bibliographic information service: https://biblio-glutton.readthedocs.io☆148Mar 6, 2026Updated 3 weeks ago
- Load, build and explore Patstat using the Google Cloud Platform☆10Jan 19, 2019Updated 7 years ago
- Specification of a stand-off element for the TEI guidelines☆12Apr 29, 2021Updated 4 years ago
- Scripts to parse arxiv documents for NLP tasks☆19Jun 12, 2023Updated 2 years ago
- MLCommons Science benchmarking working group☆13May 19, 2023Updated 2 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last month
- WindSR Dataset contains more than 22,000 pairs of HR/LR wind speed images, which are processed using the NASA's GEOS-5 Nature Run dataset…☆12Jan 18, 2024Updated 2 years ago
- Line shuffler for huge text files that do not fit in memory☆13Dec 1, 2022Updated 3 years ago