Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.
☆74 · Mar 28, 2017 · Updated 8 years ago
Alternatives and similar repositories for OA-STM-Corpus
Users who are interested in OA-STM-Corpus are comparing it to the repositories listed below.
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor ☆13 · Feb 21, 2025 · Updated last year
- ☆20 · May 1, 2025 · Updated 10 months ago
- Generating graph structures from OWL ontologies ☆12 · Nov 21, 2017 · Updated 8 years ago
- curation workflow automation and coordination ☆42 · Sep 19, 2025 · Updated 6 months ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph ☆21 · Jan 8, 2024 · Updated 2 years ago
- Open Access PDF harvester ☆42 · May 3, 2024 · Updated last year
- A small python library to parse and write TSV files generated by the WebAnno software. ☆11 · Apr 14, 2025 · Updated 11 months ago
- ☆12 · Mar 20, 2020 · Updated 6 years ago
- A Knowledge Base for research software relying on large-scale text mining and curated knowledge sources ☆17 · May 14, 2023 · Updated 2 years ago
- ☆27 · May 8, 2019 · Updated 6 years ago
- Project code for BioHackathon Europe 2023. ☆18 · Aug 20, 2024 · Updated last year
- Open Source Mycetoma's First Series of Molecules ☆10 · Sep 22, 2025 · Updated 5 months ago
- A simple toolkit for conducting analyses using corpus methods ☆27 · Nov 11, 2021 · Updated 4 years ago
- For extracting measurements and related entities from text ☆58 · May 6, 2020 · Updated 5 years ago
- This ontology provides contribution roles for use in crediting persons or organizations. ☆24 · Aug 16, 2023 · Updated 2 years ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆30 · Dec 8, 2022 · Updated 3 years ago
- ☆18 · Oct 22, 2022 · Updated 3 years ago
- Tracking books that I {have, currently, or plan to} read ☆18 · Apr 18, 2021 · Updated 4 years ago
- Finding mentions and citations to named and implicit research datasets from within the academic literature ☆30 · Jun 14, 2025 · Updated 9 months ago
- Force11 Software Citation Working Group ☆37 · Jun 12, 2018 · Updated 7 years ago
- PyTorch implementation of Area Attention. ☆11 · Nov 30, 2020 · Updated 5 years ago
- Extract, transform, and analyze bibliographic data from Wikidata dumps ☆28 · Mar 5, 2023 · Updated 3 years ago
- cicada: a hypergraph-based toolkit for statistical machine translation based on {tree, string}-to-{tree, string} models ☆42 · Aug 9, 2021 · Updated 4 years ago
- R code to reproduce this Jan. 23, 2018 BuzzFeed News analysis of a year of tweets from President Donald Trump and all members of Congres… ☆10 · Nov 8, 2019 · Updated 6 years ago
- ☆14 · Mar 7, 2019 · Updated 7 years ago
- [NAACL(2019)] Generating Knowledge Graph Paths from Textual Definitions using Sequence-to-Sequence Models ☆11 · Apr 27, 2022 · Updated 3 years ago
- Data and all ☆14 · Sep 30, 2019 · Updated 6 years ago
- BERT models pretrained on the CORD-19 Kaggle dataset ☆15 · Jun 8, 2020 · Updated 5 years ago
- DataSeer machine-learning service ☆28 · Sep 4, 2025 · Updated 6 months ago
- no-bullshit url shortening with node.js ☆10 · Jan 11, 2023 · Updated 3 years ago
- Training data for the NLPContributionGraph Shared Task 11 at SemEval-2021 ☆14 · Jan 11, 2021 · Updated 5 years ago
- R package for styling graphics for RSS publications. ☆17 · Mar 2, 2024 · Updated 2 years ago
- Fast and Effective Biomedical Entity Linking Using a Dual Encoder ☆18 · Apr 21, 2022 · Updated 3 years ago
- Vectorizing knowledge bases for entity linking ☆15 · Feb 21, 2021 · Updated 5 years ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal ☆10 · Jun 1, 2024 · Updated last year
- Bioclipse2 Core. ☆25 · Jun 11, 2021 · Updated 4 years ago
- An OWL vocabulary to allow the serialization and exchanging of OntoUML models in conformance with the OntoUML Metamodel. ☆13 · Aug 22, 2023 · Updated 2 years ago
- This is for C2D2 Dataset: A Resource for Analyzing Cognitive Distortions and Its Impact on Mental Health ☆33 · Nov 10, 2023 · Updated 2 years ago
- Argumentation Mining Tool for Lawyers ☆15 · May 18, 2021 · Updated 4 years ago