paperswithcode/paperswithcode-data

The full dataset behind paperswithcode.com

☆928

Alternatives and similar repositories for paperswithcode-data

Users interested in paperswithcode-data are comparing it to the repositories listed below.

paperswithcode / sota-extractor
The SOTA extractor pipeline
☆384 · Mar 20, 2024 · Updated 2 years ago
paperswithcode / paperswithcode-client
API Client for paperswithcode.com
☆188 · May 10, 2024 · Updated 2 years ago
paperswithcode / axcell
Tools for extracting tables and results from Machine Learning papers
☆440 · Nov 28, 2022 · Updated 3 years ago
allenai / s2orc
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
☆1,073 · Apr 26, 2024 · Updated 2 years ago
allenai / SciREX
Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122
☆140 · Jul 25, 2024 · Updated last year
jd-coderepos / contributions-ner-cs
This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph
☆21 · Jan 8, 2024 · Updated 2 years ago
OpenBioLink / ITO
Intelligence Task Ontology (ITO)
☆79 · Oct 12, 2022 · Updated 3 years ago
michaelfaerber / data-set-knowledge-graph
Code for generating a high-quality knowledge graph with metadata about datasets and links to publications
☆28 · Apr 8, 2022 · Updated 4 years ago
allenai / scidocs
Dataset accompanying the SPECTER model
☆148 · Dec 19, 2022 · Updated 3 years ago
malteos / scincl
Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)
☆79 · Dec 29, 2025 · Updated 6 months ago
viswavi / datafinder
☆27 · Oct 30, 2023 · Updated 2 years ago
IBM / science-result-extractor
☆100 · May 20, 2022 · Updated 4 years ago
allenai / specter
SPECTER: Document-level Representation Learning using Citation-informed Transformers
☆586 · Jun 12, 2023 · Updated 3 years ago
allenai / scicite
Repository for NAACL 2019 paper on Citation Intent prediction
☆130 · Dec 1, 2019 · Updated 6 years ago
michaelfaerber / scholarly-entity-usage-detection
Identifying Used Methods and Datasets in Scientific Publications
☆18 · Jan 14, 2021 · Updated 5 years ago
ariecattan / SciCo
Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF…
☆30 · Oct 17, 2021 · Updated 4 years ago
allenai / scibert
A BERT model for scientific text.
☆1,705 · Feb 22, 2022 · Updated 4 years ago
copenlu / scientific-information-change
Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022
☆13 · Oct 20, 2022 · Updated 3 years ago
allenai / ForeCite
☆35 · Sep 16, 2022 · Updated 3 years ago
danilo-dessi / SKG-pipeline
☆21 · May 1, 2025 · Updated last year
dair-iitd / ECQA-Dataset
Dataset Release for Explanations for CommonsenseQA, ACL 2021 Paper
☆20 · Jul 30, 2021 · Updated 4 years ago
greenelab / opencitations
Processing OpenCitations Data
☆20 · Aug 17, 2017 · Updated 8 years ago
copenlu / cite-worth
Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"
☆14 · Sep 8, 2022 · Updated 3 years ago
paperswithcode / releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
☆2,949 · May 19, 2023 · Updated 3 years ago
ottowg / gsap-ner
☆10 · Oct 2, 2024 · Updated last year
panditakshay402 / PsyCare-AI
PsyCare-AI is an AI-powered mental health prediction project, offering a user-friendly interface to predict potential mental health issue…
☆10 · Jul 19, 2023 · Updated 3 years ago
kermitt2 / datastet
Finding mentions and citations to named and implicit research datasets from within the academic literature
☆31 · Jun 14, 2025 · Updated last year
kochbj / Reduced_Reused_Recycled
GitHub for "Reduced, Reused and Recycled" (NeurIPS 2021 Best Paper, D&B Track)
☆17 · Jan 8, 2022 · Updated 4 years ago
grobidOrg / grobid
A machine learning software for extracting information from scholarly documents
☆5,016 · Updated this week
google-research / google-research
Google Research
☆38,423 · Updated this week
armancohan / arxiv-tools
Tools to bulk download arxiv data
☆134 · Oct 29, 2018 · Updated 7 years ago
myt517 / DKT
Official implementation of "Disentangled Knowledge Transfer for OOD Intent Discovery with Unified Contrastive Learning", ACL2022 main con…
☆14 · Jul 23, 2022 · Updated 4 years ago
lukasschwab / arxiv.py
Python wrapper for the arXiv API
☆1,537 · Jul 10, 2026 · Updated 2 weeks ago
zycdev / L2R2
PyTorch implementation of L2R2 in SIGIR 2020
☆17 · Jun 12, 2023 · Updated 3 years ago
allenai / PeerRead
Data and code for Kang et al., NAACL 2018's paper titled "A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications…
☆429 · Dec 9, 2025 · Updated 7 months ago
jacklxc / CORWA
CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022
☆17 · May 2, 2025 · Updated last year
deepcurator / DCC
Deep Code Curation
☆32 · Dec 8, 2022 · Updated 3 years ago
huggingface / transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆162,876 · Updated this week
JulesBelveze / concepcy
💫 SpaCy wrapper for ConceptNet 💫
☆96 · Dec 30, 2025 · Updated 6 months ago