The full dataset behind paperswithcode.com
☆838Sep 8, 2025Updated 5 months ago
Alternatives and similar repositories for paperswithcode-data
Users that are interested in paperswithcode-data are comparing it to the libraries listed below
Sorting:
- The SOTA extractor pipeline☆380Mar 20, 2024Updated last year
- Tools for extracting tables and results from Machine Learning papers☆435Nov 28, 2022Updated 3 years ago
- Easily benchmark Machine Learning models on selected tasks and datasets☆16May 22, 2023Updated 2 years ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆1,016Apr 26, 2024Updated last year
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆21Jan 8, 2024Updated 2 years ago
- Data/Code Repository for https://api.semanticscholar.org/CorpusID:218470122☆138Jul 25, 2024Updated last year
- ☆98May 20, 2022Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆76Dec 29, 2025Updated last month
- Dataset accompanying the SPECTER model☆143Dec 19, 2022Updated 3 years ago
- Identifying Used Methods and Datasets in Scientific Publications☆18Jan 14, 2021Updated 5 years ago
- ☆18Sep 15, 2025Updated 5 months ago
- A BERT model for scientific text.☆1,669Feb 22, 2022Updated 4 years ago
- Tools to bulk download arxiv data☆133Oct 29, 2018Updated 7 years ago
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding.☆47Mar 17, 2025Updated 11 months ago
- Repository for NAACL 2019 paper on Citation Intent prediction☆129Dec 1, 2019Updated 6 years ago
- What are the best Systems? New Perspectives on NLP Benchmarking☆13Mar 16, 2023Updated 2 years ago
- SPECTER: Document-level Representation Learning using Citation-informed Transformers☆571Jun 12, 2023Updated 2 years ago
- Python wrapper for the arXiv API☆1,450Jan 5, 2026Updated last month
- Intelligence Task Ontology (ITO)☆75Oct 12, 2022Updated 3 years ago
- Data and models for the SciFact verification task.☆249Oct 15, 2023Updated 2 years ago
- Maintenance Information Extraction (MaintIE)☆16Jun 29, 2024Updated last year
- PyTorch implementation of L2R2 in SIGIR 2020☆17Jun 12, 2023Updated 2 years ago
- BertViz: Visualize Attention in Transformer Models☆7,921Jan 8, 2026Updated last month
- ☆10Oct 2, 2024Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆96Dec 30, 2025Updated last month
- Data and code for Kang et al., NAACL 2018's paper titled "A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications…☆427Dec 9, 2025Updated 2 months ago
- ☆27Oct 30, 2023Updated 2 years ago
- code for generating a high-quality knowledge graph with metadata about datasets and links to publications☆28Apr 8, 2022Updated 3 years ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆29Dec 8, 2022Updated 3 years ago
- Hosting examples of interactive datamapplot output☆29Feb 13, 2026Updated 2 weeks ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Dec 2, 2024Updated last year
- ☆69May 1, 2025Updated 9 months ago
- A set of scripts to grab public datasets from resources related to arXiv☆476May 20, 2024Updated last year
- jiant is an nlp toolkit☆1,674Jul 6, 2023Updated 2 years ago
- Align, a general text alignment function☆15Dec 7, 2023Updated 2 years ago
- Custom linux kernel module that re-enables fn-keys on the Gigabyte Aero 15 SB☆10Aug 4, 2022Updated 3 years ago
- A simple IOS application that uses mobilenet to classify 1000 different images from an IOS device's video camera.☆11Aug 5, 2019Updated 6 years ago
- ☆11Dec 2, 2024Updated last year
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago