mattbierbaum / arxiv-public-datasets
A set of scripts to grab public datasets from resources related to arXiv
☆432Updated 10 months ago
Alternatives and similar repositories for arxiv-public-datasets:
Users that are interested in arxiv-public-datasets are comparing it to the libraries listed below
- A data set based on all arXiv publications, pre-processed for NLP, including structured full-text and citation network☆285Updated 5 months ago
- S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/☆892Updated 11 months ago
- Parsers for scientific papers (PDF2JSON, TEX2JSON, JATS2JSON)☆359Updated 11 months ago
- SPECTER: Document-level Representation Learning using Citation-informed Transformers☆539Updated last year
- Autoregressive Entity Retrieval☆782Updated last year
- Science-parse version 2☆240Updated 5 years ago
- The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to o…☆380Updated last year
- Data and models for the SciFact verification task.☆227Updated last year
- Dataset accompanying the SPECTER model☆133Updated 2 years ago
- REL: Radboud Entity Linker☆306Updated 11 months ago
- A BERT model for scientific text.☆1,571Updated 3 years ago
- Tools for extracting tables and results from Machine Learning papers☆400Updated 2 years ago
- Repository for NAACL 2019 paper on Citation Intent prediction☆119Updated 5 years ago
- The full dataset behind paperswithcode.com☆340Updated 3 years ago
- Implementation of the ClausIE information extraction system for python+spacy☆221Updated 2 years ago
- Incorporating VIsual LAyout Structures for Scientific Text Classification☆175Updated 2 years ago
- [ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links☆433Updated 2 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆178Updated last year
- Python client for GROBID Web services☆316Updated 3 weeks ago
- Python wrapper for the arXiv API☆1,221Updated 9 months ago
- ☆221Updated last year
- Self-Supervision for Named Entity Disambiguation at the Tail☆215Updated 2 years ago
- KnowBert -- Knowledge Enhanced Contextual Word Representations☆375Updated 4 years ago
- Zero and Few shot named entity & relationships recognition☆361Updated 4 months ago
- Codebase for testing whether hidden states of neural networks encode discrete structures.☆390Updated last year
- Get answers to research questions from 200M+ papers. Link to demo -☆206Updated last year
- A Visual Analysis Tool to Explore Learned Representations in Transformers Models☆588Updated last year
- Entity Linker solution☆1,185Updated last year
- Interpretable Evaluation for AI Systems☆363Updated 2 years ago
- Heavy Workload on Reviewing Papers? ReviewAdvisor Helps out☆197Updated last year