jpwahle / lrec22-d3-dataset
The official repository for the LREC 2022 paper "D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research"
☆27Updated last year
Related projects ⓘ
Alternatives and complementary repositories for lrec22-d3-dataset
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding"☆14Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper)☆64Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface.☆29Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers.☆50Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆22Updated last year
- ☆14Updated 2 years ago
- Code for the paper SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts (AKBC 2021). https://openreview.net/forum?id=OF…☆25Updated 3 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Updated 2 years ago
- A Test Collection of Computer Science Papers for Faceted Query by Example☆21Updated 2 years ago
- ☆18Updated 2 years ago
- Workshop Home Page for Benchmarking: Past, Present and Future☆34Updated 3 years ago
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022☆16Updated this week
- Repository with code for MaChAmp: https://aclanthology.org/2021.eacl-demos.22/☆82Updated last month
- ☆24Updated 4 years ago
- Data and code for the SciFact-Open task☆24Updated 11 months ago
- SciWING is a modern toolkit for scientific document processing from WING-NUS☆62Updated last year
- Automatically detect errors in annotated corpora.☆47Updated last year
- Inducing Taxonomic Knowledge from Pretrained Transformers☆12Updated last year
- Corpus exploration platform using advanced tools such as interactive summarization and multi document coreference resolution☆11Updated last year
- PyTAIL - Interactive and Incremental Learning of NLP Models with Human in the Loop for Online Data☆12Updated last year
- ☆36Updated 2 years ago
- T-Projection is a method to perform high-quality Annotation Projection of Sequence Labeling datasets.☆11Updated last year
- SeqScore: Scoring for named entity recognition and other sequence labeling tasks☆21Updated last month
- The official repository for "Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP" published in ACL-IJNLP 2…☆18Updated 2 years ago
- SciRepEval benchmark training and evaluation scripts☆67Updated 6 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16Updated 2 years ago
- ☆14Updated 3 years ago
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here.☆25Updated last year
- ☆22Updated 3 years ago
- Entity Linking & discovery solution. Agarwal et al., "Entity Linking via Explicit Mention-Mention Coreference Modeling", NAACL 2022.☆25Updated 7 months ago