allenai / S2APLER
S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)
☆16 · Updated last year
Alternatives and similar repositories for S2APLER:
Users who are interested in S2APLER are comparing it to the libraries listed below:
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆14 · Updated 2 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆67 · Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface. ☆29 · Updated 2 years ago
- ☆53 · Updated 3 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆51 · Updated last year
- ☆13 · Updated 2 years ago
- ☆18 · Updated 2 years ago
- Keeping It Simple is Hard ☆10 · Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022 ☆29 · Updated 2 years ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆23 · Updated 2 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆32 · Updated last year
- ☆21 · Updated last month
- Efficient Memory-Augmented Transformers ☆34 · Updated 2 years ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", … ☆33 · Updated 2 years ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph ☆20 · Updated last year
- ☆14 · Updated 2 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆76 · Updated last year
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆21 · Updated 3 years ago
- Data and code for the SciFact-Open task ☆25 · Updated last year
- ☆11 · Updated last year
- ☆34 · Updated 2 years ago
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation." ☆16 · Updated last year
- Entity Linking & discovery solution. Agarwal et al., "Entity Linking via Explicit Mention-Mention Coreference Modeling", NAACL 2022. ☆26 · Updated 10 months ago
- Submission archive for the MS MARCO passage ranking leaderboard ☆13 · Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆27 · Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆46 · Updated last year
- ☆84 · Updated 9 months ago
- Advanced Semantics for Commonsense Knowledge Extraction (WWW 2021) ☆25 · Updated 2 years ago
- ☆36 · Updated 2 years ago
- Cross language information retrieval pipeline ☆18 · Updated last year