allenai / S2APLER
S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)
☆17 Updated last year
Alternatives and similar repositories for S2APLER
Users who are interested in S2APLER are comparing it to the repositories listed below.
- Data and code for the paper "CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding" ☆14 Updated 2 years ago
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents ☆25 Updated 2 years ago
- MultiCite code and data. Models are available on Huggingface. ☆32 Updated 3 years ago
- Neighborhood Contrastive Learning for Scientific Document Representations with Citation Embeddings (EMNLP 2022 paper) ☆68 Updated 2 years ago
- Repo for Aspire - A scientific document similarity model based on matching fine-grained aspects of scientific papers. ☆53 Updated last year
- Efficient Memory-Augmented Transformers ☆34 Updated 2 years ago
- Dataset, models, and code for paper "CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation", … ☆33 Updated 2 years ago
- ☆91 Updated last year
- The dataset and code for ACL 2022 paper "SciNLI: A Corpus for Natural Language Inference on Scientific Text" are released here. ☆28 Updated last year
- ☆18 Updated 2 years ago
- ☆38 Updated 5 months ago
- Entity Linking & discovery solution. Agarwal et al., "Entity Linking via Explicit Mention-Mention Coreference Modeling", NAACL 2022. ☆27 Updated last year
- ☆22 Updated 4 months ago
- A Test Collection of Computer Science Papers for Faceted Query by Example ☆21 Updated 3 years ago
- The corresponding code for our paper: "Exploring the Challenges of Open Domain Multi-Document Summarization". Do not hesitate to open an … ☆32 Updated last year
- ☆28 Updated last year
- Multidocument Summarization for Literature Review Shared Task 2022 ☆30 Updated 2 years ago
- Submission archive for the MS MARCO passage ranking leaderboard ☆13 Updated 2 years ago
- Keeping It Simple is Hard ☆10 Updated last year
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆23 Updated last year
- Retrieval-Augmented Generation-based Relation Extraction ☆39 Updated this week
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders ☆16 Updated 2 weeks ago
- ☆10 Updated 4 years ago
- Multi-LexSum is an abstractive summarization dataset for US Civil Rights Lawsuits ☆19 Updated 2 years ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph ☆21 Updated last year
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- ☆53 Updated 3 years ago
- Simple Questions Generate Named Entity Recognition Datasets (EMNLP 2022) ☆76 Updated 2 years ago
- ☆34 Updated 2 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆20 Updated 4 months ago