tau-nlp / scrollsLinks

The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".

☆69

Alternatives and similar repositories for scrolls

Users that are interested in scrolls are comparing it to the libraries listed below

Sorting:

kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆54Updated 4 years ago
GEM-benchmark / GEM-metrics
Automatic metrics for GEM tasks
☆67Updated 3 years ago
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆78Updated 2 years ago
nyu-mll / SQuALITY
Query-focused summarization data
☆42Updated 2 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59Updated 2 years ago
kernelmachine / demix-data
Benchmark API for Multidomain Language Modeling
☆25Updated 3 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆282Updated 3 years ago
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85Updated 3 years ago
McGill-NLP / polytropon
☆54Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆39Updated 2 years ago
suzgunmirac / crowd-sampling
Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding
☆18Updated 3 years ago
nyu-mll / quality
☆143Updated 10 months ago
yanaiela / pararel
☆47Updated last year
INK-USC / CrossFit
Code for paper "CrossFit : A Few-shot Learning Challenge for Cross-task Generalization in NLP" (https://arxiv.org/abs/2104.08835)
☆113Updated 3 years ago
AkariAsai / evidentiality_qa
The official implemetation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).
☆45Updated 2 years ago
alisawuffles / DExperts
code associated with ACL 2021 DExperts paper
☆118Updated 2 years ago
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆142Updated 3 years ago
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆182Updated 3 years ago
jzbjyb / lm-calibration
☆35Updated 4 years ago
ryokamoi / wice
This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.
☆42Updated last year
hitz-zentroa / lm-contamination
The LM Contamination Index is a manually created database of contamination evidences for LMs.
☆81Updated last year
yuzhaouoe / pretraining-data-packing
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆22Updated last year
nayeon7lee / FactualityPrompt
☆87Updated 3 years ago
cambridgeltl / composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Updated last year
allenai / bff
☆38Updated last year
jxhe / efficient-knnlm
Pytorch implementation of paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021)
☆74Updated 3 years ago
machelreid / m2d2
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Updated 3 years ago
qkaren / COLD_decoding
☆113Updated 3 years ago
McGill-NLP / FaithDial
☆50Updated 2 years ago
swj0419 / kNN_prompt
TBC
☆27Updated 3 years ago