Benchmark API for Multidomain Language Modeling
☆25 Aug 26, 2022 Updated 3 years ago
Alternatives and similar repositories for demix-data
Users who are interested in demix-data compare it to the libraries listed below
- ☆77 Apr 29, 2024 Updated last year
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability ☆21 Jul 16, 2022 Updated 3 years ago
- NLQuAD: A Non-Factoid Long Question Answering Data Set. To be published at EACL2021 ☆13 May 18, 2021 Updated 4 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆54 Nov 21, 2022 Updated 3 years ago
- Python source code for EMNLP 2021 Findings paper: "Subword Mapping and Anchoring Across Languages". ☆13 Sep 17, 2021 Updated 4 years ago
- A framework for evaluating Machine Translation models. ☆12 May 26, 2025 Updated 9 months ago
- ☆13 Dec 11, 2021 Updated 4 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs (EMNLP 2019) ☆19 Dec 3, 2019 Updated 6 years ago
- ☆24 May 1, 2025 Updated 10 months ago
- ☆48 Jan 21, 2024 Updated 2 years ago
- Find informative examples to efficiently (human)-evaluate NLG models. ☆18 Feb 27, 2026 Updated 3 weeks ago
- ☆19 Nov 14, 2022 Updated 3 years ago
- ☆38 Apr 29, 2023 Updated 2 years ago
- A quick way to get started with Transformer Lens ☆14 Dec 13, 2023 Updated 2 years ago
- ☆54 May 8, 2023 Updated 2 years ago
- Resources for the shared task on conversational question answering SCAI-QReCC 2021 ☆29 Jun 18, 2022 Updated 3 years ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training" ☆11 Oct 27, 2025 Updated 4 months ago
- ☆24 Oct 3, 2025 Updated 5 months ago
- playing with gpt4 ☆14 Mar 17, 2023 Updated 3 years ago
- ☆68 May 18, 2023 Updated 2 years ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi… ☆31 Dec 5, 2022 Updated 3 years ago
- ☆32 Mar 13, 2025 Updated last year
- This repository hosts my experiments for the project I did with OffNote Labs. ☆10 Apr 12, 2021 Updated 4 years ago
- This is the repo for constructing a comprehensive and rigorous evaluation framework for LLM calibration. ☆13 Apr 9, 2024 Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset ☆17 Aug 16, 2023 Updated 2 years ago
- CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models ☆11 Aug 4, 2022 Updated 3 years ago
- The official code for the paper "What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization" ☆10 Jun 23, 2024 Updated last year
- Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?" ☆19 Jun 11, 2025 Updated 9 months ago
- ☆18 Sep 21, 2023 Updated 2 years ago
- ReaSCAN is a synthetic navigation task that requires models to reason about surroundings over syntactically difficult languages. (NeurIPS… ☆19 Nov 28, 2021 Updated 4 years ago
- Repository for paper Decrypting Cryptic Crosswords ☆10 Jan 15, 2022 Updated 4 years ago
- ☆11 Jan 10, 2020 Updated 6 years ago
- Conversations with Search Engines ☆14 Jun 12, 2023 Updated 2 years ago
- Measuring if attention is explanation with ROAR ☆22 Mar 3, 2023 Updated 3 years ago
- Adding new tasks to T0 without catastrophic forgetting ☆33 Oct 20, 2022 Updated 3 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models ☆17 Jun 28, 2025 Updated 8 months ago
- NTREX -- News Test References for MT Evaluation ☆88 Jun 5, 2024 Updated last year
- Higher Order SVD implementation in PyTorch ☆13 Nov 14, 2022 Updated 3 years ago
- ☆14 Aug 30, 2023 Updated 2 years ago