M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for m2d2
Users who are interested in m2d2 are comparing it to the libraries listed below.
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- ☆44Sep 16, 2020Updated 5 years ago
- [ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA☆16May 11, 2022Updated 3 years ago
- Benchmark API for Multidomain Language Modeling☆25Aug 26, 2022Updated 3 years ago
- Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs (EMNLP 2019)☆19Dec 3, 2019Updated 6 years ago
- MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale☆14Mar 22, 2021Updated 5 years ago
- Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction☆11Nov 8, 2021Updated 4 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.☆32Sep 26, 2023Updated 2 years ago
- Convenient Text-to-Text Training for Transformers☆19Dec 10, 2021Updated 4 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them"☆210Aug 31, 2021Updated 4 years ago
- SHERG rule extraction and parsing tools☆24Oct 9, 2015Updated 10 years ago
- Exploring Few-Shot Adaptation of Language Models with Tables☆24Aug 22, 2022Updated 3 years ago
- Sequence modeling with Mega.☆303Jan 28, 2023Updated 3 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…☆138Aug 2, 2023Updated 2 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Models☆15Mar 8, 2023Updated 3 years ago
- Query-focused summarization data☆44Feb 17, 2023Updated 3 years ago
- DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings☆19Nov 24, 2021Updated 4 years ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆78Apr 12, 2023Updated 2 years ago
- Code and dataset "ZEST" from "Learning from task descriptions", Weller et al, EMNLP 2020☆17Mar 15, 2021Updated 5 years ago
- Official code for LEWIS, from: "LEWIS: Levenshtein Editing for Unsupervised Text Style Transfer", ACL-IJCNLP 2021 Findings by Machel Rei…☆31Oct 24, 2022Updated 3 years ago
- triple-encoders is a library for contextualizing distributed Sentence Transformers representations.☆15Sep 3, 2024Updated last year
- Code repository for the c-BTM paper☆108Sep 26, 2023Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆44Aug 10, 2024Updated last year
- ☆13Feb 7, 2023Updated 3 years ago
- Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces☆59Mar 23, 2021Updated 5 years ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022☆13Oct 20, 2022Updated 3 years ago
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper: https://arxiv.org/abs/2212.01349)☆158Jan 6, 2023Updated 3 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning☆30Jan 25, 2023Updated 3 years ago
- Suite of 500 procedurally-generated NLP tasks to study language model adaptability☆21Jul 16, 2022Updated 3 years ago
- Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)☆27Apr 27, 2020Updated 5 years ago
- ☆99Jul 25, 2023Updated 2 years ago
- Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".☆12Jan 4, 2024Updated 2 years ago
- ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…☆49Apr 26, 2021Updated 4 years ago
- Library for Knowledge Intensive Language Tasks☆970Mar 31, 2022Updated 3 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆16May 22, 2022Updated 3 years ago
- ☆539Feb 13, 2024Updated 2 years ago
- A template primarily for PhD theses but also suitable for Bachelor's or Master's theses☆11Nov 10, 2021Updated 4 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago