machelreid/m2d2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/machelreid/m2d2)

machelreid / m2d2

M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer

☆54

Alternatives and similar repositories for m2d2

Users that are interested in m2d2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

jungokasai / deep-shallow
View on GitHub
☆43Sep 16, 2020Updated 5 years ago
kernelmachine / demix
View on GitHub
DEMix Layers for Modular Language Modeling
☆54Feb 25, 2026Updated 5 months ago
seanie12 / SWEP
View on GitHub
[ACL 2021] Learning to Perturb Word Embeddings for Out-of-distribution QA
☆16May 11, 2022Updated 4 years ago
kernelmachine / demix-data
View on GitHub
Benchmark API for Multidomain Language Modeling
☆25Aug 26, 2022Updated 3 years ago
berlino / weaksp_em19
View on GitHub
Learning Semantic Parsers from Denotations with Latent Structured Alignments and Abstract Programs(EMNLP2019)
☆19Dec 3, 2019Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
UKPLab / emnlp2020-multicqa
View on GitHub
MultiCQA: Zero-Shot Transfer of Self-Supervised Text Matching Models on a Massive Scale
☆14Mar 22, 2021Updated 5 years ago
alexandra-chron / hierarchical-domain-adaptation
View on GitHub
Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper.
☆32Sep 26, 2023Updated 2 years ago
Xingwei-Tan / hyper-event-TempRel
View on GitHub
Poincaré Event Temporal Embeddings and Hyperbolic GRU for Event TempRel Extraction
☆11Nov 8, 2021Updated 4 years ago
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
RUCAIBox / DCLR
View on GitHub
Code of ACL 2022 paper Debiased Contrastive Learning of Unsupervised Sentence Representations
☆32Mar 16, 2022Updated 4 years ago
facebookresearch / PAQ
View on GitHub
Code and data to support the paper "PAQ 65 Million Probably-Asked Questions andWhat You Can Do With Them"
☆211Aug 31, 2021Updated 4 years ago
JunShern / few-shot-adaptation
View on GitHub
Exploring Few-Shot Adaptation of Language Models with Tables
☆25Aug 22, 2022Updated 3 years ago
facebookresearch / mega
View on GitHub
Sequence modeling with Mega.
☆303Jan 28, 2023Updated 3 years ago
nyu-mll / SQuALITY
View on GitHub
Query-focused summarization data
☆44Feb 17, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
martiansideofthemoon / rankgen
View on GitHub
Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx…
☆140Aug 2, 2023Updated 2 years ago
joeljang / FLM
View on GitHub
All-in-one repository for Fine-tuning & Pretraining (Large) Language Models
☆15Mar 8, 2023Updated 3 years ago
nkandpa2 / long_tail_knowledge
View on GitHub
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Apr 12, 2023Updated 3 years ago
wangruicn / DialogueCSE
View on GitHub
DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings
☆19Nov 24, 2021Updated 4 years ago
kernelmachine / cbtm
View on GitHub
Code repository for the c-BTM paper
☆109Sep 26, 2023Updated 2 years ago
UKPLab / acl2024-triple-encoders
View on GitHub
triple-encoders is a library for contextualizing distributed Sentence Transformers representations.
☆15Sep 3, 2024Updated last year
martiansideofthemoon / longeval-summarization
View on GitHub
Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…
☆45Aug 10, 2024Updated last year
zoranmedic / mdcr
View on GitHub
Benchmark dataset for the evaluation of scientific article representations on the task of citation recommendation across various scientif…
☆12Oct 21, 2022Updated 3 years ago
emorynlp / seq2seq-corenlp
View on GitHub
☆13Feb 7, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
FranxYao / PoincareProbe
View on GitHub
Implementation of ICLR 21 paper: Probing BERT in Hyperbolic Spaces
☆60Mar 23, 2021Updated 5 years ago
copenlu / scientific-information-change
View on GitHub
Code for the paper "Modeling Information Change in Science Communication with Semantically Matched Paraphrases" from EMNLP 2022
☆13Oct 20, 2022Updated 3 years ago
facebookresearch / NPM
View on GitHub
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
☆159Jan 6, 2023Updated 3 years ago
belindal / TaskBench500
View on GitHub
Suite of 500 procedurally-generated NLP tasks to study language model adaptability
☆21Jul 16, 2022Updated 4 years ago
malteos / clp-transfer
View on GitHub
Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
☆30Jan 25, 2023Updated 3 years ago
bigscience-workshop / architecture-objective
View on GitHub
☆100Jul 25, 2023Updated 3 years ago
ictnlp / PCFG-NAT
View on GitHub
Code for NeurIPS 2023 paper "Non-autoregressive Machine Translation with Probabilistic Context-free Grammar".
☆12Jan 4, 2024Updated 2 years ago
isi-nlp / bolinas
View on GitHub
SHERG rule extraction and parsing tools
☆24Oct 9, 2015Updated 10 years ago
jongwooko / NASH-Pruning-Official
View on GitHub
Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …
☆17Oct 17, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
facebookresearch / reconsider
View on GitHub
ReConsider is a re-ranking model that re-ranks the top-K (passage, answer-span) predictions of an Open-Domain QA Model like DPR (Karpukhi…
☆50Apr 26, 2021Updated 5 years ago
haebeom-lee / metadrop
View on GitHub
Tensorflow implementation of "Meta Dropout: Learning to Perturb Latent Features for Generalization" (ICLR 2020)
☆27Apr 27, 2020Updated 6 years ago
spraakbanken / multiged-2023
View on GitHub
☆15Apr 12, 2023Updated 3 years ago
jlin816 / rewards-from-language
View on GitHub
Code and data for "Inferring Rewards from Language in Context" [ACL 2022].
☆16May 22, 2022Updated 4 years ago
facebookresearch / KILT
View on GitHub
Library for Knowledge Intensive Language Tasks
☆979Mar 31, 2022Updated 4 years ago
google-research / byt5
View on GitHub
☆547Feb 13, 2024Updated 2 years ago
Arvid-pku / ATOKE
View on GitHub
[AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model
☆13Dec 17, 2023Updated 2 years ago