alexa/massive

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/alexa/massive)

alexa / massive

Tools and Modeling Code for the MASSIVE dataset

☆564

Alternatives and similar repositories for massive

Users that are interested in massive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pswietojanski / slurp
View on GitHub
Repository for SLURP paper
☆109Apr 20, 2022Updated 4 years ago
amazon-science / multiatis
View on GitHub
Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)
☆27Jan 13, 2022Updated 4 years ago
google-research / xtreme
View on GitHub
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…
☆651Jan 4, 2023Updated 3 years ago
google-research-datasets / tydiqa
View on GitHub
TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …
☆319May 28, 2020Updated 6 years ago
google-research-datasets / dstc8-schema-guided-dialogue
View on GitHub
The Schema-Guided Dialogue Dataset
☆608Aug 7, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
google-research / dialog-inpainting
View on GitHub
☆97Aug 6, 2022Updated 3 years ago
amazon-science / mintaka
View on GitHub
Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)
☆119Oct 25, 2022Updated 3 years ago
facebookresearch / anli
View on GitHub
Adversarial Natural Language Inference Benchmark
☆402May 12, 2022Updated 4 years ago
fyvo / WMT-Biomed-Test
View on GitHub
☆13Aug 23, 2024Updated last year
monologg / JointBERT
View on GitHub
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"
☆747Jan 11, 2024Updated 2 years ago
alexa / dialoglue
View on GitHub
DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue
☆288Jul 6, 2023Updated 3 years ago
facebookresearch / metaseq
View on GitHub
Repo for external large-scale work
☆6,551Apr 27, 2024Updated 2 years ago
LooperXX / ProSLU
View on GitHub
Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"
☆35May 26, 2024Updated 2 years ago
machelreid / m2d2
View on GitHub
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
facebookresearch / Zero-Shot-DST
View on GitHub
Zero-shot dialogue state tracking (DST)
☆83Nov 18, 2021Updated 4 years ago
google-research-datasets / Taskmaster
View on GitHub
Please see the readme file as well as our 2019 EMNLP paper linked here -->
☆222Apr 24, 2024Updated 2 years ago
facebookresearch / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,250Sep 30, 2025Updated 9 months ago
coastalcph / seq2sparql
View on GitHub
Multilingual Compositional Wikidata Questions (MCWQ)
☆20Jun 12, 2023Updated 3 years ago
jianguoz / Few-Shot-Intent-Detection
View on GitHub
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …
☆159Jul 19, 2023Updated 3 years ago
alexa / Topical-Chat
View on GitHub
A dataset containing human-human knowledge-grounded open-domain conversations.
☆673Aug 2, 2024Updated last year
PolyAI-LDN / conversational-datasets
View on GitHub
Large datasets for conversational AI
☆1,402Nov 16, 2019Updated 6 years ago
allenai / macaw
View on GitHub
Multi-angle c(q)uestion answering
☆459Aug 22, 2022Updated 3 years ago
microsoft / task_oriented_dialogue_as_dataflow_synthesis
View on GitHub
Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).
☆310Apr 30, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jasonwu0731 / ToD-BERT
View on GitHub
Pre-Trained Models for ToD-BERT
☆295Jul 17, 2023Updated 3 years ago
salesforce / Converse
View on GitHub
☆132Jun 2, 2026Updated last month
facebookresearch / LASER
View on GitHub
Language-Agnostic SEntence Representations
☆3,662May 2, 2024Updated 2 years ago
facebookresearch / MLQA
View on GitHub
New dataset
☆311Aug 31, 2021Updated 4 years ago
amazon-science / pizza-semantic-parsing-dataset
View on GitHub
The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…
☆20Dec 7, 2022Updated 3 years ago
huggingface / setfit
View on GitHub
Efficient few-shot learning with Sentence Transformers
☆2,777May 26, 2026Updated 2 months ago
grill-lab / OAT
View on GitHub
An open source toolkit for multimodal generative conversational task assistants, helping assist people with real-world complex tasks
☆37May 23, 2024Updated 2 years ago
clinc / oos-eval
View on GitHub
Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)
☆222Jun 1, 2021Updated 5 years ago
microsoft / DialogLM
View on GitHub
Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."
☆144Nov 1, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
ChestnutWYN / ACL2021-Novel-Slot-Detection
View on GitHub
☆17Jul 9, 2021Updated 5 years ago
theblackcat102 / unify-learning-paradigms
View on GitHub
data collator for UL2 and U-PaLM
☆29Aug 20, 2023Updated 2 years ago
andreamad8 / TASK-ORIENTED-LM-FEWSHOT
View on GitHub
Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems
☆22May 28, 2021Updated 5 years ago
jwieting / paraphrastic-representations-at-scale
View on GitHub
☆74Jul 2, 2021Updated 5 years ago
PAIR-code / lit
View on GitHub
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …
☆3,658Updated this week
sonos / nlu-benchmark
View on GitHub
☆510Sep 23, 2020Updated 5 years ago
facebookresearch / accentor
View on GitHub
Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)
☆72Oct 12, 2021Updated 4 years ago