Tools and Modeling Code for the MASSIVE dataset
☆561Nov 28, 2022Updated 3 years ago
Alternatives and similar repositories for massive
Users that are interested in massive are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for SLURP paper☆109Apr 20, 2022Updated 4 years ago
- Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)☆27Jan 13, 2022Updated 4 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆652Jan 4, 2023Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆319May 28, 2020Updated 6 years ago
- The Schema-Guided Dialogue Dataset☆604Aug 7, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆97Aug 6, 2022Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆119Oct 25, 2022Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆400May 12, 2022Updated 4 years ago
- ☆13Aug 23, 2024Updated last year
- ☆363Nov 15, 2024Updated last year
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆744Jan 11, 2024Updated 2 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆287Jul 6, 2023Updated 2 years ago
- Repo for external large-scale work☆6,545Apr 27, 2024Updated 2 years ago
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Please see the readme file as well as our 2019 EMNLP paper linked here -->☆222Apr 24, 2024Updated 2 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,233Sep 30, 2025Updated 8 months ago
- Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).☆310Apr 30, 2024Updated 2 years ago
- Zero-shot dialogue state tracking (DST)☆83Nov 18, 2021Updated 4 years ago
- A dataset containing human-human knowledge-grounded open-domain conversations.☆671Aug 2, 2024Updated last year
- Large datasets for conversational AI☆1,398Nov 16, 2019Updated 6 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆21Jun 12, 2023Updated 3 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆157Jul 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-angle c(q)uestion answering☆458Aug 22, 2022Updated 3 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆222Jun 1, 2021Updated 5 years ago
- Pre-Trained Models for ToD-BERT☆295Jul 17, 2023Updated 2 years ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆53Apr 23, 2022Updated 4 years ago
- ☆133Jun 2, 2026Updated 2 weeks ago
- New dataset☆310Aug 31, 2021Updated 4 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated 4 months ago
- Efficient few-shot learning with Sentence Transformers☆2,746May 26, 2026Updated 3 weeks ago
- The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…☆20Dec 7, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated 2 years ago
- ☆510Sep 23, 2020Updated 5 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 5 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,654Jun 9, 2026Updated last week
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆144Nov 1, 2022Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago