Tools and Modeling Code for the MASSIVE dataset
☆557Nov 28, 2022Updated 3 years ago
Alternatives and similar repositories for massive
Users that are interested in massive are comparing it to the libraries listed below.
- Repository for SLURP paper☆109Apr 20, 2022Updated 3 years ago
- Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)☆27Jan 13, 2022Updated 4 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty…☆652Jan 4, 2023Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆318May 28, 2020Updated 5 years ago
- The Schema-Guided Dialogue Dataset☆602Aug 7, 2023Updated 2 years ago
- ☆97Aug 6, 2022Updated 3 years ago
- Dataset from the paper "Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering" (COLING 2022)☆118Oct 25, 2022Updated 3 years ago
- Adversarial Natural Language Inference Benchmark☆399May 12, 2022Updated 3 years ago
- ☆13Aug 23, 2024Updated last year
- Pytorch implementation of JointBERT: "BERT for Joint Intent Classification and Slot Filling"☆740Jan 11, 2024Updated 2 years ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue☆287Jul 6, 2023Updated 2 years ago
- Repo for external large-scale work☆6,540Apr 27, 2024Updated last year
- Open source code and data for AAAI 2022 Oral Paper "Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding"☆35May 26, 2024Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆54Nov 21, 2022Updated 3 years ago
- Please see the readme file as well as our 2019 EMNLP paper linked here -->☆221Apr 24, 2024Updated last year
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆32,201Sep 30, 2025Updated 6 months ago
- Code to reproduce experiments in the paper "Task-Oriented Dialogue as Dataflow Synthesis" (TACL 2020).☆309Apr 30, 2024Updated last year
- Zero-shot dialogue state tracking (DST)☆83Nov 18, 2021Updated 4 years ago
- A dataset containing human-human knowledge-grounded open-domain conversations.☆671Aug 2, 2024Updated last year
- Large datasets for conversational AI☆1,392Nov 16, 2019Updated 6 years ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆20Jun 12, 2023Updated 2 years ago
- Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines …☆155Jul 19, 2023Updated 2 years ago
- Multi-angle c(q)uestion answering☆458Aug 22, 2022Updated 3 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)☆221Jun 1, 2021Updated 4 years ago
- Pre-Trained Models for ToD-BERT☆295Jul 17, 2023Updated 2 years ago
- Finetune wav2vec2-large-xlsr-53 with Thai Common Voice Corpus 7.0☆53Apr 23, 2022Updated 3 years ago
- ☆133Jul 5, 2023Updated 2 years ago
- New dataset☆311Aug 31, 2021Updated 4 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Feb 16, 2026Updated 2 months ago
- Efficient few-shot learning with Sentence Transformers☆2,715Updated this week
- The PIZZA dataset continues the exploration of task-oriented parsing by introducing a new dataset for parsing pizza and drink orders, who…☆20Dec 7, 2022Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Jun 18, 2024Updated last year
- ☆511Sep 23, 2020Updated 5 years ago
- data collator for UL2 and U-PaLM☆29Aug 20, 2023Updated 2 years ago
- Language Models as Few-Shot Learner for Task-Oriented Dialogue Systems☆22May 28, 2021Updated 4 years ago
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic …☆3,652Updated this week
- Official Implementation of "DialogLM: Pre-trained Model for Long Dialogue Understanding and Summarization."☆143Nov 1, 2022Updated 3 years ago
- ☆75Jul 2, 2021Updated 4 years ago
- Data & Code for ACCENTOR: "Adding Chit-Chat to Enhance Task-Oriented Dialogues" (NAACL 2021)☆72Oct 12, 2021Updated 4 years ago