alexa / massive
Tools and Modeling Code for the MASSIVE dataset
☆544 Updated 2 years ago
Alternatives and similar repositories for massive
Users who are interested in massive are comparing it to the libraries listed below.
- Web-scale retrieval for knowledge-intensive NLP ☆553 Updated 2 years ago
- Library for Textless Spoken Language Processing ☆543 Updated last year
- Multi-angle c(q)uestion answering ☆458 Updated 2 years ago
- Stanford's Alexa Prize socialbot ☆133 Updated last year
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆280 Updated 5 months ago
- DialoGLUE: A Natural Language Understanding Benchmark for Task-Oriented Dialogue ☆283 Updated last year
- NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations ☆785 Updated last year
- ☆510 Updated last year
- Facebook Low Resource (FLoRes) MT Benchmark ☆739 Updated last year
- xfspell — the Transformer Spell Checker ☆190 Updated 5 years ago
- SLING - A natural language frame semantics parser ☆163 Updated this week
- Search Engines with Autoregressive Language models ☆288 Updated 2 years ago
- We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically … ☆182 Updated 3 years ago
- XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 ty… ☆644 Updated 2 years ago
- An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.) ☆445 Updated this week
- Interpretable Evaluation for (Almost) All NLP Tasks ☆195 Updated 2 years ago
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use. ☆312 Updated 6 months ago
- [ACL 2021] Learning Dense Representations of Phrases at Scale; EMNLP'2021: Phrase Retrieval Learns Passage Retrieval, Too https://arxiv.o… ☆604 Updated 3 years ago
- Scripts and links to recreate the ELI5 dataset. ☆325 Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆363 Updated last year
- 📃Language Model based sentences scoring library ☆308 Updated 3 years ago
- The AI Knowledge Editor ☆182 Updated 2 years ago
- ☆182 Updated 2 years ago
- Code and data to support the paper "PAQ 65 Million Probably-Asked Questions and What You Can Do With Them" ☆203 Updated 3 years ago
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX. ☆254 Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆155 Updated last year
- A word2vec negative sampling implementation with correct CBOW update. ☆261 Updated 3 years ago
- Repository containing code for "How to Train BERT with an Academic Budget" paper ☆313 Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro… ☆161 Updated 9 months ago
- GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors ☆509 Updated 5 years ago