facebookresearch / dynabench
Dynamic Adversarial Benchmarking platform
☆26 Updated 2 years ago
Alternatives and similar repositories for dynabench:
Users who are interested in dynabench are comparing it to the repositories listed below.
- Documentation effort for the BookCorpus dataset ☆34 Updated 3 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated 2 years ago
- ☆12 Updated 3 years ago
- ☆19 Updated last year
- ☆14 Updated 6 months ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?" ☆21 Updated 4 years ago
- ☆33 Updated 2 weeks ago
- Efficiently computing & storing token n-grams from large corpora ☆23 Updated 6 months ago
- ☆29 Updated last year
- Generative Retrieval Transformer ☆28 Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆56 Updated 2 years ago
- ☆11 Updated 2 years ago
- ☆27 Updated 4 years ago
- A library for computing diverse text characteristics and using them to analyze data sets and models with ease. ☆40 Updated 2 years ago
- ☆44 Updated 5 months ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 Updated 3 years ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆19 Updated 2 months ago
- ☆30 Updated 3 years ago
- ☆22 Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 Updated last year
- EMNLP Findings 2020: Reevaluating Adversarial Examples in Natural Language ☆7 Updated 4 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆22 Updated last year
- Code for the paper "Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots" (NAACL-HLT 2021) ☆10 Updated 3 years ago
- Bayesian Assessment of Hypotheses ☆24 Updated last year
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020 ☆12 Updated 4 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated 9 months ago
- Minimum Description Length probing for neural network representations ☆19 Updated 2 months ago
- Multilingual Compositional Wikidata Questions (MCWQ) ☆18 Updated last year
- Embedding Recycling for Language models ☆38 Updated last year
- Large-scale query-focused multi-document Summarization dataset ☆10 Updated 3 years ago