facebookresearch / dynabench
Dynamic Adversarial Benchmarking platform
☆26Updated 2 years ago
Alternatives and similar repositories for dynabench:
Users that are interested in dynabench are comparing it to the libraries listed below
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆18Updated 4 months ago
- Documentation effort for the BookCorpus dataset☆33Updated 3 years ago
- Interview-based evaluation of LLMs☆15Updated last month
- ☆8Updated 7 months ago
- ☆29Updated last year
- ☆12Updated 2 years ago
- Generative Retrieval Transformer☆28Updated last year
- ☆17Updated 6 months ago
- ☆44Updated 3 months ago
- Minimum Description Length probing for neural network representations☆18Updated 3 weeks ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆13Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆47Updated 7 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆32Updated 8 months ago
- ☆14Updated 4 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆16Updated 11 months ago
- ☆22Updated 2 years ago
- ☆30Updated 3 years ago
- Submission to the inverse scaling prize☆23Updated last year
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- ☆13Updated 6 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Updated last year
- Simplifying parsing of large jsonline files in NLP Workflows☆12Updated 3 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Robust Cross-lingual Embeddings from Parallel Sentences☆21Updated 4 years ago
- NLG Best Practices for Data-Efficient Modeling How to Train Production-Ready Models with Little Data☆10Updated 3 years ago
- The Python library with command line tools to interact with Dynabench(https://dynabench.org/), such as uploading models.☆55Updated 2 years ago