mlcommons / dynabench
☆22 Updated this week
Alternatives and similar repositories for dynabench
Users who are interested in dynabench are comparing it to the repositories listed below
- Probabilistic LLM evaluations. [CogSci2023; ACL2023] ☆73 Updated 11 months ago
- Documentation effort for the BookCorpus dataset ☆34 Updated 4 years ago
- ☆79 Updated last year
- Code for SaGe subword tokenizer (EACL 2023) ☆25 Updated 7 months ago
- Stuff related to scraping the Code Review StackExchange ☆11 Updated 2 years ago
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21) ☆41 Updated 3 years ago
- ☆55 Updated this week
- Embedding Recycling for Language models ☆39 Updated 2 years ago
- ☆25 Updated 2 years ago
- ☆19 Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆33 Updated last year
- ☆90 Updated 3 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 Updated last year
- ☆48 Updated last year
- arXiv plain text extraction ☆41 Updated 2 years ago
- ☆67 Updated 2 years ago
- A diff tool for language models ☆42 Updated last year
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models. ☆177 Updated 3 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation ☆26 Updated 2 years ago
- Open-source Human Feedback Library ☆11 Updated last year
- Hugging Face and Pyserini interoperability ☆20 Updated 2 years ago
- ☆44 Updated 8 months ago
- Weakly Supervised Text-to-SQL Parsing through Question Decomposition ☆22 Updated last year
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆58 Updated 2 years ago
- A highly sophisticated sequence-to-sequence model for code generation ☆40 Updated 4 years ago
- ☆54 Updated 2 years ago
- The codebase for our ACL2023 paper: Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learni… ☆30 Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆23 Updated last year
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆21 Updated 3 weeks ago