Evaluation suite for large-scale language models.
☆129 · Aug 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for lm-evaluation
Users that are interested in lm-evaluation are comparing it to the libraries listed below.
- ☆13 · Jun 7, 2022 · Updated 3 years ago
- Learning to Model Editing Processes ☆26 · Aug 3, 2025 · Updated 8 months ago
- ☆17 · Oct 12, 2023 · Updated 2 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA ☆14 · Jun 1, 2022 · Updated 3 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆593 · Apr 22, 2026 · Updated last week
- Source code and data for the paper "Towards String-to-Tree Neural Machine Translation" ☆16 · Dec 31, 2017 · Updated 8 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆210 · Jan 13, 2024 · Updated 2 years ago
- ☆23 · Jun 30, 2025 · Updated 10 months ago
- codebase for the SIMAT dataset and evaluation ☆38 · Feb 16, 2022 · Updated 4 years ago
- Repository for code from "On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference" (StarSem 2019) and "Don’t Take th… ☆15 · Apr 6, 2020 · Updated 6 years ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches ☆20 · Mar 31, 2023 · Updated 3 years ago
- ☆12 · Aug 14, 2019 · Updated 6 years ago
- ☆26 · May 30, 2023 · Updated 2 years ago
- ☆11 · Aug 26, 2021 · Updated 4 years ago
- Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020 ☆17 · Dec 8, 2022 · Updated 3 years ago
- MARNNs Can Learn Generalized Dyck Languages ☆12 · Nov 11, 2019 · Updated 6 years ago
- ☆77 · Apr 29, 2024 · Updated 2 years ago
- Directed masked autoencoders ☆14 · Mar 25, 2026 · Updated last month
- Japanese--Russian--English News Commentary Parallel Data ☆18 · Jul 9, 2019 · Updated 6 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions. ☆71 · Nov 8, 2022 · Updated 3 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning ☆23 · Sep 2, 2021 · Updated 4 years ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 · Dec 22, 2023 · Updated 2 years ago
- ☆158 · Mar 18, 2023 · Updated 3 years ago
- Host CIFAR-10.2 Data Set ☆13 · Sep 22, 2021 · Updated 4 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts. ☆14 · Jun 22, 2021 · Updated 4 years ago
- Simple examples of serving HuggingFace models with TensorFlow Serving ☆16 · Oct 21, 2023 · Updated 2 years ago
- Quick start for MicroFlo on Arduino - clone and go! ☆15 · Dec 31, 2017 · Updated 8 years ago
- ☆13 · Jun 16, 2021 · Updated 4 years ago
- ☆12 · Dec 20, 2018 · Updated 7 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,745 · Jan 8, 2024 · Updated 2 years ago
- Creative Instructions Project ☆11 · Sep 4, 2023 · Updated 2 years ago
- ☆49 · Jun 12, 2023 · Updated 2 years ago
- This is the official code used for WAT 2017 Description Paper titled A Bag of Useful Tricks for Practical Neural Machine Translation: Emb… ☆12 · Oct 24, 2017 · Updated 8 years ago
- ☆18 · Mar 18, 2024 · Updated 2 years ago
- ☆35 · Feb 15, 2026 · Updated 2 months ago
- Un-*** 50 billions multimodality dataset ☆24 · Sep 14, 2022 · Updated 3 years ago
- ☆2,964 · Apr 21, 2026 · Updated last week
- ImageNetV2 Pytorch Dataset ☆43 · Apr 17, 2023 · Updated 3 years ago
- Comparing PyTorch Catalyst, Ignite, Lightning by sample code ☆20 · Dec 8, 2022 · Updated 3 years ago