AI21Labs / lm-evaluationView external linksLinks
Evaluation suite for large-scale language models.
☆129Aug 15, 2021Updated 4 years ago
Alternatives and similar repositories for lm-evaluation
Users that are interested in lm-evaluation are comparing it to the libraries listed below
Sorting:
- This is the code for loading the SenseBERT model, described in our paper from ACL 2020.☆47Mar 24, 2023Updated 2 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- ☆26Mar 11, 2023Updated 2 years ago
- OSLO: Open Source for Large-scale Optimization☆175Sep 9, 2023Updated 2 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆594Feb 3, 2026Updated last week
- ☆26May 30, 2023Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Dec 22, 2023Updated 2 years ago
- Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020☆17Dec 8, 2022Updated 3 years ago
- Directed masked autoencoders☆14Feb 5, 2026Updated last week
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- Formalization of Statement of Local Langlands Correspondence for Tori☆12Dec 18, 2018Updated 7 years ago
- Creative Instructions Project☆11Sep 4, 2023Updated 2 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- ImageNetV2 Pytorch Dataset☆42Apr 17, 2023Updated 2 years ago
- ☆77Apr 29, 2024Updated last year
- ☆17Oct 12, 2023Updated 2 years ago
- Basic Lisp-like programming language☆13Dec 25, 2020Updated 5 years ago
- A molecule generation benchmarking platform☆13Feb 22, 2018Updated 7 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 2 months ago
- Cytoscape 3 desktop version.☆17Dec 17, 2025Updated last month
- Codes for "EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases"☆13Nov 12, 2021Updated 4 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- Experimental pathtracing 3D renderer written in Ink☆14Jul 22, 2020Updated 5 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- ☆13Jul 20, 2021Updated 4 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago
- This repository contains an example of how to use the Weaviate vector search engine's text2vec-openai module☆30Apr 7, 2023Updated 2 years ago
- ☆158Mar 18, 2023Updated 2 years ago
- Quick start for MicroFlo on Arduino - clone and go!☆15Dec 31, 2017Updated 8 years ago
- ☆16Jul 2, 2025Updated 7 months ago
- ☆14Nov 28, 2022Updated 3 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 4 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,742Jan 8, 2024Updated 2 years ago
- ☆14Aug 29, 2023Updated 2 years ago
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Jun 28, 2025Updated 7 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 5 months ago
- ☆2,947Jan 15, 2026Updated 3 weeks ago