Evaluation suite for large-scale language models.
☆129Aug 15, 2021Updated 4 years ago
Alternatives and similar repositories for lm-evaluation
Users that are interested in lm-evaluation are comparing it to the libraries listed below
Sorting:
- OSLO: Open Source for Large-scale Optimization☆175Sep 9, 2023Updated 2 years ago
- codebase for the SIMAT dataset and evaluation☆38Feb 16, 2022Updated 4 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆594Updated this week
- Un-*** 50 billions multimodality dataset☆23Sep 14, 2022Updated 3 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- Source code for "Gradient Based Memory Editing for Task-Free Continual Learning", 4th Lifelong ML Workshop@ICML 2020☆17Dec 8, 2022Updated 3 years ago
- ☆26May 30, 2023Updated 2 years ago
- A repository of projects and datasets under active development by Alignment Lab AI☆22Dec 22, 2023Updated 2 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- Directed masked autoencoders☆14Feb 20, 2026Updated 2 weeks ago
- ☆10Nov 17, 2022Updated 3 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- ☆33Jan 4, 2026Updated 2 months ago
- Formalization of Statement of Local Langlands Correspondence for Tori☆12Dec 18, 2018Updated 7 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- ImageNetV2 Pytorch Dataset☆42Apr 17, 2023Updated 2 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 3 years ago
- ☆77Apr 29, 2024Updated last year
- Implementation in the framework of my bachelor thesis: Generative Modelling using Capsule Generative Adversarial Networks☆12Feb 20, 2026Updated 2 weeks ago
- Basic Lisp-like programming language☆13Dec 25, 2020Updated 5 years ago
- Codes for "EDG-based Question Decomposition for Complex Question Answering over Knowledge Bases"☆13Nov 12, 2021Updated 4 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- Experimental pathtracing 3D renderer written in Ink☆14Jul 22, 2020Updated 5 years ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 3 months ago
- ☆17Oct 12, 2023Updated 2 years ago
- A molecule generation benchmarking platform☆13Feb 22, 2018Updated 8 years ago
- ☆16Jul 2, 2025Updated 8 months ago
- Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research☆15May 30, 2024Updated last year
- Quick start for MicroFlo on Arduino - clone and go!☆15Dec 31, 2017Updated 8 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 4 years ago
- The Avogadro website☆12Oct 24, 2023Updated 2 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- ☆14Nov 28, 2022Updated 3 years ago
- This repository contains an example of how to use the Weaviate vector search engine's text2vec-openai module☆30Apr 7, 2023Updated 2 years ago
- ☆14Jun 8, 2018Updated 7 years ago
- ☆13Jul 20, 2021Updated 4 years ago
- ☆158Mar 18, 2023Updated 2 years ago
- Experiments for the NeurIPS 2021 paper "Cockpit: A Practical Debugging Tool for the Training of Deep Neural Networks"☆13Oct 25, 2021Updated 4 years ago