Evaluation suite for large-scale language models.
☆130Aug 15, 2021Updated 4 years ago
Alternatives and similar repositories for lm-evaluation
Users that are interested in lm-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is the code for loading the SenseBERT model, described in our paper from ACL 2020.☆48Mar 24, 2023Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- ☆14Jun 8, 2018Updated 8 years ago
- ☆13Jun 7, 2022Updated 4 years ago
- ☆27Mar 21, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 10 months ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA☆14Jun 1, 2022Updated 4 years ago
- Task-based datasets, preprocessing, and evaluation for sequence models.☆594May 12, 2026Updated 3 weeks ago
- A remark plugin for making interactive markdown documents with Tangle.☆13Oct 25, 2021Updated 4 years ago
- OSLO: Open Source for Large-scale Optimization☆174Sep 9, 2023Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆210Jan 13, 2024Updated 2 years ago
- ☆24Jun 30, 2025Updated 11 months ago
- Stochastic Optimization for Global Contrastive Learning without Large Mini-batches☆20Mar 31, 2023Updated 3 years ago
- ☆12Aug 14, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆26May 30, 2023Updated 3 years ago
- ☆11Aug 26, 2021Updated 4 years ago
- MARNNs Can Learn Generalized Dyck Languages☆12Nov 11, 2019Updated 6 years ago
- ☆77Apr 29, 2024Updated 2 years ago
- Directed masked autoencoders☆15Mar 25, 2026Updated 2 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Aug 9, 2023Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Nov 8, 2022Updated 3 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- This is a reproduction of the paper 'Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications wit…☆13Aug 22, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆157Mar 18, 2023Updated 3 years ago
- Host CIFAR-10.2 Data Set☆13Sep 22, 2021Updated 4 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 4 years ago
- Simple examples of serving HuggingFace models with TensorFlow Serving☆16Oct 21, 2023Updated 2 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- Patching open-vocabulary models by interpolating weights☆91Sep 28, 2023Updated 2 years ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,750Jan 8, 2024Updated 2 years ago
- Creative Instructions Project☆11Sep 4, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Nov 28, 2022Updated 3 years ago
- ☆49Jun 12, 2023Updated 2 years ago
- ☆18Mar 18, 2024Updated 2 years ago
- Language model with phrase induction☆14Jun 13, 2019Updated 6 years ago
- Experimental pathtracing 3D renderer written in Ink☆14Jul 22, 2020Updated 5 years ago
- Un-*** 50 billions multimodality dataset☆24Sep 14, 2022Updated 3 years ago
- ☆2,966May 20, 2026Updated 3 weeks ago