stair-lab / mlhp
☆12 Updated last month
Alternatives and similar repositories for mlhp
Users that are interested in mlhp are comparing it to the libraries listed below
- ☆21 Updated 7 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors" ☆15 Updated 9 months ago
- Package of Pathways-on-Cloud utilities ☆19 Updated this week
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations ☆23 Updated 4 months ago
- A tool for analyzing LLM generations. ☆40 Updated last month
- Common tools for data processing ☆20 Updated 2 weeks ago
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆19 Updated 8 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆32 Updated last year
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is to write code that solves each… ☆60 Updated this week
- ☆12 Updated 2 months ago
- ☆56 Updated 10 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆99 Updated last month
- Collection of LLM completions for reasoning-gym task datasets ☆29 Updated 2 months ago
- ☆15 Updated 9 months ago
- KV cache compression via sparse coding ☆14 Updated 4 months ago
- Minimal GRPO implementation from scratch ☆97 Updated 6 months ago
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction ☆70 Updated 4 months ago
- Code for I-RAVEN-X generation and experiments ☆16 Updated last week
- A specialized RWKV-7 model for Othello (a.k.a. Reversi) that predicts legal moves, evaluates positions, and performs in-context search. It… ☆42 Updated 8 months ago
- Papers about infrastructure (deployment & serving) and systems for compound AI ☆11 Updated 10 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation ☆52 Updated 3 weeks ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆111 Updated last month
- NeurIPS 2024 tutorial on LLM Inference ☆48 Updated 9 months ago
- Simple repository for training small reasoning models ☆40 Updated 7 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers ☆20 Updated 6 months ago
- ☆21 Updated 5 months ago
- ☆67 Updated last year
- ☆146 Updated 10 months ago
- Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and… ☆66 Updated 5 months ago
- Esoteric Language Models ☆99 Updated 2 months ago