stair-lab / mlhpLinks
☆15Updated last month
Alternatives and similar repositories for mlhp
Users that are interested in mlhp are comparing it to the libraries listed below
Sorting:
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆15Updated 10 months ago
- ☆22Updated 9 months ago
- Package of Pathways-on-Cloud utilities☆20Updated 3 weeks ago
- Papers about infrastructure (deployment & serving) and systems for compound AI☆11Updated last year
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆111Updated 3 weeks ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 10 months ago
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆66Updated this week
- Code for I-RAVEN-X generation and experiments☆17Updated last month
- We study toy models of skill learning.☆31Updated 9 months ago
- ☆147Updated 11 months ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Updated last year
- KV cache compression via sparse coding☆14Updated last week
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 10 months ago
- Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"☆28Updated last week
- ☆33Updated 10 months ago
- ☆15Updated 11 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆116Updated 2 weeks ago
- A tool for an analysis of LLM generations.☆40Updated 3 weeks ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆25Updated 5 months ago
- ☆68Updated 3 months ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Updated last year
- ☆38Updated last year
- ☆21Updated 7 months ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆84Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆24Updated 3 weeks ago
- ☆17Updated last year
- ☆70Updated last year
- Simple repository for training small reasoning models☆44Updated 9 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year