stair-lab / mlhp
☆17Updated last month
Alternatives and similar repositories for mlhp
Users that are interested in mlhp are comparing it to the libraries listed below
- ☆22Updated 9 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆17Updated 11 months ago
- Physics of Language Models, Part 4☆260Updated 4 months ago
- Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"☆28Updated 3 weeks ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated 10 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- ☆38Updated last year
- ☆22Updated last week
- ☆33Updated 10 months ago
- RLP: Reinforcement as a Pretraining Objective☆201Updated last month
- KV cache compression via sparse coding☆14Updated last month
- Package of Pathways-on-Cloud utilities☆21Updated last week
- Code for I-RAVEN-X generation and experiments☆18Updated 2 months ago
- ☆78Updated 4 months ago
- ☆28Updated last month
- ☆56Updated last year
- [ICLR 2025] DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆84Updated 3 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆23Updated 8 months ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆15Updated last week
- Benchmarking Optimizers for LLM Pretraining☆41Updated 2 weeks ago
- A brief and partial summary of RLHF algorithms.☆139Updated 8 months ago
- Code and data for paper "(How) do Language Models Track State?"☆20Updated 7 months ago
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆52Updated 9 months ago
- Privacy backdoors☆51Updated last year
- NeurIPS 2024 tutorial on LLM Inference☆47Updated 11 months ago
- Esoteric Language Models☆107Updated this week
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is to write code that solves each…☆71Updated this week
- ☆17Updated 6 months ago
- Reinforcing General Reasoning without Verifiers☆92Updated 5 months ago
- ☆43Updated last week