stair-lab / mlhp
Machine Learning from Human Preferences
☆24 Updated last week
Alternatives and similar repositories for mlhp
Users who are interested in mlhp are comparing it to the libraries listed below.
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili… ☆12 Updated 10 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors" ☆19 Updated last year
- Code for I-RAVEN-X generation and experiments ☆19 Updated 4 months ago
- KV cache compression via sparse coding ☆17 Updated 3 months ago
- NeurIPS 2024 tutorial on LLM Inference ☆47 Updated last year
- AIRA-dojo: a framework for developing and evaluating AI research agents ☆124 Updated last week
- A brief and partial summary of RLHF algorithms. ☆143 Updated 10 months ago
- Official implementation of GRAPE: Group Representational Position Encoding (https://arxiv.org/abs/2512.07805) ☆74 Updated 3 weeks ago
- ☆23 Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models ☆19 Updated 4 months ago
- Universal Neurons in GPT2 Language Models ☆30 Updated last year
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective ☆226 Updated this week
- ☆33 Updated last year
- Official repo of dataset-decomposition paper [NeurIPS 2024] ☆20 Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning. ☆125 Updated 2 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag… ☆127 Updated 3 months ago
- Brax + Pufferlib + CARBS for GPU-accelerated robotics RL ☆11 Updated 7 months ago
- Common tools for data processing ☆22 Updated last month
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆32 Updated last year
- ☆27 Updated 2 months ago
- Universal Reasoning Model ☆121 Updated 2 weeks ago
- ☆16 Updated last year
- ☆167 Updated 5 months ago
- Collection of LLM completions for reasoning-gym task datasets ☆30 Updated 6 months ago
- How to create rational LLM-based agents? Using game-theoretic workflows! ☆90 Updated 7 months ago
- ☆134 Updated last month
- [ICLR 2025] Official implementation of DICL (Disentangled In-Context Learning), featured in the paper "Zero-shot Model-based Reinforcemen… ☆26 Updated 11 months ago
- Materials for a language modeling class, broadly construed ☆33 Updated last week
- ☆465 Updated 5 months ago
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality ☆314 Updated 3 weeks ago