stair-lab / mlhpLinks
☆23Updated 3 months ago
Alternatives and similar repositories for mlhp
Users that are interested in mlhp are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆11Updated 10 months ago
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Updated last year
- ☆23Updated 11 months ago
- LMAct: A Benchmark for In-Context Imitation Learning with Long Multimodal Demonstrations☆25Updated 7 months ago
- Brax + Pufferlib + CARBS for gpu-accelerated robotics RL☆11Updated 6 months ago
- Code for I-RAVEN-X generation and experiments☆19Updated 3 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆121Updated last month
- ☆69Updated 9 months ago
- NeurIPS 2024 tutorial on LLM Inference☆47Updated last year
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou…☆32Updated last year
- Memory Mosaics are networks of associative memories working in concert to achieve a prediction task.☆57Updated 11 months ago
- [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"☆36Updated 10 months ago
- AIRA-dojo: a framework for developing and evaluating AI research agents☆122Updated last month
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆17Updated last month
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆284Updated last month
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆20Updated last year
- AlgoTune is a NeurIPS 2025 benchmark made up of 154 math, physics, and computer science problems. The goal is write code that solves each…☆81Updated this week
- ☆33Updated last year
- ☆27Updated last month
- ☆92Updated 5 months ago
- Package of Pathways-on-Cloud utilities☆22Updated this week
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆92Updated last year
- Physics of Language Models, Part 4☆291Updated this week
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated 3 months ago
- Learn online intrinsic rewards from LLM feedback☆45Updated last year
- Benchmarking Optimizers for LLM Pretraining☆47Updated last week
- ☆133Updated last month
- ☆33Updated last year
- ☆178Updated 3 weeks ago
- ☆38Updated last year