SakanaAI / robust-kbench
☆22Updated this week
Alternatives and similar repositories for robust-kbench
Users who are interested in robust-kbench are comparing it to the libraries listed below.
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆20Updated 6 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆110Updated last month
- Resa: Transparent Reasoning Models via SAEs☆41Updated last month
- ☆33Updated 8 months ago
- Lottery Ticket Adaptation☆39Updated 10 months ago
- Code for the paper "Function-Space Learning Rates"☆23Updated 3 months ago
- ☆32Updated last year
- A basic pure PyTorch implementation of flash attention☆16Updated 10 months ago
- 📄Small Batch Size Training for Language Models☆62Updated 3 weeks ago
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆37Updated last week
- Implementation of Dualformer☆20Updated 6 months ago
- Flash Attention Triton kernel with support for second-order derivatives☆86Updated this week
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆68Updated 3 months ago
- Code and data for paper "(How) do Language Models Track State?"☆18Updated 5 months ago
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆42Updated last year
- RS-IMLE☆42Updated 9 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 4 months ago
- A simple, performant and scalable JAX-based world modeling codebase☆73Updated this week
- Fork of Flame repo for training of some new stuff in development☆17Updated 2 weeks ago
- ☆34Updated last year
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆28Updated this week
- Implementation of a transformer for reinforcement learning using `x-transformers`☆68Updated last month
- ☆19Updated 5 months ago
- 😊 TPTT: Transforming Pretrained Transformers into Titans☆27Updated this week
- Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group☆36Updated 11 months ago
- Code for☆27Updated 9 months ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)☆37Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆92Updated 3 weeks ago