FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆64Jan 26, 2026Updated last month
Alternatives and similar repositories for RL-Compositionality
Users that are interested in RL-Compositionality are comparing it to the libraries listed below
Sorting:
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated last month
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆42Jul 26, 2025Updated 7 months ago
- Benchmarking Optimizers for LLM Pretraining☆56Dec 30, 2025Updated 2 months ago
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.☆87Updated this week
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 4 months ago
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Oct 9, 2024Updated last year
- ☆13Nov 21, 2025Updated 3 months ago
- ☆33Jan 7, 2025Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆458Feb 19, 2026Updated last month
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- ROS2 Bag file parsing☆10Mar 14, 2020Updated 6 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- ☆64Jan 12, 2026Updated 2 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- ☆28Feb 15, 2026Updated last month
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆222Nov 27, 2025Updated 3 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆21Jan 11, 2026Updated 2 months ago
- Omnigrok: Grokking Beyond Algorithmic Data☆63Feb 24, 2023Updated 3 years ago
- ☆10Nov 6, 2024Updated last year
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- Evolutionary Quantitative Trading Strategy Development System. Fork of OpenEvolve☆33May 30, 2025Updated 9 months ago
- Tooling for exact and MinHash deduplication of large-scale text datasets☆73Mar 9, 2026Updated last week
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo…☆38Mar 11, 2026Updated last week
- ☆12Mar 3, 2023Updated 3 years ago
- tuimorphic choose-your-own-adventure story game☆18Mar 3, 2026Updated 2 weeks ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆62Feb 22, 2026Updated 3 weeks ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆79Dec 29, 2025Updated 2 months ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- ☆31Nov 30, 2025Updated 3 months ago
- ☆12Jul 30, 2025Updated 7 months ago
- CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics☆28Nov 1, 2025Updated 4 months ago
- ☆22Dec 18, 2025Updated 3 months ago
- lol☆10Mar 12, 2021Updated 5 years ago
- My toy model for natural language inference task.☆11Aug 6, 2018Updated 7 years ago
- ☆14Dec 13, 2022Updated 3 years ago
- Pytorch implementation of ICML-2024 "Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching"☆26Jun 23, 2024Updated last year
- The official github repo for "Diffusion Language Models are Super Data Learners".☆223Nov 6, 2025Updated 4 months ago
- Library that provides metrics to assess representation quality☆24Feb 5, 2025Updated last year