FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68 · Jan 26, 2026 · Updated 5 months ago
Alternatives and similar repositories for RL-Compositionality
Users who are interested in RL-Compositionality are comparing it to the libraries listed below.
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆18 · Feb 9, 2026 · Updated 4 months ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image? ☆43 · Jul 26, 2025 · Updated 11 months ago
- Benchmarking Optimizers for LLM Pretraining ☆60 · May 3, 2026 · Updated last month
- Code for "What really matters in matrix-whitening optimizers?" ☆24 · Oct 31, 2025 · Updated 7 months ago
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark" ☆27 · Feb 9, 2025 · Updated last year
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates ☆27 · Jul 1, 2025 · Updated 11 months ago
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM) ☆27 · Oct 9, 2024 · Updated last year
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers. ☆137 · Apr 2, 2026 · Updated 2 months ago
- ☆33 · Jan 7, 2025 · Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- ROS2 Bag file parsing ☆10 · Mar 14, 2020 · Updated 6 years ago
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation ☆12 · May 24, 2021 · Updated 5 years ago
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆50 · Jul 18, 2025 · Updated 11 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'. ☆21 · Apr 3, 2025 · Updated last year
- ☆64 · Mar 30, 2026 · Updated 3 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning" ☆11 · Oct 11, 2024 · Updated last year
- ☆33 · Oct 15, 2025 · Updated 8 months ago
- System behavior is often expressed by causal relations in requirements (e.g. if event 1 then event 2). Automatically extracting this embe… ☆12 · Oct 24, 2021 · Updated 4 years ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL ☆114 · Updated this week
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their… ☆22 · Jan 11, 2026 · Updated 5 months ago
- Omnigrok: Grokking Beyond Algorithmic Data ☆65 · Feb 24, 2023 · Updated 3 years ago
- The "CoT-ICL Lab" framework for meta-training transformers ☆11 · Jun 3, 2026 · Updated 3 weeks ago
- ☆10 · Nov 6, 2024 · Updated last year
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering ☆13 · Aug 22, 2023 · Updated 2 years ago
- Prioritize Alignment in Dataset Distillation ☆21 · Dec 3, 2024 · Updated last year
- Official Implementation of wd1 ☆31 · Sep 25, 2025 · Updated 9 months ago
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo… ☆43 · May 8, 2026 · Updated last month
- Tooling for exact and MinHash deduplication of large-scale text datasets ☆90 · Mar 24, 2026 · Updated 3 months ago
- ☆12 · Mar 3, 2023 · Updated 3 years ago
- Towards a Unified View of Large Language Model Post-Training ☆211 · Sep 8, 2025 · Updated 9 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs ☆63 · Apr 12, 2026 · Updated 2 months ago
- This is the official repository for NeurIPS 2023 paper "Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First" ☆17 · Oct 27, 2023 · Updated 2 years ago
- ☆12 · Jul 30, 2025 · Updated 11 months ago
- Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning" ☆11 · Jul 18, 2023 · Updated 2 years ago
- Fork of Flame repo for training of some new stuff in development ☆19 · Jun 19, 2026 · Updated last week
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆29 · Oct 14, 2025 · Updated 8 months ago
- tuimorphic choose-your-own-adventure story game ☆20 · Apr 30, 2026 · Updated 2 months ago
- ☆24 · Dec 18, 2025 · Updated 6 months ago
- lol ☆10 · Mar 12, 2021 · Updated 5 years ago