FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆65Jan 26, 2026Updated 3 months ago
Alternatives and similar repositories for RL-Compositionality
Users that are interested in RL-Compositionality are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- ☆70Updated this week
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?☆43Jul 26, 2025Updated 9 months ago
- Benchmarking Optimizers for LLM Pretraining☆57Dec 30, 2025Updated 4 months ago
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 10 months ago
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Oct 9, 2024Updated last year
- Toolathlon-Gym for testing AI agents real-world tool-use capabilities across diverse MCP servers.☆116Apr 2, 2026Updated 3 weeks ago
- ☆13Nov 21, 2025Updated 5 months ago
- ☆33Jan 7, 2025Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024)☆12Oct 31, 2024Updated last year
- Flax (JAX) implementation of Progressive Growing of GANs for Improved Quality, Stability, and Variation☆12May 24, 2021Updated 4 years ago
- ROS2 Bag file parsing☆10Mar 14, 2020Updated 6 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 8 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'.☆21Apr 3, 2025Updated last year
- ☆64Mar 30, 2026Updated last month
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod…☆124Jan 10, 2026Updated 3 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- ☆33Oct 15, 2025Updated 6 months ago
- ☆29Updated this week
- Official Repository of Native Parallel Reasoner☆107Feb 5, 2026Updated 2 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆223Nov 27, 2025Updated 5 months ago
- We introduce new approach, Token Reduction using CLIP Metric (TRIM), aimed at improving the efficiency of MLLMs without sacrificing their…☆22Jan 11, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Oct 10, 2025Updated 6 months ago
- ☆10Nov 6, 2024Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data☆64Feb 24, 2023Updated 3 years ago
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering☆13Aug 22, 2023Updated 2 years ago
- AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling☆174Oct 17, 2025Updated 6 months ago
- Prioritize Alignment in Dataset Distillation☆21Dec 3, 2024Updated last year
- Official Implementation of wd1☆28Sep 25, 2025Updated 7 months ago
- Tooling for exact and MinHash deduplication of large-scale text datasets☆80Mar 24, 2026Updated last month
- A meta-repo that watches karpathy/autoresearch and adjacent systems, distills portable patterns for bounded agent-verifier research lo…☆43Mar 11, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Towards a Unified View of Large Language Model Post-Training☆209Sep 8, 2025Updated 7 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆61Apr 12, 2026Updated 2 weeks ago
- [NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation☆22Dec 17, 2024Updated last year
- This is the official repository for NeurIPS 2023 paper "Curriculum Learning for Graph Neural Networks: Which Edges Should We Learn First"☆17Oct 27, 2023Updated 2 years ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆84Dec 29, 2025Updated 4 months ago
- ☆12Jul 30, 2025Updated 9 months ago
- Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning"☆11Jul 18, 2023Updated 2 years ago