FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆68 · Jan 26, 2026 · Updated 4 months ago
Alternatives and similar repositories for RL-Compositionality
Users who are interested in RL-Compositionality are comparing it to the libraries listed below.
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions". ☆17 · Feb 9, 2026 · Updated 4 months ago
- ☆69 · May 26, 2026 · Updated 2 weeks ago
- Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image? ☆43 · Jul 26, 2025 · Updated 10 months ago
- Benchmarking Optimizers for LLM Pretraining ☆60 · May 3, 2026 · Updated last month
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM) ☆27 · Oct 9, 2024 · Updated last year
- ☆13 · Nov 21, 2025 · Updated 6 months ago
- Toolathlon-Gym for testing AI agents' real-world tool-use capabilities across diverse MCP servers. ☆130 · Apr 2, 2026 · Updated 2 months ago
- ☆33 · Jan 7, 2025 · Updated last year
- Official Code for What Makes and Breaks Safety Fine-tuning? A Mechanistic Study (NeurIPS 2024) ☆12 · Oct 31, 2024 · Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆523 · Apr 14, 2026 · Updated last month
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training ☆48 · Jul 18, 2025 · Updated 10 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation ☆29 · Aug 19, 2025 · Updated 9 months ago
- Code of EMNLP 2025 paper 'UltraIF: Advancing Instruction Following from the Wild'. ☆21 · Apr 3, 2025 · Updated last year
- ☆64 · Mar 30, 2026 · Updated 2 months ago
- Official PyTorch implementation and models for paper "Diffusion Beats Autoregressive in Data-Constrained Settings". We find diffusion mod… ☆127 · Jan 10, 2026 · Updated 5 months ago
- 6,080-param transformer achieving 100% accuracy on 10-digit addition. Trained from scratch in 10 minutes. ☆22 · Feb 19, 2026 · Updated 3 months ago
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning" ☆11 · Oct 11, 2024 · Updated last year
- ☆33 · Oct 15, 2025 · Updated 7 months ago
- ☆30 · Apr 28, 2026 · Updated last month
- System behavior is often expressed by causal relations in requirements (e.g. if event 1 then event 2). Automatically extracting this embe… ☆12 · Oct 24, 2021 · Updated 4 years ago
- [ICML 2026] Reasoning in Parallelism via Self-Distilled RL ☆112 · Feb 5, 2026 · Updated 4 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25] ☆226 · Nov 27, 2025 · Updated 6 months ago
- ☆10 · Nov 6, 2024 · Updated last year
- Towards Better Graph Representation Learning with Parameterized Decomposition & Filtering ☆13 · Aug 22, 2023 · Updated 2 years ago
- Prioritize Alignment in Dataset Distillation ☆21 · Dec 3, 2024 · Updated last year
- AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling ☆178 · Oct 17, 2025 · Updated 7 months ago
- Official Implementation of wd1 ☆30 · Sep 25, 2025 · Updated 8 months ago
- Tooling for exact and MinHash deduplication of large-scale text datasets ☆85 · Mar 24, 2026 · Updated 2 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs ☆64 · Apr 12, 2026 · Updated last month
- ☆32 · Nov 30, 2025 · Updated 6 months ago
- Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning" ☆11 · Jul 18, 2023 · Updated 2 years ago
- ☆24 · Dec 18, 2025 · Updated 5 months ago
- The official GitHub repo for "Diffusion Language Models are Super Data Learners". ☆228 · Nov 6, 2025 · Updated 7 months ago
- Repo of paper "Free Process Rewards without Process Labels" ☆171 · Mar 14, 2025 · Updated last year
- In-Context Reinforcement Learning for Tool Use in Large Language Models ☆48 · Mar 26, 2026 · Updated 2 months ago
- Code for "Evidence of Learned Look-Ahead in a Chess-Playing Neural Network" ☆29 · Jun 4, 2024 · Updated 2 years ago
- Library that provides metrics to assess representation quality ☆27 · Feb 5, 2025 · Updated last year
- ☆15 · Apr 26, 2025 · Updated last year
- ☆26 · Feb 20, 2026 · Updated 3 months ago