FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
☆63Jan 26, 2026Updated last month
Alternatives and similar repositories for RL-Compositionality
Users that are interested in RL-Compositionality are comparing it to the libraries listed below
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 2 weeks ago
- ☆67Feb 13, 2026Updated 2 weeks ago
- ☆25Oct 15, 2025Updated 4 months ago
- Evolutionary Quantitative Trading Strategy Development System. Fork of OpenEvolve☆33May 30, 2025Updated 9 months ago
- 🔥 [ICLR 2025] Official PyTorch Model "Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark"☆26Feb 9, 2025Updated last year
- Benchmarking Optimizers for LLM Pretraining☆52Dec 30, 2025Updated 2 months ago
- ☆60Jan 12, 2026Updated last month
- Tooling for exact and MinHash deduplication of large-scale text datasets☆72Feb 19, 2026Updated last week
- AHN: Artificial Hippocampus Networks for Efficient Long-Context Modeling☆170Oct 17, 2025Updated 4 months ago
- ☆37Sep 21, 2025Updated 5 months ago
- [ICLR 2026] dParallel: Learnable Parallel Decoding for dLLMs☆59Updated this week
- [CVPR2024 highlight] Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching (G-VBSM)☆28Oct 9, 2024Updated last year
- [ACL 2025] How Do LLMs Acquire New Knowledge? A Knowledge Circuits Perspective on Continual Pre-Training☆47Jul 18, 2025Updated 7 months ago
- Next-Toggle is a simple plug-and-use theme toggle button with multiple light and dark themes.☆11May 9, 2024Updated last year
- ☆33Jan 7, 2025Updated last year
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆433Feb 19, 2026Updated last week
- Solving Inequality Proofs with Large Language Models.☆57Dec 15, 2025Updated 2 months ago
- Creates a CMM script that can be directly executed on Kaggle from an easy merge script☆14Jan 12, 2026Updated last month
- ☆10Apr 26, 2023Updated 2 years ago
- Visualize ONNX models with model-explorer☆69Feb 13, 2026Updated 2 weeks ago
- ☆12Oct 29, 2024Updated last year
- my first ever browser game☆10Jun 21, 2025Updated 8 months ago
- GALL.AI (prev. Generall.AI) - Telegram Advanced AI Agent System Chat Bot☆14Feb 7, 2026Updated 3 weeks ago
- Integrating neurosymbolic representations into LLMs for interpretability, steering, and running symbolic algorithms☆14Feb 2, 2026Updated 3 weeks ago
- ☆12Apr 14, 2025Updated 10 months ago
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- ☆36Feb 13, 2026Updated 2 weeks ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- MVP for updated PEP 543 proposal☆14Feb 13, 2026Updated 2 weeks ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Updated this week
- lncRNA-Py is a development package for applying machine learning and deep learning to the problem of lncRNA classification, i.e. predicti…☆12Jan 24, 2025Updated last year
- ☆12Mar 3, 2023Updated 2 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- OWASP Zed Attack Proxy plugin for py.test☆13Sep 10, 2015Updated 10 years ago
- ☆16Feb 18, 2024Updated 2 years ago
- Commandline utility for OSX that reloads the frontmost browser tab☆11Jan 18, 2016Updated 10 years ago
- Write SQL-like queries over JavaScript data structures☆10Jan 30, 2020Updated 6 years ago
- Langchain + Docker + Neo4j☆10Oct 29, 2024Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year