☆142Sep 29, 2025Updated 5 months ago
Alternatives and similar repositories for RSA
Users that are interested in RSA are comparing it to the libraries listed below
Sorting:
- Training Proactive and Personalized LLM Agents☆100Jan 20, 2026Updated last month
- ☆45Nov 9, 2025Updated 3 months ago
- BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution☆58Oct 13, 2025Updated 4 months ago
- ☆27Oct 15, 2025Updated 4 months ago
- Fork of cma-es library by Nikolaus Hansen☆12Jul 12, 2017Updated 8 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- Forcing Diffuse Distributions out of Language Models☆18Sep 10, 2024Updated last year
- NeuroBLAST v3 architecture code☆36Jan 6, 2026Updated last month
- Agent skill that stress-tests technical plans — verifies claims against real docs, runs POCs, updates the plan before you build☆34Feb 20, 2026Updated last week
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- Perf monitoring CLI tool for Apple Silicon☆16Jan 1, 2024Updated 2 years ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆184Jan 12, 2026Updated last month
- Codes for the paper "Optimizing Mode Connectivity via Neuron Alignment" from NeurIPS 2020.☆16Dec 10, 2020Updated 5 years ago
- ☆53Jan 23, 2026Updated last month
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆45Jan 6, 2026Updated last month
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆39Dec 27, 2022Updated 3 years ago
- Official implementation for the paper: "Shallow Updates for Deep Reinforcement Learning"☆18Nov 2, 2017Updated 8 years ago
- DSPy module for OpenAI Codex SDK - signature-driven agentic workflows☆153Dec 8, 2025Updated 2 months ago
- ☆22Mar 4, 2025Updated 11 months ago
- NeurIPS 2018. Linear-time model comparison tests.☆18Feb 15, 2020Updated 6 years ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones☆64Jan 26, 2026Updated last month
- rl from zero pretrain, can it be done? yes.☆287Sep 28, 2025Updated 5 months ago
- ☆17Jul 3, 2017Updated 8 years ago
- A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit☆27Dec 5, 2024Updated last year
- ☆37May 15, 2025Updated 9 months ago
- Extra IO support for the Point Cloud Library (E57, ptx, LAS...)☆31May 11, 2015Updated 10 years ago
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆28Jan 28, 2024Updated 2 years ago
- ☆29Oct 3, 2022Updated 3 years ago
- Influence Estimation for Gradient-Boosted Decision Trees☆29May 27, 2024Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Sep 17, 2025Updated 5 months ago
- Code for paper "Optima: Optimizing Effectiveness and Efficiency for LLM-Based Multi-Agent System"☆69Nov 14, 2024Updated last year
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- Forked from https://gitlab.com/MatejB/PrePoMax☆13Jan 8, 2024Updated 2 years ago
- Imitation and relaxation reinforcement learning☆29Sep 26, 2022Updated 3 years ago
- The official github repo for "Diffusion Language Models are Super Data Learners".☆223Nov 6, 2025Updated 3 months ago
- ☆28Apr 28, 2019Updated 6 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆31Apr 28, 2023Updated 2 years ago
- ☆130Oct 1, 2024Updated last year
- [ICLR 2026] Learning to Reason without External Rewards☆392Jan 26, 2026Updated last month