HyperPotatoNeo/RSA

☆153

Alternatives and similar repositories for RSA

Users who are interested in RSA are comparing it to the repositories listed below.

rsa-llm / RSA-ARC
Recursive Self-Aggregation evals on ARC-AGI
☆36 • Jan 26, 2026 • Updated 5 months ago
roger-creus / Wave-Defense-Learning-Environment
A video game made with PyGame turned into an OpenAI Gym learning environment for reinforcement learning agents.
☆14 • Jan 3, 2023 • Updated 3 years ago
halfprice06 / rlmgrep
☆66 • Feb 14, 2026 • Updated 5 months ago
sail-sg / variational-reasoning
Code for "Variational Reasoning for Language Models"
☆60 • Sep 29, 2025 • Updated 9 months ago
Andrewzh112 / AI-Research-Interview-Lab
☆31 • Nov 14, 2025 • Updated 8 months ago
dspy-community / dspy-template-adapter
A DSPy Adapter for exact-fidelity prompt templates with full control over messages.
☆51 • Feb 23, 2026 • Updated 5 months ago
olivkoch / TinyRecursiveModels
☆35 • Nov 11, 2025 • Updated 8 months ago
test-time-training / e2e
Official JAX implementation of End-to-End Test-Time Training for Long Context
☆625 • Feb 15, 2026 • Updated 5 months ago
michaelbzhu / lora-without-regret
☆47 • Oct 23, 2025 • Updated 9 months ago
Archelunch / dspy-repl
☆46 • Feb 20, 2026 • Updated 5 months ago
ewang26 / HorizonMath
A benchmark to measure AI progress on unsolved research problems in mathematics.
☆28 • May 6, 2026 • Updated 2 months ago
stanford-oval / sliders
Repository for paper: Contexts are Never Long Enough: Structured Reasoning for Scalable Question Answering over Long Document Sets
☆27 • Apr 27, 2026 • Updated 2 months ago
planned-diffusion / planned-diffusion
☆20 • Nov 14, 2025 • Updated 8 months ago
GBATZOLIS / BitstreamDiffusion
☆15 • Updated this week
alexzhang13 / rlm-minimal
Super basic implementation (gist-like) of RLMs with REPL environments.
☆822 • Jan 7, 2026 • Updated 6 months ago
liushulinle / UloRL
An Ultra-Long Output Reinforcement Learning Approach
☆23 • Jul 31, 2025 • Updated 11 months ago
zafstojano / policy-gradients
A minimal hackable implementation of policy gradient methods (GRPO, PPO, REINFORCE)
☆16 • Feb 20, 2026 • Updated 5 months ago
Chengsong-Huang / G-Zero
☆25 • May 14, 2026 • Updated 2 months ago
McGill-NLP / the-markovian-thinker
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆350 • Mar 16, 2026 • Updated 4 months ago
hallerite / ludic
Ludic – an LLM-RL library for the era of experience
☆67 • Jan 9, 2026 • Updated 6 months ago
open-thought / reasoning-gym
[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards
☆1,464 • Apr 17, 2026 • Updated 3 months ago
sandyresearch / parcae
Stable Looped Models and their Scaling Laws
☆171 • May 17, 2026 • Updated 2 months ago
PrimeIntellect-ai / renderers
Programmable chat templates for LLM training and inference.
☆134 • Updated this week
AlexGoldie / discogen
Official implementation of DiscoGen, for "Procedural Generation of Algorithm Discovery Tasks in Machine Learning"
☆48 • Jul 2, 2026 • Updated 3 weeks ago
alexzhang13 / rlm
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆5,312 • Jun 26, 2026 • Updated 3 weeks ago
ESHyperscale / HyperscaleES
Jax Codebase for Evolutionary Strategies at the Hyperscale
☆350 • Feb 27, 2026 • Updated 4 months ago
MasterVito / SwS
Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning
☆42 • Nov 11, 2025 • Updated 8 months ago
uridr / GTWiki
Dataset for the paper: "A multi-task semi-supervised framework for Text2Graph & Graph2Text"
☆25 • Feb 19, 2022 • Updated 4 years ago
alon-albalak / online-data-mixing
An implementation of online data mixing for the Pile dataset, based on the GPT-NeoX library.
☆14 • Jan 9, 2024 • Updated 2 years ago
sparkle-reasoning / sparkle
[NeurIPS'25] Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning
☆16 • Dec 12, 2025 • Updated 7 months ago
multimodal-art-projection / TreePO
☆65 • Mar 30, 2026 • Updated 3 months ago
BaohaoLiao / frac-cot
[COLM 2026] An efficient 3D sampling method for long-CoT LLM.
☆16 • May 25, 2025 • Updated last year
HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆305 • Mar 23, 2026 • Updated 4 months ago
PrimeIntellect-ai / prime-rl
Agentic RL Training at Scale
☆1,714 • Updated this week
Frostlinx / Socratic-Zero
Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning
☆37 • Oct 26, 2025 • Updated 8 months ago
ypwang61 / ThetaEvolve
ThetaEvolve: Test-time Learning on Open Problems, enabling RL training on AlphaEvolve/OpenEvolve and emphasizing scaling test-time comput…
☆170 • Feb 27, 2026 • Updated 4 months ago
microsoft / post-training-toolkit
☆25 • Jan 28, 2026 • Updated 5 months ago
thepowerfuldeez / sample_efficient_gpt
Training framework with a goal to explore the frontier of sample efficiency of small language models
☆101 • Jan 25, 2026 • Updated 5 months ago
seal-rg / streaming
Code for the paper Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
☆63 • Jun 23, 2026 • Updated last month