subconscious-systems / subconscious
☆66 Updated this week
Alternatives and similar repositories for subconscious
Users who are interested in subconscious are comparing it to the libraries listed below.
- Esoteric Language Models ☆109 Updated 2 months ago
- ☆91 Updated last year
- ☆63 Updated 7 months ago
- Resa: Transparent Reasoning Models via SAEs ☆47 Updated 4 months ago
- ☆29 Updated 2 months ago
- Official repo of paper LM2 ☆46 Updated 11 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code" ☆68 Updated 9 months ago
- ☆71 Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 Updated last month
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆102 Updated 4 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation ☆61 Updated 2 weeks ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper ☆46 Updated 5 months ago
- ☆112 Updated last year
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond ☆190 Updated 6 months ago
- This is the official implementation for paper "PENCIL: Long Thoughts with Short Memory". ☆69 Updated 8 months ago
- A repository for research on medium sized language models. ☆77 Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) ☆64 Updated this week
- Ring-V2 is a reasoning MoE LLM provided and open-sourced by InclusionAI. ☆89 Updated 3 months ago
- FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones ☆57 Updated this week
- Defeating the Training-Inference Mismatch via FP16 ☆180 Updated 2 months ago
- [ACL 2025] Agentic Knowledgeable Self-awareness ☆91 Updated 7 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge ☆94 Updated 2 weeks ago
- [ICML 2025] From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories and Applications ☆52 Updated 3 months ago
- ☆85 Updated 2 months ago
- PeRL: Parameter-Efficient Reinforcement Learning ☆67 Updated last week
- SSRL: Self-Search Reinforcement Learning ☆205 Updated 5 months ago
- ☆88 Updated 3 months ago
- The official implementation of Self-Exploring Language Models (SELM) ☆63 Updated last year
- Process Reward Models That Think ☆77 Updated 2 months ago
- ☆110 Updated 4 months ago