dinobby / Symbolic-MoELinks
The code implementation of Symbolic-MoE
☆43Updated 2 months ago
Alternatives and similar repositories for Symbolic-MoE
Users that are interested in Symbolic-MoE are comparing it to the libraries listed below
Sorting:
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
 - [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 5 months ago
 - ☆215Updated 2 weeks ago
 - [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆66Updated 6 months ago
 - SSRL: Self-Search Reinforcement Learning☆148Updated 2 months ago
 - [ACL 2025] An inference-time decoding strategy with adaptive foresight sampling☆106Updated 5 months ago
 - [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆75Updated 4 months ago
 - SIFT: Grounding LLM Reasoning in Contexts via Stickers☆58Updated 7 months ago
 - ☆50Updated 8 months ago
 - The official implementation of Self-Exploring Language Models (SELM)☆64Updated last year
 - ☆35Updated 5 months ago
 - TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆289Updated 2 weeks ago
 - [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆82Updated last year
 - ☆133Updated last month
 - ☆101Updated 3 weeks ago
 - RL Scaling and Test-Time Scaling (ICML'25)☆111Updated 9 months ago
 - ☆63Updated 4 months ago
 - Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆265Updated last month
 - Chain of Experts (CoE) enables communication between experts within Mixture-of-Experts (MoE) models☆222Updated last month
 - Discriminative Constrained Optimization for Reinforcing Large Reasoning Models☆39Updated last week
 - Geometric-Mean Policy Optimization☆88Updated 2 weeks ago
 - [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆114Updated 5 months ago
 - Process Reward Models That Think☆60Updated 2 weeks ago
 - Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆132Updated last year
 - ☆108Updated last year
 - [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆176Updated 3 months ago
 - ☆129Updated 7 months ago
 - [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆31Updated 2 months ago
 - [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆159Updated last month
 - ☆100Updated last month