MobileLLM/ParaThinker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MobileLLM/ParaThinker)

MobileLLM / ParaThinker

☆47

Alternatives and similar repositories for ParaThinker

Users that are interested in ParaThinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhengkid / Parallel-R1
View on GitHub
The offical repo for "Parallel-R1: Towards Parallel Thinking via Reinforcement Learning"
☆260Feb 4, 2026Updated 5 months ago
lblankl / Short-RL
View on GitHub
Short RL
☆19Apr 16, 2026Updated 3 months ago
pangjh3 / AnLLM
View on GitHub
☆20Jun 17, 2024Updated 2 years ago
caiqizh / LUQ
View on GitHub
☆14Jan 14, 2026Updated 6 months ago
ShadeCloak / ADORA
View on GitHub
☆47Apr 9, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jingtaozhan / extrapolate-eval
View on GitHub
CIKM 2022: Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models
☆10Aug 4, 2022Updated 3 years ago
shengliu66 / FractionalReason
View on GitHub
Official github repo for "Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute"
☆17Jun 30, 2025Updated last year
LGAI-Research / SetR
View on GitHub
☆28Sep 11, 2025Updated 10 months ago
zjunlp / LightThinker
View on GitHub
[EMNLP 2025] LightThinker: Thinking Step-by-Step Compression
☆165Jun 22, 2026Updated last month
idanshen / Value-Augmented-Sampling
View on GitHub
☆20May 16, 2024Updated 2 years ago
OscarXZQ / delta_activations
View on GitHub
Official code release for Delta Activations: A Representation for Finetuned Large Language Models
☆20Sep 5, 2025Updated 10 months ago
Aloriosa / srmt
View on GitHub
The original Shared Recurrent Memory Transformer implementation
☆36Jul 11, 2025Updated last year
bigai-nlco / Native-Parallel-Reasoner
View on GitHub
[ICML 2026] Reasoning in Parallelism via Self-Distilled RL
☆113Jun 28, 2026Updated 3 weeks ago
Multiverse4FM / Multiverse
View on GitHub
☆88Jun 16, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
lauyikfung / SDPG
View on GitHub
SDPG: Self-Distilled Policy Gradient
☆46Jun 15, 2026Updated last month
JackKuo666 / a_numpy_based_implement_cnn
View on GitHub
这是我的博客《不用框架，使用Python搭建基于numpy的卷积神经网络来进行cifar-10分类的深度学习系统》的代码实现。
☆10Jul 1, 2019Updated 7 years ago
KangsanKim07 / MemoryTransferLearning
View on GitHub
Memory Transfer Learning: How Memories are Transferred Across Domains in Coding Agents
☆31Apr 16, 2026Updated 3 months ago
beanie00 / self-distillation-analysis
View on GitHub
Codebase for the work “Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?”
☆74Apr 14, 2026Updated 3 months ago
kaiwenzha / RL-Tango
View on GitHub
[NeurIPS 2025] RL Tango: Reinforcing Generator and Verifier Together for Language Reasoning
☆57Oct 23, 2025Updated 9 months ago
Kordi-AI / Multi-User-LLM-Agent
View on GitHub
Official code for the paper: "Multi-User Large Language Model Agents"
☆27May 11, 2026Updated 2 months ago
miniHuiHui / SimpleRL-reason-GRPO
View on GitHub
☆12Feb 27, 2025Updated last year
liushulinle / MarsRL
View on GitHub
MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism
☆18Nov 18, 2025Updated 8 months ago
Parallel-Reasoning / APR
View on GitHub
[COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models
☆144Dec 17, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Trae1ounG / BuPO
View on GitHub
[arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
☆60Feb 6, 2026Updated 5 months ago
MrZilinXiao / ProxyThinker
View on GitHub
[ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.
☆22Sep 24, 2025Updated 9 months ago
JinjieNi / Quokka
View on GitHub
The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…
☆46Nov 6, 2025Updated 8 months ago
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆20Jul 15, 2026Updated last week
liangyupu / DIMTDA
View on GitHub
The official repository of "Document Image Machine Translation with Dynamic Multi-pre-trained Models Assembling"
☆14Nov 26, 2025Updated 7 months ago
NineAbyss / S2R
View on GitHub
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆77Apr 22, 2025Updated last year
thu-coai / CROPI
View on GitHub
[ACL'26] Official Repository for for paper "Data-Efficient RLVR via Off-Policy Influence Guidance"
☆24Mar 29, 2026Updated 3 months ago
violetxi / ExpRL
View on GitHub
☆20Jun 16, 2026Updated last month
thu-ml / Noise-Contrastive-Alignment
View on GitHub
Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)
☆59Nov 8, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
McGill-NLP / the-markovian-thinker
View on GitHub
Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
☆349Mar 16, 2026Updated 4 months ago
AI9Stars / AStar-Thought
View on GitHub
[NeurIPS 2025] A*-Thought: Efficient Reasoning via Bidirectional Compression for Low-Resource Settings
☆16Jun 12, 2026Updated last month
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
FoundationAgents / AOrchestra
View on GitHub
Automating Sub-Agent Creation for Agentic Orchestration
☆150May 25, 2026Updated last month
Dianshu-Liao / AAA-Code-Generation-Framework-for-Code-Repository-Local-Aware-Global-Aware-Third-Party-Aware
View on GitHub
☆26Dec 16, 2023Updated 2 years ago
ZJU-REAL / cooper
View on GitHub
☆29Aug 19, 2025Updated 11 months ago
AMD-AGI / PARD
View on GitHub
PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation (ICLR 26)
☆33Jun 10, 2026Updated last month