tpoisonooo/open-r1

Fully open reproduction of DeepSeek-R1

☆11

Alternatives and similar repositories for open-r1

Users who are interested in open-r1 are comparing it to the libraries listed below.

shijunbao / prompt-manager
Centrally manage all your prompts.
☆14 · Nov 27, 2024 · Updated last year
thu-ml / Efficient-Diffusion-Alignment
Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)
☆15 · Oct 29, 2024 · Updated last year
JJXiangJiaoJun / cutlass_gemv
GEMV implementation with CUTLASS
☆21 · Aug 21, 2025 · Updated 10 months ago
hilllief / polarquant-kv
LLM KV Cache compression - K+V dual compression, 73-99% VRAM savings, zero accuracy loss
☆57 · Mar 30, 2026 · Updated 3 months ago
sands321 / znote
🖖 A graph-based note-taking system that aims to increase how often you actually use your personal notes!
☆11 · Jan 17, 2021 · Updated 5 years ago
LLM360 / TxT360
☆25 · Dec 18, 2024 · Updated last year
ScottishFold007 / The-Accidental-CTO_Chinese-Version
[Ebook] From Zero to a Million Stores: A Hands-On System Design Journey by an Ordinary Person Without a Computer Science Degree
☆27 · Nov 11, 2025 · Updated 7 months ago
yoyoberenguer / Sobel-Feldman
Sobel–Feldman, Prewitt, Canny filter
☆19 · Nov 9, 2019 · Updated 6 years ago
ronsavage / Regexp-Assemble
Assemble multiple Regular Expressions into a single RE
☆15 · Nov 24, 2023 · Updated 2 years ago
lcqysl / VideoSSR
[CVPR 2026] Official repo for "VideoSSR: Video Self-Supervised Reinforcement Learning"
☆40 · Nov 11, 2025 · Updated 7 months ago
GAIR-NLP / self-improvement-reversal
☆13 · Jul 14, 2024 · Updated last year
BUPT-GAMMA / CITGNN
☆10 · Dec 26, 2023 · Updated 2 years ago
Mr-hongji / PyQt5VideoPlayer
A video player based on PyQt5
☆12 · Jun 23, 2019 · Updated 7 years ago
Freder-chen / ReasonGenRM
A simple implementation of ReasonGenRM.
☆19 · Apr 21, 2025 · Updated last year
PRIS-CV / GRPO-for-Llava
GRPO Algorithm for Llava Architecture (Based on Verl)
☆49 · May 9, 2025 · Updated last year
atfortes / LLMSymbolicReasoningBench
Synthetic data generation for evaluating LLM symbolic and logic reasoning
☆23 · Mar 6, 2026 · Updated 4 months ago
annoymity2022 / Chinese-Dataset
☆14 · Oct 12, 2024 · Updated last year
fudan-zvg / PARTNER
[ICCV 2023 & IJCV 2026] PARTNER: Level up the Polar Representation for LiDAR 3D Object Detection
☆23 · Aug 12, 2024 · Updated last year
Paul33333 / SFT-and-DPO
This is a detailed code demo on how to conduct Full-Param Supervised Fine-tuning (SFT) and DPO (Direct Preference Optimization)
☆19 · Jan 9, 2025 · Updated last year
FreedomIntelligence / Overview-of-ChatGPT
☆17 · Jan 21, 2024 · Updated 2 years ago
yzhangchuck / awesome-llm-reasoning-long2short-papers
☆17 · Apr 11, 2025 · Updated last year
sunzhihao18 / ForgerySleuth
☆30 · May 22, 2025 · Updated last year
litwellchi / M2Chat
☆36 · Feb 6, 2025 · Updated last year
QiangZiBro / Qdotfiles
🚀 Qdotfiles: A Unix configuration manager, caring for your developing environment.
☆13 · Sep 10, 2025 · Updated 9 months ago
TIGER-AI-Lab / VISTA
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20 · Feb 27, 2025 · Updated last year
junkangwu / alpha-DPO
[ICML 2025] Official code of "AlphaDPO: Adaptive Reward Margin for Direct Preference Optimization"
☆31 · Jan 10, 2026 · Updated 5 months ago
PENGUINLIONG / graphi-t
Handy tools & graphics API abstraction for blazing fast prototyping
☆10 · Jan 17, 2024 · Updated 2 years ago
abdelfattah-lab / attamba
☆13 · Nov 29, 2024 · Updated last year
OstrichAlgorithm / hasaki
A League of Legends (LoL) helper that instantly picks Yasuo
☆11 · Jun 12, 2022 · Updated 4 years ago
wutaiqiang / LLM_KD_AKL
☆22 · Oct 22, 2024 · Updated last year
PRIS-CV / EAFT
EAFT (Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo
☆106 · Jan 15, 2026 · Updated 5 months ago
SCUT-DLVCLab / AutoHDR
[ACL 2025 main] The official GitHub page of "Reviving Cultural Heritage: A Novel Approach for Comprehensive Historical Document Restorati…
☆59 · Jun 28, 2026 · Updated last week
abdelfattah-lab / TokenButler
☆27 · May 12, 2026 · Updated last month
QuasarApp / QStyleSheet
StyleSheet for Qt apps
☆17 · Jul 13, 2018 · Updated 7 years ago
Essential-AI / eai-taxonomy
☆60 · Aug 19, 2025 · Updated 10 months ago
kartikay18 / FaceGAN
Face completion using Generative Adversarial Networks
☆10 · Sep 21, 2017 · Updated 8 years ago
gabrieleilertsen / nws
Dissecting the weight space of neural networks
☆18 · Apr 16, 2021 · Updated 5 years ago
zhouqiu / SOGDet
sogdet
☆19 · Jun 17, 2024 · Updated 2 years ago
aresbit / fetch-skill
fetch-skill
☆163 · Mar 22, 2026 · Updated 3 months ago