mkantwala/DeepSeek-R1-TrainingSuite

Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fine-tuning. Open-source Apache 2.0 licensed framework for developing aligned AI systems.

☆13

Alternatives and similar repositories for DeepSeek-R1-TrainingSuite

Users who are interested in DeepSeek-R1-TrainingSuite are comparing it to the libraries listed below.

Clearedge-AI / clearedge
Build a RAG preprocessing pipeline
☆12 · Apr 7, 2024 · Updated 2 years ago
sudanl / kNN-TL
kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL 2023)
☆11 · Jul 26, 2023 · Updated 2 years ago
Satarupa22-SD / Cryptocurrency-Sentiment-Analysis
This repository has been created as part of the kaggleXBIPOC Mentorship Program. The aim of this project is to establish the sentiment a…
☆12 · Mar 18, 2023 · Updated 3 years ago
YROOM / hydro-model-xaj
Xin'anjiang hydrological model
☆16 · Aug 9, 2020 · Updated 5 years ago
Ah-miu / Ah-miu.github.io
☆18 · Jul 7, 2026 · Updated 2 weeks ago
sileod / llm-theory-of-mind
Testing Theory of Mind (ToM) in language models with epistemic logic
☆22 · Jul 3, 2026 · Updated 3 weeks ago
yash9439 / Detectron-Layout-Parser
This code performs PDF layout analysis and optical character recognition (OCR) using the layoutparser library and Tesseract OCR Engine. I…
☆22 · Jun 12, 2023 · Updated 3 years ago
Vaelek / node-google-shared-locations
Google Shared Locations provides a NodeJS interface for reading location information from people who share theirs with you.
☆19 · Dec 19, 2018 · Updated 7 years ago
wangqinsi1 / CoreInfer
This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…
☆18 · Oct 25, 2024 · Updated last year
Yuzhe-Fu / FlashFPS
[DAC 2026] FlashFPS
☆15 · Jun 1, 2026 · Updated last month
SerenaFarina / Pan-Tompkins_Python
A simple Python implementation of the Pan-Tompkins algorithm for QRS complex detection
☆12 · Jul 21, 2016 · Updated 10 years ago
wangqinsi1 / 2025-ICML-CoreMatching
[ICML 2025] CoreMatching: Co-adaptive Sparse Inference Framework for Comprehensive Acceleration of Vision Language Model
☆16 · May 27, 2025 · Updated last year
HankYe / KVCOMM
[NeurIPS'25] KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
☆17 · Nov 1, 2025 · Updated 8 months ago
shire-lang / shire-compiler
Empowering everyone to create reliable and safe AI coding agents.
☆12 · Sep 2, 2024 · Updated last year
Ah-miu / Dobi-SVD.page
"Knock, knock!" "Who's there?" "Dobi."
☆17 · Aug 11, 2025 · Updated 11 months ago
zcccccz / Awesome-LLM-Implicit-Reasoning
Papers on Implicit Reasoning in LLMs.
☆25 · Mar 13, 2025 · Updated last year
cordercorder / knn-models
A retrieval-augmented sequence modeling toolkit based on Fairseq
☆29 · Mar 3, 2023 · Updated 3 years ago
linyueqian / HippoMM
HippoMM: Hippocampal-inspired Multimodal Memory
☆22 · May 22, 2025 · Updated last year
T2S-Bench / T2S-Bench
This is the official implementation of T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasonin…
☆24 · Mar 5, 2026 · Updated 4 months ago
Yuzhe-Fu / FractalCloud
[HPCA 2026] FractalCloud: A Fractal-Inspired Architecture for Efficient Large-Scale Point Cloud Processing
☆22 · Apr 21, 2026 · Updated 3 months ago
renato145 / RpSalWeaklyDet
Code for the paper "Region Proposals for Saliency Map Refinement for Weakly-supervised Disease Localisation and Classification"
☆14 · Jun 29, 2021 · Updated 5 years ago
haoyangliu123 / awesome-deepseek-r1
A collection of recent reproduction papers and projects on DeepSeek-R1
☆31 · Feb 27, 2025 · Updated last year
labcin-ufes / PAD-UFES-20
☆16 · Mar 3, 2023 · Updated 3 years ago
bbd03 / check-swear
A robust AI library for detecting profanity in the Russian language (regex/SVM based), a library for detecting profane words in the Russian langu…
☆38 · Mar 9, 2024 · Updated 2 years ago
PaperReviewer / PaperReviewer.github.io
A learning platform that teaches you how to review
☆17 · Oct 20, 2022 · Updated 3 years ago
localauthor / .emacs.d
☆11 · Jul 9, 2026 · Updated 2 weeks ago
FederatedAI / InterOp
Repository for Interoperability of FATE
☆12 · Dec 31, 2025 · Updated 6 months ago
GaryJurgens / ProjectIndexer
This is a lightweight script designed to index the structure of a project by identifying the locations of classes, functions, and files. …
☆56 · Apr 18, 2025 · Updated last year
LiYouBioinfo / DELTA-Toolkit
DNA Encoded Library (DEL) data analysis toolkit
☆10 · Jun 12, 2025 · Updated last year
Qi-Pang / MPCDiff
This repository contains the evaluation code for the NDSS 2024 paper: MPCDIFF: Testing and Repairing MPC-Hardened Deep Learning Models.
☆16 · Sep 5, 2023 · Updated 2 years ago
azuredsky / retinaface_pt
☆11 · Nov 6, 2019 · Updated 6 years ago
dnhkng / PCAonGPU
A GPU-based Incremental PCA implementation.
☆32 · Feb 18, 2025 · Updated last year
leonardltk / Shazam-An-Industrial-Strength-Audio-Search-Algorithm-
Detects which song in a database a segment belongs to, and returns Nil if it does not exist in the database.
☆22 · May 13, 2021 · Updated 5 years ago
Macintoshxz / books
☆14 · Sep 25, 2021 · Updated 4 years ago
yash9439 / codetoprompt
Transform any codebase, web page, or document into an optimized LLM prompt. CodeToPrompt intelligently compresses code and filters conten…
☆49 · Apr 7, 2026 · Updated 3 months ago
mathiasdahl / shell-underscore
Add _ as a shorthand in shell mode for the last shell output
☆16 · Aug 30, 2022 · Updated 3 years ago
Bomps4 / Multi_Resolution_Rescored_ByteTrack
☆11 · Mar 30, 2026 · Updated 3 months ago
KeyWeeUsr / typewriter-roll-mode
Aid for distraction-free writing
☆15 · Jul 18, 2025 · Updated last year
rob137 / Corsair
LLM utils for Emacs
☆21 · May 5, 2026 · Updated 2 months ago