Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fine-tuning. Open-source Apache 2.0 licensed framework for developing aligned AI systems.
☆13Jan 29, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-TrainingSuite
Users who are interested in DeepSeek-R1-TrainingSuite are comparing it to the libraries listed below.
- Build a RAG preprocessing pipeline☆12Apr 7, 2024Updated 2 years ago
- kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023)☆11Jul 26, 2023Updated 2 years ago
- This repository has been created as part of the kaggleXBIPOC Mentorship Program. The aim of this project is to establish the sentiment a…☆12Mar 18, 2023Updated 3 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 3 years ago
- Falling Pickaxe Game inspired by YouTube Shorts livestreams.☆58Feb 7, 2026Updated 4 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆31Feb 27, 2025Updated last year
- ☆11Jun 18, 2026Updated 2 weeks ago
- a robust AI library for detecting profanity in the Russian language (regex/SVM based); a library for detecting obscene words in the Russian…☆38Mar 9, 2024Updated 2 years ago
- ☆11Nov 6, 2019Updated 6 years ago
- ☆13Sep 25, 2021Updated 4 years ago
- Add _ as a shorthand in shell mode for the last shell output☆16Aug 30, 2022Updated 3 years ago
- Run TFLITE models on the web☆13Jan 2, 2022Updated 4 years ago
- Some tips on using stable diffusion inpainting with diffusers☆14Jul 19, 2023Updated 2 years ago
- ☆11Mar 30, 2026Updated 3 months ago
- Unofficial Implementation of Latent Diffusion Models for Layout-to-image Generation☆12Nov 10, 2022Updated 3 years ago
- Stable Diffusion Gradio GUI☆18Aug 30, 2022Updated 3 years ago
- ☆102May 28, 2025Updated last year
- This plugin adds a basic main, options and pause menu to your godot project.☆75Jul 14, 2023Updated 2 years ago
- Keras image classification, grad cam, tf.data + tf.keras☆13Nov 28, 2018Updated 7 years ago
- FreeRTOS-Sim (FreeRTOS POSIX port), and collection of stubbed esp-idf components to allow running esp-idf apps on POSIX OSes (macOS)☆11Sep 21, 2018Updated 7 years ago
- Improved two-stage multithreshold Otsu method.☆11Mar 11, 2019Updated 7 years ago
- A small image annotation tool for FasterRCNN (used in the author's own project to annotate pills)☆13Mar 29, 2018Updated 8 years ago
- vehicle detection by HOG and color features☆13Feb 23, 2017Updated 9 years ago
- Annotations and study notes for CornerNet-Lite☆12May 28, 2019Updated 7 years ago
- A general purpose library for training any type of GPT model.☆12Jun 13, 2023Updated 3 years ago
- Implementation of Faster RCNN for Vehicle Detection☆16May 17, 2017Updated 9 years ago
- Expo React-Native mobile app using the Forge Reality Capture API☆12Oct 15, 2025Updated 8 months ago
- ☆15May 13, 2024Updated 2 years ago
- TensorBasicModel☆18Aug 31, 2017Updated 8 years ago
- Official Implementation for "Age-Dependent Face Diversification via Latent Space Analysis" (CGI2023)☆15Jan 7, 2025Updated last year
- C3AE:exploring the limits of compact model for age estimation☆13Dec 4, 2019Updated 6 years ago
- Just for debug☆57Feb 15, 2024Updated 2 years ago
- ☆18May 30, 2023Updated 3 years ago
- a user interface, infinite whiteboard with interactive elements, zoom while drawing, infiniteCanvas, fabricjs infinitewhiteboard, fabric …☆18May 20, 2025Updated last year
- ONNX models of YOLO-World (an open-vocabulary object detection).☆28Jun 29, 2024Updated 2 years ago
- Tensorflow Object Detection API☆13Sep 8, 2017Updated 8 years ago
- ☆74Apr 2, 2024Updated 2 years ago
- Ongoing research training transformer models at scale☆49Updated this week
- Documentation for Bert-VITS2☆21Nov 29, 2023Updated 2 years ago