Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fine-tuning. Open-source Apache 2.0 licensed framework for developing aligned AI systems.
☆13Jan 29, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-TrainingSuite
Users that are interested in DeepSeek-R1-TrainingSuite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Testing Theory of Mind (ToM) in language models with epistemic logic☆22Dec 13, 2023Updated 2 years ago
- Empowering everyone to create reliable and safety AI coding agent.☆12Sep 2, 2024Updated last year
- A simple Python implementation of Pan-Tompkins algorithm for QRS complex detection☆12Jul 21, 2016Updated 9 years ago
- Code for paper: "Region Proposals for Saliency Map Refinement for Weakly-supervised Disease Localisation and Classification"☆14Jun 29, 2021Updated 4 years ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Feb 27, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆11Sep 17, 2024Updated last year
- 一个教你如何Review的学习平台☆17Oct 20, 2022Updated 3 years ago
- Repository for Interoperability of FATE☆12Dec 31, 2025Updated 2 months ago
- ☆15Mar 3, 2023Updated 3 years ago
- ☆12Sep 25, 2021Updated 4 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆24Aug 11, 2025Updated 7 months ago
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database.☆22May 13, 2021Updated 4 years ago
- Add _ as a shorthand in shell mode for the last shell output☆16Aug 30, 2022Updated 3 years ago
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated 2 months ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Run TFLITE models on the web☆12Jan 2, 2022Updated 4 years ago
- Can stable and accurate neural networks be computed? - On the barriers of deep learning and Smale's 18th problem☆14Apr 6, 2021Updated 4 years ago
- HIGHFLIP: An easy way to bridge different federal learning platforms☆21Aug 31, 2023Updated 2 years ago
- ☆10Mar 14, 2025Updated last year
- AF Classification from a short single lead ECG recording: the PhysioNet/Computing in Cardiology Challenge 2017☆26Jan 8, 2019Updated 7 years ago
- Implement llm model in pytorch, support MoE and RoPE☆56Mar 18, 2026Updated last week
- A ImHex plugin to ask the almighty Oracle (OpenAI's Davinci AI) for help identifying file formats☆19Dec 4, 2022Updated 3 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- This repository demonstrates how to leverage OpenAI's GPT-4 models with JSON Strict Mode to extract structured data from web pages. It c…☆20Aug 14, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A front-end for the mwmbl search engine written in vanilla javascript☆13Oct 10, 2023Updated 2 years ago
- A Minimalistic Auto-Diff Optimization Framework for Teaching and Understanding Pytorch☆27Mar 12, 2026Updated last week
- Ongoing research training transformer models at scale☆44Updated this week
- Port of Helix MP3 code to ESP8266☆14Dec 8, 2017Updated 8 years ago
- Expo React-Native mobile app using the Forge Reality Capture API☆12Oct 15, 2025Updated 5 months ago
- Just for debug☆57Feb 15, 2024Updated 2 years ago
- a user interface, infinite whiteboard with interactive elements, zoom while drawing, infiniteCanvas, fabricjs infinitewhiteboard, fabric …☆17May 20, 2025Updated 10 months ago
- [DEPRECTED]知道创宇校招面试题☆10Nov 30, 2017Updated 8 years ago
- 隐私计算 Hackathon | Data Privacy Protect Hackathon website☆31Nov 7, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- ☆69Nov 23, 2025Updated 4 months ago
- Key 3d computer vision math, concepts, and code☆14Dec 10, 2020Updated 5 years ago
- Long CoT Fine-Tuning and Reinforcement Learning for LLMs in the Context of the 24-Point Game: A Toy Project☆25Feb 22, 2025Updated last year
- 基于可编程不经意伪随机数的多方隐私求交算法库 Programmable Oblivious PRF & multi-party PSI☆25Jul 15, 2022Updated 3 years ago
- The Shazam of animals. 🐶 | Sample app to demonstrate ML Kit Image Labeling☆10Oct 29, 2022Updated 3 years ago
- An open source Java implementation to interpret and render Computer Graphics Metafile (CGM) graphics files.☆15Jun 20, 2025Updated 9 months ago