Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fine-tuning. Open-source Apache 2.0 licensed framework for developing aligned AI systems.
☆13Jan 29, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-TrainingSuite
Users that are interested in DeepSeek-R1-TrainingSuite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build a RAG preprocessing pipeline☆12Apr 7, 2024Updated 2 years ago
- HippoMM: Hippocampal-inspired Multimodal Memory☆20May 22, 2025Updated 10 months ago
- Testing Theory of Mind (ToM) in language models with epistemic logic☆22Dec 13, 2023Updated 2 years ago
- kNN-TL: k-Nearest-Neighbor Transfer Learning for Low-Resource Neural Machine Translation (ACL2023)☆11Jul 26, 2023Updated 2 years ago
- This repository has been created as part of the kaggleXBIPOC Mentorship Program. The aim of this project is to establish the sentiment a…☆11Mar 18, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Papers of Implicit Reasoning in LLMs.☆24Mar 13, 2025Updated last year
- This is the official Python version of CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Act…☆17Oct 25, 2024Updated last year
- 新安江水文模型☆16Aug 9, 2020Updated 5 years ago
- [HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …☆46Feb 8, 2026Updated 2 months ago
- This code performs PDF layout analysis and optical character recognition (OCR) using the layoutparser library and Tesseract OCR Engine. I…☆23Jun 12, 2023Updated 2 years ago
- Google Shared Locations provides a NodeJS interface to reading location information from people that share theirs with you.☆19Dec 19, 2018Updated 7 years ago
- ☆18Dec 16, 2025Updated 3 months ago
- "Knock, knock!" "Who's there?" "Dobi."☆17Aug 11, 2025Updated 8 months ago
- Empowering everyone to create reliable and safety AI coding agent.☆12Sep 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A simple Python implementation of Pan-Tompkins algorithm for QRS complex detection☆12Jul 21, 2016Updated 9 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 3 years ago
- Transform any codebase, web page, or document into an optimized LLM prompt. CodeToPrompt intelligently compresses code and filters conten…☆48Apr 7, 2026Updated last week
- Code for paper: "Region Proposals for Saliency Map Refinement for Weakly-supervised Disease Localisation and Classification"☆14Jun 29, 2021Updated 4 years ago
- Falling Pickaxe Game inspired from YouTube shorts livestreams.☆54Feb 7, 2026Updated 2 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Feb 27, 2025Updated last year
- ☆11Sep 17, 2024Updated last year
- 一个教你如何Review的学习平台☆17Oct 20, 2022Updated 3 years ago
- Repository for Interoperability of FATE☆12Dec 31, 2025Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆15Mar 3, 2023Updated 3 years ago
- ☆40Mar 21, 2024Updated 2 years ago
- A GPU-based Incremental PCA implementation.☆32Feb 18, 2025Updated last year
- This is a lightweight script designed to index the structure of a project by identifying the locations of classes, functions, and files. …☆56Apr 18, 2025Updated 11 months ago
- ☆12Sep 25, 2021Updated 4 years ago
- a robust AI library for detecting profanity in russian language (regex/SVM based), библиотека для детекции нецензурных слов в русском язы…☆38Mar 9, 2024Updated 2 years ago
- This repository contains the evaluation code for the NDSS 2024 paper: MPCDIFF: Testing and Repairing MPC-Hardened Deep Learning Models.☆16Sep 5, 2023Updated 2 years ago
- [AAAI 2025] HiRED strategically drops visual tokens in the image encoding stage to improve inference efficiency for High-Resolution Visio…☆45Apr 18, 2025Updated 11 months ago
- ☆15Jun 22, 2025Updated 9 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Nov 6, 2019Updated 6 years ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆24Aug 11, 2025Updated 8 months ago
- Missing slash commands pakage for emacs☆34Jun 27, 2025Updated 9 months ago
- Detecting segments belonging to which song in database, and return Nil if does not exist in a database.☆22May 13, 2021Updated 4 years ago
- Add _ as a shorthand in shell mode for the last shell output☆16Aug 30, 2022Updated 3 years ago
- Make Emacs write chemfig code from molfile or SMILES.☆12Oct 29, 2024Updated last year
- Aid for distraction-free writing☆15Jul 18, 2025Updated 8 months ago