Advanced implementation of DeepSeek-R1 featuring Group Relative Policy Optimization (GRPO) for mathematical reasoning AI. Integrates safe distillation, modular reward systems, and efficient LoRA fine-tuning. Open-source Apache 2.0 licensed framework for developing aligned AI systems.
☆13Jan 29, 2025Updated last year
Alternatives and similar repositories for DeepSeek-R1-TrainingSuite
Users that are interested in DeepSeek-R1-TrainingSuite are comparing it to the libraries listed below
Sorting:
- 基于多种哈希算法和孪生神经网络的短视频相似度检测系统☆27May 31, 2023Updated 2 years ago
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- Run TFLITE models on the web☆12Jan 2, 2022Updated 4 years ago
- Comparative Study and Implementation of Five Factor Model and Myers-Briggs Type Indicator Model☆11Sep 28, 2023Updated 2 years ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- DNH Werewolf Discord bot☆13Dec 19, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- 小鸡词典🐤的Alfred🎩插件 咯咯咯☆11Apr 19, 2023Updated 2 years ago
- YouTube Assistant☆12May 15, 2023Updated 2 years ago
- 李鲁鲁老师的 Copilot-Python 学习。和ChatGPT等大语言模型协同进化。☆10Jun 3, 2025Updated 9 months ago
- Dataset and codes for SEntFiN☆10May 31, 2023Updated 2 years ago
- 记录有用的Git repos☆12Jul 28, 2024Updated last year
- Inspirational post ids collected from Reddit using pushift.io and RoBERTa☆10Jan 18, 2024Updated 2 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Master control of robot using esp32 chip with openmv and tensorflow-lite support.☆11Mar 6, 2023Updated 3 years ago
- ☆11Jun 5, 2024Updated last year
- A mathematical model for Fibonacci Retracement and location entry and exit formulation using ML☆10Aug 2, 2022Updated 3 years ago
- An open source Java implementation to interpret and render Computer Graphics Metafile (CGM) graphics files.☆15Jun 20, 2025Updated 8 months ago
- Official implementation for Text Generation Beyond Discrete Token Sampling☆21Aug 11, 2025Updated 6 months ago
- Language Modelling, CMI vs Perplexity☆11Mar 17, 2018Updated 7 years ago
- Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"☆11Apr 6, 2020Updated 5 years ago
- ETL project to download and process both CME open interest data, COT data from the CFTC and NAV/shares-outstanding data from various ETF …☆12Jul 13, 2021Updated 4 years ago
- ☆13May 25, 2023Updated 2 years ago
- Các thí nghiệm liên quan tới LLMs cho tiếng Việt (insprised by Physics of LLMs Series)☆11Oct 21, 2024Updated last year
- Vapoursynth Python scripts☆11Feb 7, 2026Updated 3 weeks ago
- My learning note in monash FIT course include fit9131 fit9132 fit9136 fit5032 fit5057 fit5136 fit5125☆15Nov 3, 2022Updated 3 years ago
- ☆12Oct 14, 2024Updated last year
- LocalAI website, powered by Hugo☆14Nov 22, 2023Updated 2 years ago
- 🗿 Predict personalities using machine learning and Big 5 model.☆10Sep 22, 2020Updated 5 years ago
- ☆10Feb 6, 2026Updated 3 weeks ago
- Personal cheat sheet (moved off betaveros.github.io)☆11Jan 5, 2025Updated last year
- ☆10Dec 3, 2023Updated 2 years ago
- Boilerplate code to get OI data from NSE site☆12Feb 14, 2023Updated 3 years ago
- This repository contains dataset for paper FedNLP: An interpretable NLP System to Decode Federal Reserve Communications, published in SIG…☆15Feb 7, 2024Updated 2 years ago
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- A fork of HumanEval-Java from the paper "Impact of Code Language Models on Automated Program Repair"☆13Dec 11, 2024Updated last year
- A front-end for the mwmbl search engine written in vanilla javascript☆13Oct 10, 2023Updated 2 years ago