An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37May 18, 2025Updated last year
Alternatives and similar repositories for GRPO-Training
Users who are interested in GRPO-Training are comparing it to the libraries listed below.
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆26Feb 19, 2026Updated 4 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 7 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 4 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- cheap & easy LLM experiments for amateurs (alpha)☆25Nov 30, 2025Updated 7 months ago
- [IEEE JBHI 2021] The convolutional neural networks training with Channel-Selectivity for human activity recognition based on sensors☆15Jul 1, 2021Updated 5 years ago
- ☆32Dec 10, 2025Updated 6 months ago
- An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader☆30Jul 12, 2025Updated 11 months ago
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆46May 13, 2026Updated last month
- ☆15Jul 18, 2022Updated 3 years ago
- Dive into Deep Learning, with Julia programming language and Flux.jl.☆11Oct 28, 2024Updated last year
- ☆20Jun 28, 2025Updated last year
- Extract, timestamp, and analyze specific content from video collections using LLM-powered audio/video processing.☆64Oct 1, 2025Updated 9 months ago
- This repository provides the codes and data used in our paper "DanHAR: Dual Attention Network For Multimodal Human Activity Recognition U…☆17Mar 18, 2022Updated 4 years ago
- ☆16Oct 19, 2025Updated 8 months ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆78Aug 25, 2025Updated 10 months ago
- ☆45Dec 14, 2024Updated last year
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project☆23Aug 27, 2020Updated 5 years ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of them☆41Jan 23, 2025Updated last year
- A powerful and simple asynchronous task management system that divides complex tasks into subtasks, processes them concurrently using o1 …☆15Dec 26, 2024Updated last year
- ArterialNet reconstructs arterial blood pressure (ABP) waveform☆14Feb 24, 2025Updated last year
- Cline Browser-Use MCP☆23Apr 27, 2025Updated last year
- RIVF 2021: Deep neural network based learning to rank for address standardization☆10Jul 13, 2024Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Deep Competitive Analyst is a 'deep agent' style LLM assistant built to automate the creation of company profiles and competitive analyse…☆46Nov 25, 2025Updated 7 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Jun 20, 2023Updated 3 years ago
- Python object index for fast, declarative retrieval☆18Jul 21, 2024Updated last year
- ☆24Jun 27, 2024Updated 2 years ago
- Personalized job recommendation system for IT specialists. Uses the APIs of job search sites to collect data, analyzes …☆14Aug 7, 2024Updated last year
- A pytorch implementation of a text to videos GAN☆12Jul 26, 2019Updated 6 years ago
- Hackable AlphaFold 3 inference pipeline.☆35Jun 18, 2025Updated last year
- ☆15Apr 26, 2025Updated last year
- Machine Learning Course☆14May 27, 2026Updated last month
- ☆13Jun 15, 2022Updated 4 years ago
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- A collection of C# projects translated from Adrian Rosebrock's excellent Python examples using OpenCV☆12Feb 24, 2017Updated 9 years ago
- ☆28Jan 17, 2025Updated last year
- Non-boring tutorials on Python and ML☆18Jun 21, 2026Updated last week