An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37May 18, 2025Updated 10 months ago
Alternatives and similar repositories for GRPO-Training
Users that are interested in GRPO-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Feb 19, 2026Updated last month
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 4 months ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- cheap & easy LLM experiments for amateurs (alpha)☆25Nov 30, 2025Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- simple terminal-based AI coding agent. This is for learning purposes more than a final working app.☆27Mar 6, 2025Updated last year
- Generative AI, Multi-Agent Systems (MAS), AI Research Methodology, Industry Best Practices, and The Future of Work (Kenyon College's Inte…☆23Dec 22, 2025Updated 3 months ago
- A simple voice agent using FastRTC and Groq☆60May 16, 2025Updated 10 months ago
- An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader☆29Jul 12, 2025Updated 9 months ago
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆44Sep 12, 2025Updated 7 months ago
- Exploring and demonstrating OpenAI's Swarm framework☆20Oct 20, 2024Updated last year
- ☆19Jun 28, 2025Updated 9 months ago
- Extract, timestamp, and analyze specific content from video collections using LLM-powered audio/video processing.☆63Oct 1, 2025Updated 6 months ago
- ☆17Feb 22, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆75Aug 25, 2025Updated 7 months ago
- ☆43Dec 14, 2024Updated last year
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project☆23Aug 27, 2020Updated 5 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- A powerful and simple asynchronous task management system that divides complex tasks into subtasks, processes them concurrently using o1 …☆16Dec 26, 2024Updated last year
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a…☆29Apr 3, 2026Updated last week
- Making of cuda kernel☆16May 27, 2025Updated 10 months ago
- ☆27Aug 5, 2024Updated last year
- Cline Browser-Use MCP☆22Apr 27, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Open Source AI Database for Voice Agent Transcripts | Call Analysis & Insights | Extraction | Labelling & Classification☆23Nov 3, 2025Updated 5 months ago
- ☆32May 1, 2025Updated 11 months ago
- A pytorch implementation of a text to videos GAN☆12Jul 26, 2019Updated 6 years ago
- ☆15Apr 26, 2025Updated 11 months ago
- A collection of C# projects translated from Adrian Rosebrock excellent python examples using OpenCV☆12Feb 24, 2017Updated 9 years ago
- Calculate allowed interactions in QED☆10Nov 2, 2022Updated 3 years ago
- GPT-2 port to C#☆11Sep 27, 2022Updated 3 years ago
- Building your own AI Agent using Semantic Kernel - Microsoft Learn Zero to Hero Community 2024☆20May 23, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Opinionated AI agent dev stack with tools, guides, templates, and workflows to take you from good to great.☆45Dec 28, 2025Updated 3 months ago
- The Ollama.NET is a powerful and easy-to-use library designed to simplify the integration of Ollama's services into .NET applications.☆11Jul 31, 2024Updated last year
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 5 months ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- A Python application that loads and processes both web pages and local documents, indexing their content using embeddings, and enabling s…☆26Jul 20, 2025Updated 8 months ago
- Implementation of Infini-Transformer in Pytorch☆112Jan 4, 2025Updated last year
- ☆15Aug 5, 2022Updated 3 years ago