An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37May 18, 2025Updated 11 months ago
Alternatives and similar repositories for GRPO-Training
Users that are interested in GRPO-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- bulk image downloader freeware, reddit bulk image downloader, bulk image downloader extension, bulk image downloader from url, bulk image…☆25Feb 19, 2026Updated 2 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 5 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Image Search Engine with HuggingFace Sentence Transformer☆12Aug 31, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- cheap & easy LLM experiments for amateurs (alpha)☆25Nov 30, 2025Updated 5 months ago
- Generative AI, Multi-Agent Systems (MAS), AI Research Methodology, Industry Best Practices, and The Future of Work (Kenyon College's Inte…☆23Dec 22, 2025Updated 4 months ago
- A simple voice agent using FastRTC and Groq☆60May 16, 2025Updated 11 months ago
- This repo contains code and data of our contribution to the 2024 LLM Hackathon, materials' property prediction from textual descriptions …☆12May 9, 2024Updated last year
- An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader☆29Jul 12, 2025Updated 9 months ago
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling"☆44Sep 12, 2025Updated 7 months ago
- Exploring and demonstrating OpenAI's Swarm framework☆20Oct 20, 2024Updated last year
- ☆15Jul 18, 2022Updated 3 years ago
- Instantly convert ideas into app code with AI! This React app uses the Gemini API to generate and preview code from Markdown, making prot…☆13Mar 31, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Dive into Deep Learning, with Julia programming language and Flux.jl.☆12Oct 28, 2024Updated last year
- ☆20Jun 28, 2025Updated 10 months ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?"☆32Jun 18, 2025Updated 10 months ago
- Python package to export NN/RR interval series in KUBIOS HRV readable format and to import HRV results from KUBIOS report files in .txt f…☆12Jan 4, 2019Updated 7 years ago
- prediction market assistant using kalshi API and perplexity sonar api☆51Feb 22, 2025Updated last year
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv…☆76Aug 25, 2025Updated 8 months ago
- uses all reasoning models in parallel and synthesizes an answer with o1. also has multi-chat where you can chat with any of them☆41Jan 23, 2025Updated last year
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a…☆30Apr 3, 2026Updated last month
- A powerful and simple asynchronous task management system that divides complex tasks into subtasks, processes them concurrently using o1 …☆16Dec 26, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- ☆16Nov 25, 2022Updated 3 years ago
- A trading agent that uses deep reinforcement learning to trade Ethereum.☆24Jun 21, 2024Updated last year
- Cline Browser-Use MCP☆22Apr 27, 2025Updated last year
- Fine-tune copilot based on your codebase☆12Mar 26, 2024Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- Deep Competitive Analyst is a 'deep agent' style LLM assistant built to automate the creation of company profiles and competitive analyse…☆39Nov 25, 2025Updated 5 months ago
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Jun 20, 2023Updated 2 years ago
- Chinese Word Segmentation task based on BERT and implemented in Pytorch☆14Aug 14, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Персонализированная система рекомендации вакансий для IT-специалистов. Использует API сайтов поиска работы для сбора данных, анализирует …☆14Aug 7, 2024Updated last year
- A pytorch implementation of a text to videos GAN☆12Jul 26, 2019Updated 6 years ago
- Hackable AlphaFold 3 inference pipeline.☆35Jun 18, 2025Updated 10 months ago
- Статьи по роликам на канале мыш☆18Dec 26, 2025Updated 4 months ago
- GPT-2 port to C#☆11Sep 27, 2022Updated 3 years ago
- Building your own AI Agent using Semantic Kernel - Microsoft Learn Zero to Hero Community 2024☆20May 23, 2024Updated last year
- CRUD Word documents with Python☆13Feb 5, 2026Updated 2 months ago