An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37 · May 18, 2025 · Updated 9 months ago
Alternatives and similar repositories for GRPO-Training
Users who are interested in GRPO-Training are comparing it to the libraries listed below.
- Image Search Engine with HuggingFace Sentence Transformer ☆12 · Aug 31, 2023 · Updated 2 years ago
- Generative AI, Multi-Agent Systems (MAS), AI Research Methodology, Industry Best Practices, and The Future of Work (Kenyon College's Inte… ☆23 · Dec 22, 2025 · Updated 2 months ago
- This repo contains code and data of our contribution to the 2024 LLM Hackathon, materials' property prediction from textual descriptions … ☆12 · May 9, 2024 · Updated last year
- An intuitive approach towards understanding how Retrieval Augmented Generation (RAG) systems work, for the curious yet daunted reader ☆28 · Jul 12, 2025 · Updated 7 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 · May 31, 2022 · Updated 3 years ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv… ☆75 · Aug 25, 2025 · Updated 6 months ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆39 · Dec 2, 2025 · Updated 3 months ago
- Custom triton kernels for training Karpathy's nanoGPT. ☆19 · Oct 21, 2024 · Updated last year
- Collaborative Multi-Agent RAG with CrewAI ☆72 · May 19, 2024 · Updated last year
- A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such a… ☆29 · Dec 5, 2025 · Updated 2 months ago
- Exploring and demonstrating OpenAI's Swarm framework ☆20 · Oct 20, 2024 · Updated last year
- cheap & easy LLM experiments for amateurs (alpha) ☆25 · Nov 30, 2025 · Updated 3 months ago
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?" ☆31 · Jun 18, 2025 · Updated 8 months ago
- ☆24 · Jun 27, 2024 · Updated last year
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project ☆23 · Aug 27, 2020 · Updated 5 years ago
- simple terminal-based AI coding agent. This is for learning purposes more than a final working app. ☆27 · Mar 6, 2025 · Updated 11 months ago
- Built and deployed scalable LLM retrieval APIs on a hybrid GCP architecture with full CI/CD, IaC, and monitoring ☆72 · Aug 10, 2025 · Updated 6 months ago
- Detecting car parking slot on Open car park space ☆13 · Oct 21, 2019 · Updated 6 years ago
- ☆27 · Aug 5, 2024 · Updated last year
- ☆28 · Nov 26, 2024 · Updated last year
- The classic movies redux with machine learning using TensorFlow and Keras. ☆11 · Feb 12, 2019 · Updated 7 years ago
- FakeChecker is a part of my Engineering thesis project at Warsaw University of Technology. Its aim is to detect fake reviews on Google Ma… ☆12 · Jun 13, 2023 · Updated 2 years ago
- In-memory OLAP SQL server for object storage data. ☆14 · Oct 15, 2025 · Updated 4 months ago
- AI-powered cryptocurrency trading bot built using deep reinforcement learning (DRL). The bot is designed as a research platform for devel… ☆10 · Jan 18, 2025 · Updated last year
- ☆17 · Feb 6, 2025 · Updated last year
- n8n Templates in JSON ☆16 · Feb 9, 2025 · Updated last year
- Program to plot a Ramachandran plot of all dihedral angles from a given PDB file. Background is empirically generated from the peptides … ☆12 · Feb 25, 2025 · Updated last year
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications ☆40 · Nov 11, 2024 · Updated last year
- Building a multi-agent RAG system with advanced RAG methods ☆12 · Jan 12, 2025 · Updated last year
- This is a repository to let you know the implementation of a basic RAG pipeline using LangChain in Supabase Edge Functions. ☆11 · May 22, 2024 · Updated last year
- This is an A/B test project from Udacity. ☆12 · Dec 24, 2019 · Updated 6 years ago
- Awesome Curated | Constructive Developmental Theory: Adult Development, Dialectical Thought Form Framework, Immunity to Change, etc ☆12 · May 9, 2025 · Updated 9 months ago
- Durability for web streams powered by S2 ☆22 · Jan 2, 2026 · Updated 2 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 · Feb 20, 2024 · Updated 2 years ago
- Parse data and generate plotting scripts based on plotly. ☆11 · Dec 8, 2025 · Updated 2 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆22 · Updated this week
- Multi-Agent LLM System for Digital Scam Protection ☆12 · Dec 19, 2024 · Updated last year
- The codebase contains the implementation for the paper "An asset subset-constrained minimax optimization framework for online portfolio s… ☆11 · Dec 3, 2024 · Updated last year
- Calculate allowed interactions in QED ☆10 · Nov 2, 2022 · Updated 3 years ago