An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning
☆37 May 18, 2025 Updated last year
Alternatives and similar repositories for GRPO-Training
Users that are interested in GRPO-Training are comparing it to the libraries listed below.
- Custom triton kernels for training Karpathy's nanoGPT. ☆19 Oct 21, 2024 Updated last year
- Measuring Thinking Efficiency in Reasoning Models - Research Repository ☆39 Dec 2, 2025 Updated 5 months ago
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 May 31, 2022 Updated 3 years ago
- Image Search Engine with HuggingFace Sentence Transformer ☆12 Aug 31, 2023 Updated 2 years ago
- cheap & easy LLM experiments for amateurs (alpha) ☆25 Nov 30, 2025 Updated 5 months ago
- simple terminal-based AI coding agent. This is for learning purposes more than a final working app. ☆27 Mar 6, 2025 Updated last year
- Generative AI, Multi-Agent Systems (MAS), AI Research Methodology, Industry Best Practices, and The Future of Work (Kenyon College's Inte… ☆23 Dec 22, 2025 Updated 5 months ago
- App built in the "Coding the Future With AI" YouTube tutorial series "Mastering AI Coding" ☆12 Jan 5, 2025 Updated last year
- Landing repository for the paper "Predicting the Order of Upcoming Tokens Improves Language Modeling" ☆45 May 13, 2026 Updated last week
- ☆15 Jul 18, 2022 Updated 3 years ago
- Instantly convert ideas into app code with AI! This React app uses the Gemini API to generate and preview code from Markdown, making prot… ☆14 Mar 31, 2026 Updated last month
- Code for the paper "ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction?" ☆34 Jun 18, 2025 Updated 11 months ago
- Extract, timestamp, and analyze specific content from video collections using LLM-powered audio/video processing. ☆64 Oct 1, 2025 Updated 7 months ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv… ☆76 Aug 25, 2025 Updated 8 months ago
- Official repository of "TensorFlow Serving with Docker for Model Deployment" Coursera Project ☆23 Aug 27, 2020 Updated 5 years ago
- Making of cuda kernel ☆16 May 27, 2025 Updated 11 months ago
- Collaborative Multi-Agent RAG with CrewAI ☆75 May 19, 2024 Updated 2 years ago
- ☆16 Nov 25, 2022 Updated 3 years ago
- ☆28 Aug 5, 2024 Updated last year
- Fine-tune copilot based on your codebase ☆12 Mar 26, 2024 Updated 2 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆15 Mar 11, 2024 Updated 2 years ago
- ☆15 Apr 26, 2025 Updated last year
- Calculate allowed interactions in QED ☆10 Nov 2, 2022 Updated 3 years ago
- ☆13 Jun 15, 2022 Updated 3 years ago
- ☆27 Jan 17, 2025 Updated last year
- Building your own AI Agent using Semantic Kernel - Microsoft Learn Zero to Hero Community 2024 ☆20 May 23, 2024 Updated 2 years ago
- Implementing scalable LLMs in pure JAX (no third-party libraries) ☆50 May 11, 2026 Updated last week
- Code for "What really matters in matrix-whitening optimizers?" ☆24 Oct 31, 2025 Updated 6 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆19 May 11, 2026 Updated last week
- Implementation of Infini-Transformer in Pytorch ☆112 Jan 4, 2025 Updated last year
- A Python application that loads and processes both web pages and local documents, indexing their content using embeddings, and enabling s… ☆26 Jul 20, 2025 Updated 10 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i… ☆24 Nov 29, 2025 Updated 5 months ago
- ☆15 Aug 5, 2022 Updated 3 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training. ☆23 Feb 26, 2026 Updated 2 months ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs. ☆14 Aug 8, 2025 Updated 9 months ago
- ☆10 Apr 9, 2019 Updated 7 years ago
- ☆32 Feb 2, 2025 Updated last year
- TuneTables is a tabular classifier that implements prompt tuning for frozen prior-fitted networks. ☆24 Mar 31, 2025 Updated last year
- Open Source AI Database for Voice Agent Transcripts | Call Analysis & Insights | Extraction | Labelling & Classification ☆29 Nov 3, 2025 Updated 6 months ago