minimal GRPO implementation from scratch
☆103Mar 14, 2025Updated 11 months ago
Alternatives and similar repositories for Tiny-GRPO
Users that are interested in Tiny-GRPO are comparing it to the libraries listed below
Sorting:
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 10 months ago
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆58Jan 16, 2026Updated last month
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated 3 weeks ago
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- ☆16Aug 7, 2024Updated last year
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- Minute-long video generation at 24FPS.☆55Feb 2, 2026Updated last month
- ☆32Nov 18, 2025Updated 3 months ago
- Demonstration of how to run multiple chains in Langchain Assyncronously☆12Jul 6, 2023Updated 2 years ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- Python client SDK for Ultravox.☆16Dec 10, 2025Updated 3 months ago
- ☆16Oct 17, 2025Updated 4 months ago
- SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profi…☆62Mar 4, 2026Updated last week
- MCP server for searching npm packages☆15Feb 20, 2026Updated 2 weeks ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Dec 4, 2025Updated 3 months ago
- ☆29Mar 3, 2026Updated last week
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆21Nov 1, 2024Updated last year
- ☆16Jul 8, 2024Updated last year
- ☆18Aug 17, 2022Updated 3 years ago
- A scalable automated alignment method for large language models. Resources for "Aligning Large Language Models via Self-Steering Optimiza…☆20Nov 21, 2024Updated last year
- Visualize any repo or codebase into diagram or animation☆20Oct 14, 2024Updated last year
- Repository for the code assignment of the Deep Learning 1 course, Fall 2022 edition☆20Dec 9, 2022Updated 3 years ago
- ☆44Sep 19, 2024Updated last year
- ☆34Jul 8, 2025Updated 8 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆51May 7, 2025Updated 10 months ago
- A small rust-based data loader☆36Feb 20, 2026Updated 2 weeks ago
- Canopyai Orpheus & LMStudio: 100% Uncensored Private Offline chat☆27Apr 17, 2025Updated 10 months ago
- Pipeline parallelism for the minimalist☆41Aug 6, 2025Updated 7 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆350Apr 10, 2025Updated 11 months ago
- H.AI cookbook provides code examples and guides to help developers use models developed by H Company.☆66Feb 20, 2026Updated 2 weeks ago
- ☆25Apr 26, 2025Updated 10 months ago
- ☆24Dec 8, 2024Updated last year
- ☆30Jul 18, 2024Updated last year
- A collection of niche / personally useful PyTorch optimizers with modified code.☆27Oct 25, 2025Updated 4 months ago
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Production-ready Python library for multi-provider LLM orchestration☆40Oct 10, 2025Updated 5 months ago
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,603Nov 21, 2025Updated 3 months ago