simple grpo
☆12 May 28, 2025 Updated 9 months ago
Alternatives and similar repositories for grpo
Users who are interested in grpo are comparing it to the libraries listed below.
- A full fledged mistral+wandb ☆13 Aug 16, 2024 Updated last year
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta… ☆44 Jan 8, 2026 Updated last month
- Benchmark structured generation libraries ☆31 Oct 25, 2024 Updated last year
- Code to go with beginner FastHTML tutorial ☆20 Jul 5, 2025 Updated 7 months ago
- ☆49 Oct 28, 2025 Updated 4 months ago
- The production website for SquiggleConf: a conference for excellent web dev tooling ☆11 Jan 27, 2026 Updated last month
- A Bayesian model for time-series count data with weekend effects and a lagged reporting process ☆10 Mar 7, 2022 Updated 3 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy. ☆11 Feb 12, 2023 Updated 3 years ago
- ☆11 Aug 20, 2025 Updated 6 months ago
- Python driver for MobilityDB ☆11 Apr 12, 2023 Updated 2 years ago
- LLM-aided data filtering ☆14 Dec 3, 2024 Updated last year
- Use `outlines` generators with Haystack. ☆15 Feb 24, 2026 Updated last week
- Benchmark of glucose predictive models in diabetes ☆11 Nov 12, 2024 Updated last year
- ☆14 Apr 17, 2023 Updated 2 years ago
- Typed python equivalent for R pipes. ☆13 Oct 16, 2022 Updated 3 years ago
- In the first course of Machine Learning Engineering for Production Specialization, you will identify the various components and design an… ☆10 Nov 4, 2021 Updated 4 years ago
- A simple library-less CUDA implementation of the OneSweep sorting algorithm. ☆11 Feb 26, 2024 Updated 2 years ago
- A simple Python library for compartment models ☆11 Aug 23, 2021 Updated 4 years ago
- 🦌 Deep Retention, Winner @ Calhacks ✨🌠 ☆10 Oct 26, 2024 Updated last year
- Compiling useful links, papers, benchmarks, ideas, etc. ☆46 Mar 16, 2025 Updated 11 months ago
- Building or integrating an LLM wrapper shouldn't take more than 10 minutes. ☆13 Feb 1, 2025 Updated last year
- ☆11 Mar 24, 2025 Updated 11 months ago
- An automated trading bot for Magic Online ☆15 Mar 8, 2011 Updated 14 years ago
- ☆11 Oct 16, 2023 Updated 2 years ago
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara ☆16 Aug 3, 2023 Updated 2 years ago
- Driving range prediction by looking at energy consumption rate of Electronic Vehicles using ML regression techniques. ☆14 Sep 19, 2021 Updated 4 years ago
- OpenTelemetry wrapper for Claude Code CLI that logs tool calls, token usage, costs, and execution traces to Logfire, Sentry, Honeycomb, o… ☆18 Oct 24, 2025 Updated 4 months ago
- Put your data somewhere you can look at it ☆29 Jun 9, 2025 Updated 8 months ago
- ☆13 Sep 11, 2023 Updated 2 years ago
- applications of https://github.com/PrefectHQ/marvin ☆13 Jan 15, 2024 Updated 2 years ago
- Transcribing audio files on Modal with open source ASR models is fast, cheap, and easy! ☆18 Jul 25, 2025 Updated 7 months ago
- ☆12 Jul 24, 2023 Updated 2 years ago
- Text Adventure Learning Environment Suite - Benchmark to evaluate language models on interactive text environments. ☆25 Feb 18, 2026 Updated 2 weeks ago
- 🌳 ☆11 Dec 13, 2021 Updated 4 years ago
- Generates and optimizes Haiku system and user prompts for classification ☆14 Oct 27, 2025 Updated 4 months ago
- ☆12 Jul 6, 2023 Updated 2 years ago
- Minimalistic Google Docs based workflow for Distill.pub ☆10 Jun 14, 2023 Updated 2 years ago
- Autocomplete / Autofill Text field with Dropdown menu to choose between suggested values from a given list. ☆14 Feb 23, 2024 Updated 2 years ago
- 📙 Notebooks Academy: Write Production-Ready Code From Jupyter. ☆13 Jan 5, 2023 Updated 3 years ago