ivanleomk/modal-grpo


ivanleomk / modal-grpo

☆19

Alternatives and similar repositories for modal-grpo

Users who are interested in modal-grpo are comparing it to the repositories listed below.

jxnl / instructor-classify
☆37 · May 5, 2025 · Updated last year
ari-holtzman / newformer
☆16 · Jul 20, 2023 · Updated 3 years ago
kimbochen / mini-llama-mlx
A simple LLaMA implementation using MLX.
☆15 · Apr 22, 2024 · Updated 2 years ago
mzbac / flux.1.app
☆21 · Oct 9, 2024 · Updated last year
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆60 · Oct 18, 2025 · Updated 9 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆22 · May 19, 2025 · Updated last year
haizelabs / annotate
Skill to annotate and create AI judges from agent logs
☆17 · Oct 28, 2025 · Updated 9 months ago
BY571 / DistRL-LLM
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22 · Mar 12, 2025 · Updated last year
MeLeLBGU / SaGe
Code for SaGe subword tokenizer (EACL 2023)
☆28 · Nov 30, 2024 · Updated last year
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆22 · Jun 19, 2024 · Updated 2 years ago
axolotl-ai-cloud / axolotl-cookbook
☆39 · Aug 1, 2025 · Updated 11 months ago
lyramakesmusic / activations-vis
☆15 · Feb 13, 2026 · Updated 5 months ago
brendanhogan / DeepSeekRL-Extended
Exploring Applications of GRPO
☆252 · Aug 25, 2025 · Updated 11 months ago
camenduru / champ-jupyter
☆12 · Mar 25, 2024 · Updated 2 years ago
aws-samples / generative-ai-to-build-a-devsecops-chatbot
☆13 · Nov 5, 2024 · Updated last year
QuesmaOrg / otel-bench
OpenTelemetry Benchmark - can AI trace your failed login?
☆20 · Jul 14, 2026 · Updated 2 weeks ago
Doriandarko / claude-search-mcp
An MCP server that provides web search capabilities using the Claude API.
☆48 · May 10, 2025 · Updated last year
sanchit-gandhi / notebooks
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
☆47 · Feb 13, 2026 · Updated 5 months ago
XiangLi1999 / AutoBencher
☆33 · Jul 11, 2024 · Updated 2 years ago
nateraw / modal-examples
Apps that run on modal.com
☆13 · Sep 14, 2025 · Updated 10 months ago
sausheong / openai
Go package that wraps around OpenAI HTTP APIs
☆12 · Mar 2, 2023 · Updated 3 years ago
SYED-M-HUSSAIN / Camera_Inferencing_YOLOv8_Object_Detection
This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Y…
☆11 · Sep 6, 2024 · Updated last year
awni / mlx_nanocode
Minimal Claude Code alternative powered by MLX
☆47 · Jan 11, 2026 · Updated 6 months ago
Marker-Inc-Korea / CoT-llama2
Fine-tuning llama2 using the chain-of-thought approach
☆10 · Nov 18, 2023 · Updated 2 years ago
gsalaz98 / cinnamon_roll
Limit Orderbook Replay/Analysis Library
☆10 · Nov 19, 2018 · Updated 7 years ago
VigneswaranB97 / Finding-distance-between-objects-in-an-image-using-OpenCV
Getting an exact distance between gaps and objects is very difficult; this project explores some possibilities using OpenCV.
☆10 · Nov 24, 2018 · Updated 7 years ago
broadinstitute / ml4ht_data_source
Multimodal data loader compatible with pytorch and tensorflow
☆12 · Aug 14, 2024 · Updated last year
mindreframer / webpack_and_rails
☆13 · Nov 27, 2014 · Updated 11 years ago
Doriandarko / OraclesGPT
☆11 · Aug 22, 2023 · Updated 2 years ago
agilestacks / stack-ml-eks
Customizable GitOps template for Kubeflow on AWS EKS
☆10 · Nov 19, 2020 · Updated 5 years ago
SawyerHood / develop.sh
☆21 · May 26, 2024 · Updated 2 years ago
ArturTanona / grpo_unsloth_docker
☆56 · Feb 10, 2025 · Updated last year
jtanningbed / mcp-ag2-example
A simple example demonstrating MCP + ag2 (autogen) integration
☆42 · Jul 19, 2025 · Updated last year
mzbac / SwiftAgent
☆15 · Jun 13, 2025 · Updated last year
Pinafore / karl-flashcards-web-app
The backend and web frontend for the KAR³L flashcard app
☆14 · Sep 28, 2025 · Updated 10 months ago
wisespace-io / nsqueue
Rust client for the NSQ realtime message processing system
☆20 · Jul 8, 2017 · Updated 9 years ago
enoche / tGraphAD
Research sources on graph-based anomaly detection
☆13 · Nov 29, 2022 · Updated 3 years ago
cocktailpeanutlabs / deus
☆10 · Aug 6, 2024 · Updated last year
JoviDeCroock / le-chien
A chat application hosted on CF workers built with preact
☆16 · Apr 26, 2026 · Updated 3 months ago