☆19Mar 16, 2025Updated last year
Alternatives and similar repositories for modal-grpo
Users that are interested in modal-grpo are comparing it to the libraries listed below.
- ☆15Jun 19, 2025Updated 9 months ago
- ☆37May 5, 2025Updated 11 months ago
- ☆16Jul 20, 2023Updated 2 years ago
- SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards☆37Jan 28, 2026Updated 2 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 5 months ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching☆18Mar 29, 2021Updated 5 years ago
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 10 months ago
- A Python library that uses Reinforcement Learning (RL) to train LLMs.☆43Updated this week
- Companion repo to "RAG is more than vector search" blog post☆23Mar 6, 2025Updated last year
- ☆40Aug 1, 2025Updated 8 months ago
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- Drop-in environment replacements that make your RL algorithm train faster.☆22Jun 19, 2024Updated last year
- This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Y…☆11Sep 6, 2024Updated last year
- A miniature version of Modal☆23Jun 11, 2024Updated last year
- Fine-tuning llama2 using the chain-of-thought approach☆10Nov 18, 2023Updated 2 years ago
- ☆12Mar 25, 2024Updated 2 years ago
- Exploring Applications of GRPO☆252Aug 25, 2025Updated 7 months ago
- AI coding assistant in Rust☆31Mar 7, 2025Updated last year
- ☆13Nov 5, 2024Updated last year
- Table module for ProseMirror☆29Aug 20, 2023Updated 2 years ago
- ☆11Aug 22, 2023Updated 2 years ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆110Sep 19, 2025Updated 6 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Feb 13, 2026Updated 2 months ago
- ☆15Apr 26, 2025Updated 11 months ago
- Go package that wraps around OpenAI HTTP APIs☆12Mar 2, 2023Updated 3 years ago
- VLS: Steering Pretrained Robot Policies via Vision–Language Models☆48Mar 29, 2026Updated 2 weeks ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging"☆37Oct 11, 2023Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated last year
- Add a buss down chain to your neck (Built using Nextjs, Convex, & Gemini 2.5 flash)☆34Aug 29, 2025Updated 7 months ago
- ☆18Jul 3, 2025Updated 9 months ago
- Lego for GRPO☆30May 27, 2025Updated 10 months ago
- Limit Orderbook Replay/Analysis Library☆10Nov 19, 2018Updated 7 years ago
- ☆21May 26, 2024Updated last year
- ☆11Updated this week
- ☆28Aug 7, 2023Updated 2 years ago
- DEPRECATED - A DogStatsd Python client☆16Dec 12, 2018Updated 7 years ago
- A work in progress library that fuses the HL7 FHIR standard with scikit-learn☆21Jul 26, 2023Updated 2 years ago
- ☆10Aug 6, 2024Updated last year
- ☆15Jun 2, 2025Updated 10 months ago