☆19 · Mar 16, 2025 · Updated last year
Alternatives and similar repositories for modal-grpo
Users that are interested in modal-grpo are comparing it to the libraries listed below.
- ☆37 · May 5, 2025 · Updated 10 months ago
- ☆16 · Jul 20, 2023 · Updated 2 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆59 · Oct 18, 2025 · Updated 5 months ago
- CodemixedNLP: An Extensible and Open NLP Toolkit for Code-Switching ☆18 · Mar 29, 2021 · Updated 4 years ago
- Project code for training LLMs to write better unit tests + code ☆21 · May 19, 2025 · Updated 10 months ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs. ☆43 · Mar 21, 2026 · Updated last week
- ☆56 · Mar 4, 2025 · Updated last year
- ☆40 · Aug 1, 2025 · Updated 7 months ago
- Time-series transformer ☆11 · Sep 18, 2025 · Updated 6 months ago
- A module for pulling python license data from `environment.yaml` and `requirements.txt` files ☆11 · Nov 23, 2018 · Updated 7 years ago
- ChainRulesCore compatible pullbacks using ForwardDiff ☆13 · Mar 9, 2026 · Updated 2 weeks ago
- Drop-in environment replacements that make your RL algorithm train faster. ☆21 · Jun 19, 2024 · Updated last year
- This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Y… ☆11 · Sep 6, 2024 · Updated last year
- A miniature version of Modal ☆23 · Jun 11, 2024 · Updated last year
- Fine-tuning llama2 using the chain-of-thought approach ☆10 · Nov 18, 2023 · Updated 2 years ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 · Apr 20, 2024 · Updated last year
- ☆12 · Mar 25, 2024 · Updated 2 years ago
- ☆15 · May 17, 2024 · Updated last year
- Some convenient hacks when using Nonconvex.jl. ☆17 · Feb 10, 2026 · Updated last month
- Exploring Applications of GRPO ☆252 · Aug 25, 2025 · Updated 7 months ago
- ☆13 · Nov 5, 2024 · Updated last year
- An MCP server for a Node.js debugger. Designed to run locally alongside Claude Code or other coding agents. ☆16 · Jun 19, 2025 · Updated 9 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog). ☆46 · Feb 13, 2026 · Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆109 · Sep 19, 2025 · Updated 6 months ago
- ☆15 · Apr 26, 2025 · Updated 11 months ago
- Go package that wraps around OpenAI HTTP APIs ☆12 · Mar 2, 2023 · Updated 3 years ago
- PyTorch codes for the paper "An Empirical Study of Multimodal Model Merging" ☆37 · Oct 11, 2023 · Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard ☆11 · May 5, 2024 · Updated last year
- ☆20 · Sep 11, 2025 · Updated 6 months ago
- ☆18 · Jul 3, 2025 · Updated 8 months ago
- stream-of-consciousness experience of an AI's thinking process, complete with creative tangents and unexpected connections. ☆14 · Jan 29, 2025 · Updated last year
- Lego for GRPO ☆30 · May 27, 2025 · Updated 10 months ago
- ☆11 · Mar 16, 2026 · Updated last week
- An experimental implementation of sum-product networks with dense unitary transformations in leaves ☆13 · Sep 8, 2022 · Updated 3 years ago
- Lisp environment with Emacs-like editor ☆11 · Jan 8, 2022 · Updated 4 years ago
- Automatically summarize lectures and ask questions about the course material