☆19Mar 16, 2025Updated last year
Alternatives and similar repositories for modal-grpo
Users that are interested in modal-grpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆16Jul 20, 2023Updated 2 years ago
- A simple LLaMA implementation using MLX.☆15Apr 22, 2024Updated 2 years ago
- ☆21Oct 9, 2024Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆60Oct 18, 2025Updated 8 months ago
- Project code for training LLMs to write better unit tests + code☆22May 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Companion repo to "RAG is more than vector search" blog post☆23Mar 6, 2025Updated last year
- ☆40Aug 1, 2025Updated 10 months ago
- A network scanner to output in various adblock formats☆31Jun 7, 2026Updated last week
- Code for SaGe subword tokenizer (EACL 2023)☆28Nov 30, 2024Updated last year
- A module for pulling python license data from `environment.yaml` and `requirements.txt` files☆11Nov 23, 2018Updated 7 years ago
- Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization☆22Mar 12, 2025Updated last year
- ChainRulesCore compatible pullbacks using ForwardDiff☆13Apr 20, 2026Updated last month
- This Python script uses YOLOv8 from Ultralytics for real-time object detection using OpenCV. The script initializes a camera, loads the Y…☆11Sep 6, 2024Updated last year
- A miniature version of Modal☆24Jun 11, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Apr 20, 2024Updated 2 years ago
- ☆12Mar 25, 2024Updated 2 years ago
- A MCP server that provides web search capabilities using the Claude API.☆48May 10, 2025Updated last year
- a new family of super small music generation models focusing on experimental music and latent space exploration capabilities☆36May 9, 2024Updated 2 years ago
- ☆15May 17, 2024Updated 2 years ago
- Some convenient hacks when using Nonconvex.jl.☆17Feb 10, 2026Updated 4 months ago
- ☆13Nov 5, 2024Updated last year
- AI-driven edge caching of any origin, using Cloudflare Workers and Deepseek AI☆21Jan 31, 2025Updated last year
- Apps that run on modal.com☆13Sep 14, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆110Sep 19, 2025Updated 8 months ago
- ☆11Aug 22, 2023Updated 2 years ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆47Feb 13, 2026Updated 4 months ago
- ☆15Apr 26, 2025Updated last year
- ☆11Dec 6, 2020Updated 5 years ago
- Go package that wraps around OpenAI HTTP APIs☆12Mar 2, 2023Updated 3 years ago
- Provides a convenience Julia macro to extract fields from composite types☆15Feb 14, 2020Updated 6 years ago
- The backend behind the LLM-Perf Leaderboard☆11May 5, 2024Updated 2 years ago
- ☆24Sep 11, 2025Updated 9 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Add a buss down chain to your neck (Built using Nextjs, Convex, & Gemini 2.5 flash)☆34Aug 29, 2025Updated 9 months ago
- ☆18Jul 3, 2025Updated 11 months ago
- A ray-tracer for curved spacetimes☆14May 10, 2023Updated 3 years ago
- Lego for GRPO☆30May 27, 2025Updated last year
- Convolutional Neural Networks for shoreline prediction☆12May 15, 2024Updated 2 years ago
- Code for the paper "Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation"☆33Dec 1, 2024Updated last year
- ☆21May 26, 2024Updated 2 years ago