simple grpo
☆12May 28, 2025Updated 10 months ago
Alternatives and similar repositories for grpo
Users that are interested in grpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year
- Projects Pages for the NJU-3DV's Reserach Work☆10Jan 26, 2026Updated 2 months ago
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta…☆46Jan 8, 2026Updated 3 months ago
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆32Mar 15, 2026Updated 3 weeks ago
- Privately averaging everyone's faces together without sharing actual pictures☆14Jan 12, 2023Updated 3 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 3 months ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡☆17Apr 7, 2025Updated last year
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser☆13Jan 14, 2026Updated 2 months ago
- Presentations from the 2023 Fellowship.☆14Jan 31, 2024Updated 2 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 5 years ago
- Just some nice dice in Python☆21Jan 6, 2026Updated 3 months ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆45Jan 24, 2024Updated 2 years ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 6, 2026Updated last week
- A fun PGM experience☆15May 19, 2025Updated 10 months ago
- ☆42Mar 11, 2026Updated last month
- A MCP to connect LLMs to the archives of The Guardian☆19Jun 29, 2025Updated 9 months ago
- Explore training for quantized models☆26Jul 12, 2025Updated 9 months ago
- Text world based on Minecraft rules.☆17May 13, 2024Updated last year
- ☆121Updated this week
- gpt-3.5-turbo-instruct, prompted with PGN, vs Stockfish Level 4 on LiChess☆15Sep 19, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆18Apr 2, 2023Updated 3 years ago
- A redesigned JupyterLab launcher☆17Feb 27, 2026Updated last month
- Will write CUDA for 100 days☆39May 25, 2025Updated 10 months ago
- using information theory to encourage agents to cooperate and compete☆19Oct 4, 2018Updated 7 years ago
- ☆19Nov 6, 2024Updated last year
- Simulations for predictive model selection in causal inference☆13Jan 16, 2025Updated last year
- Getting started with BAML for creating and querying knowledge graphs with LLMs☆22May 20, 2025Updated 10 months ago
- AI code completions in JupyterLab powered by Codeium ✨☆19Oct 23, 2025Updated 5 months ago
- This is the course repository for the Spring 2023 iteration of MACS 30123 "Large-Scale Computing for the Social Sciences" at the Universi…☆13May 16, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- LLMs: Beyond Chat☆21Jan 31, 2025Updated last year
- Python driver for MobilityDB☆11Apr 12, 2023Updated 3 years ago
- The production website for SquiggleConf: a conference for excellent web dev tooling☆11Mar 7, 2026Updated last month
- Tracking the history of the FARA data from https://www.justice.gov/nsd-fara☆16Aug 3, 2023Updated 2 years ago
- ☆14Apr 17, 2023Updated 2 years ago
- Financial Dividend Investing Dashboard Build in Python with Solara☆18Mar 3, 2024Updated 2 years ago
- ☆10Sep 13, 2025Updated 7 months ago