simple grpo
☆12May 28, 2025Updated 11 months ago
Alternatives and similar repositories for grpo
Users that are interested in grpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year
- Projects Pages for the NJU-3DV's Reserach Work☆10May 8, 2026Updated 2 weeks ago
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta…☆47Jan 8, 2026Updated 4 months ago
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- Code to go with beginner FastHTML tutorial☆22Jul 5, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the ICLR 2022 paper "Attacking deep networks with surrogate-based adversarial black-box methods is easy"☆10Oct 16, 2025Updated 7 months ago
- Indranet Explorer, a simulated browser☆16Nov 12, 2024Updated last year
- ☆11Jul 21, 2023Updated 2 years ago
- Implementation of Centered Kernel Alignment (CKA)☆10Apr 7, 2021Updated 5 years ago
- Scrape and display Tyler Cowen's current favorite restaurants☆11May 8, 2026Updated 2 weeks ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Mar 16, 2025Updated last year
- A paper list of sample-efficient reinforcement learning☆18Jan 12, 2022Updated 4 years ago
- Official implementation of "Removing Batch Normalization Boosts Adversarial Training" (ICML'22)☆19Jul 20, 2022Updated 3 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Webvision Challenge 2020 developer kit☆10Dec 8, 2022Updated 3 years ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆33Mar 15, 2026Updated 2 months ago
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Jun 23, 2023Updated 2 years ago
- Privately averaging everyone's faces together without sharing actual pictures☆14Jan 12, 2023Updated 3 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 5 months ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡☆17Apr 7, 2025Updated last year
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser☆13Jan 14, 2026Updated 4 months ago
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 9 months ago
- ☆12Apr 1, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Presentations from the 2023 Fellowship.☆13Jan 31, 2024Updated 2 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 5 years ago
- Deliberate experimental Rust implementation of Syft☆12May 20, 2021Updated 5 years ago
- A rust api to read temperature and humidity from DHT11 sensor☆19May 18, 2019Updated 7 years ago
- Just some nice dice in Python☆22Jan 6, 2026Updated 4 months ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated 2 years ago
- ☆46Jan 24, 2024Updated 2 years ago
- A simple and concise implementation of the RFCN is given☆12Feb 20, 2021Updated 5 years ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 28, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- A fun PGM experience☆15May 19, 2025Updated last year
- Fully Homomorphic Encryption library☆25Mar 25, 2020Updated 6 years ago
- A MCP to connect LLMs to the archives of The Guardian☆20Jun 29, 2025Updated 10 months ago
- Explore training for quantized models☆26Jul 12, 2025Updated 10 months ago
- Example code to create high-quality knowledge graphs using entity resolution with Kuzu and Senzing☆24Sep 17, 2025Updated 8 months ago
- Use Playwright + Chromium to scrape posts from others on Twitter for semi-automated analysis of their personality traits (manual login re…☆12Oct 26, 2024Updated last year