simple grpo
☆12May 28, 2025Updated last year
Alternatives and similar repositories for grpo
Users that are interested in grpo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A full fledged mistral+wandb☆13Aug 16, 2024Updated last year
- Projects Pages for the NJU-3DV's Reserach Work☆10May 8, 2026Updated last month
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta…☆47Jan 8, 2026Updated 5 months ago
- Benchmark structured generation libraries☆31Oct 25, 2024Updated last year
- Code to go with beginner FastHTML tutorial☆22Jul 5, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for the ICLR 2022 paper "Attacking deep networks with surrogate-based adversarial black-box methods is easy"☆10Oct 16, 2025Updated 8 months ago
- Indranet Explorer, a simulated browser☆16Nov 12, 2024Updated last year
- ☆11Jul 21, 2023Updated 2 years ago
- Implementation of Centered Kernel Alignment (CKA)☆10Apr 7, 2021Updated 5 years ago
- Scrape and display Tyler Cowen's current favorite restaurants☆11Jun 22, 2026Updated last week
- Compiling useful links, papers, benchmarks, ideas, etc.☆45Mar 16, 2025Updated last year
- A paper list of sample-efficient reinforcement learning☆20Jan 12, 2022Updated 4 years ago
- Official implementation of "Removing Batch Normalization Boosts Adversarial Training" (ICML'22)☆19Jul 20, 2022Updated 3 years ago
- An unnecessarily tiny and minimal implementation of GPT-2 in NumPy.☆11Feb 12, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Webvision Challenge 2020 developer kit☆10Dec 8, 2022Updated 3 years ago
- Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning☆35Mar 15, 2026Updated 3 months ago
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Jun 23, 2023Updated 3 years ago
- Privately averaging everyone's faces together without sharing actual pictures☆14Jan 12, 2023Updated 3 years ago
- Structured output benchmarks comparing DSPy and BAML with different LLMs☆28Dec 23, 2025Updated 6 months ago
- Prototype your Jupyter Widget in the browser with anywidget and JupyterLite 💡☆17Apr 7, 2025Updated last year
- A JupyterLite deployment to try JupyterLab, Jupyter Notebook and IPython in the browser☆13Jun 2, 2026Updated last month
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 11 months ago
- ☆12Apr 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Presentations from the 2023 Fellowship.☆13Jan 31, 2024Updated 2 years ago
- A simplified version for DMC (Deep Multimodal Clustering for Unsupervised Audiovisual Learning)☆19May 27, 2020Updated 6 years ago
- Deliberate experimental Rust implementation of Syft☆12May 20, 2021Updated 5 years ago
- Just some nice dice in Python☆22Jun 17, 2026Updated 2 weeks ago
- A rust api to read temperature and humidity from DHT11 sensor☆19May 18, 2019Updated 7 years ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated 2 years ago
- ☆46Jan 24, 2024Updated 2 years ago
- A simple and concise implementation of the RFCN is given☆12Feb 20, 2021Updated 5 years ago
- A fork of sqlite-utils with CLI etc removed☆17Apr 28, 2026Updated 2 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- A fun PGM experience☆15May 19, 2025Updated last year
- Fully Homomorphic Encryption library☆25Mar 25, 2020Updated 6 years ago
- A MCP to connect LLMs to the archives of The Guardian☆20Jun 29, 2025Updated last year
- Explore training for quantized models☆26Jul 12, 2025Updated 11 months ago
- Example code to create high-quality knowledge graphs using entity resolution with Kuzu and Senzing☆25Sep 17, 2025Updated 9 months ago
- Use Playwright + Chromium to scrape posts from others on Twitter for semi-automated analysis of their personality traits (manual login re…☆12Oct 26, 2024Updated last year