This repository contains code for fine-tuning LLMs with GRPO, specifically for Rust programming, using cargo as feedback.
☆116Mar 8, 2025Updated last year
Alternatives and similar repositories for GRPO-With-Cargo-Feedback
Users that are interested in GRPO-With-Cargo-Feedback are comparing it to the libraries listed below
- Run computational experiments using marimo notebooks☆21Jun 11, 2025Updated 8 months ago
- This AI agent retrieves the latest news articles based on multiple keywords using the Serp API. It processes the results and returns struct…☆11Jan 31, 2025Updated last year
- Fast, concurrent, safe MPMC & MPSC FIFO queue implementation☆14Nov 16, 2021Updated 4 years ago
- virtual machine for rapid embedded development☆31Jan 16, 2025Updated last year
- A short demo to introduce the polars dataframe library through a marimo notebook.☆24Jan 29, 2025Updated last year
- A cheat sheet for marimo that runs as a marimo app☆27Feb 15, 2024Updated 2 years ago
- AI agents + toolkits for scientific knowledge☆37Aug 29, 2025Updated 6 months ago
- Firestore-inspired real-time sql queries subset for Tauri (Typescript frontend, Rust backend)☆11Nov 25, 2024Updated last year
- ☆11Jan 9, 2025Updated last year
- Implement reinforcement learning (RL) based on parameterized quantum circuits with the quantum computing cloud Quafu.☆11Oct 19, 2023Updated 2 years ago
- A CodeMirror extension for inline completions, next-edit prediction, and prompt history☆44Updated this week
- Simple template for Freya.☆18May 27, 2025Updated 9 months ago
- 'Build a Full-Stack Twitter Clone with Rust' course code and notes☆13Aug 6, 2023Updated 2 years ago
- This repo contains code for Numerous widgets, patterns and tools to support developing apps in frameworks like Marimo and Panel. The widg…☆18Jul 10, 2025Updated 8 months ago
- Lesson plugins for Marimo notebooks☆19Apr 9, 2025Updated 11 months ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon☆14Dec 28, 2023Updated 2 years ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆22Jul 4, 2025Updated 8 months ago
- ☆15Jun 4, 2025Updated 9 months ago
- A proc macro for creating compile-time checked CSS class sets, in the style of classNames☆17Jan 7, 2023Updated 3 years ago
- WebGym: Web-browser-based tasks for RL Agents☆24Feb 4, 2021Updated 5 years ago
- 🔵 D2: Declarative Diagramming in Python via AnyWidget☆20Mar 1, 2026Updated last week
- ☆23Jun 11, 2025Updated 8 months ago
- 😎 A curated list of the best resources in the HASH ecosystem☆28Sep 14, 2023Updated 2 years ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 6 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆31Feb 26, 2025Updated last year
- ☆90Nov 9, 2024Updated last year
- [moved!] Include mdbooks at compile time in your Rust project☆25Feb 5, 2025Updated last year
- Simple repository for training small reasoning models☆49Feb 17, 2026Updated 3 weeks ago
- A whisper <lib|cli|server> written in rust☆20Jan 3, 2026Updated 2 months ago
- Rust based implementation of a full stack web application Twitter clone using the Tide Web Framework☆15Jul 18, 2022Updated 3 years ago
- Minimalist ML framework for Rust☆21Feb 28, 2026Updated last week
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- We feature a project or marimo notebook from the community every Thursday!☆56Jul 25, 2025Updated 7 months ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆97May 16, 2025Updated 9 months ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- A logging utility to provide a standard interface whether you're targeting web, desktop, fullstack, and more.☆22Dec 7, 2024Updated last year
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Dec 12, 2025Updated 2 months ago
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …☆14Jul 28, 2025Updated 7 months ago