Oxen-AI / GRPO-With-Cargo-Feedback
This repository contains code for fine-tuning LLMs with GRPO for Rust programming, using cargo as feedback.
☆114 Mar 8, 2025 Updated 11 months ago
Alternatives and similar repositories for GRPO-With-Cargo-Feedback
Users that are interested in GRPO-With-Cargo-Feedback are comparing it to the libraries listed below
- Run computational experiments using marimo notebooks ☆21 Jun 11, 2025 Updated 8 months ago
- This AI Agent retrieves the latest news articles based on a multi keyword using the Serp API. It processes the results and returns struct… ☆11 Jan 31, 2025 Updated last year
- A short demo to introduce the polars dataframe library through a marimo notebook. ☆24 Jan 29, 2025 Updated last year
- virtual machine for rapid embedded development ☆31 Jan 16, 2025 Updated last year
- Firestore-inspired real-time sql queries subset for Tauri (Typescript frontend, Rust backend) ☆11 Nov 25, 2024 Updated last year
- Implement reinforcement learning (RL) based on parameterized quantum circuits with quantum computing cloud Quafu. ☆11 Oct 19, 2023 Updated 2 years ago
- A CodeMirror extension for inline completions, next-edit prediction, and prompt history ☆43 Updated this week
- Make data-driven table rendering easy with Dioxus ☆11 May 9, 2022 Updated 3 years ago
- 'Build a Full-Stack Twitter Clone with Rust' course code and notes ☆13 Aug 6, 2023 Updated 2 years ago
- Simple template for Freya. ☆17 May 27, 2025 Updated 8 months ago
- Karpathy's llama2.c transpiled to MLX for Apple Silicon ☆14 Dec 28, 2023 Updated 2 years ago
- ☆15 Jun 4, 2025 Updated 8 months ago
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems ☆21 Jul 4, 2025 Updated 7 months ago
- Example code to create a persistent SQLite extension in Rust ☆18 May 16, 2022 Updated 3 years ago
- A proc macro for creating compile-time checked CSS class sets, in the style of classNames ☆17 Jan 7, 2023 Updated 3 years ago
- A Rust 🦀 port of the Hugging Face smolagents library. ☆42 Mar 26, 2025 Updated 10 months ago
- ☆13 Nov 28, 2025 Updated 2 months ago
- ☆18 Apr 18, 2025 Updated 9 months ago
- 😎 A curated list of the best resources in the HASH ecosystem ☆28 Sep 14, 2023 Updated 2 years ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails ☆30 Feb 26, 2025 Updated 11 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers. ☆18 Jul 24, 2025 Updated 6 months ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series ☆17 Sep 5, 2025 Updated 5 months ago
- Simple repository for training small reasoning models ☆49 Feb 6, 2025 Updated last year
- Collection of useful conversions and widgets implemented as a portable Rust app ☆22 Updated this week
- Minimalist ML framework for Rust ☆20 Feb 8, 2026 Updated last week
- Animation framework for Dioxus ☆21 Jan 30, 2025 Updated last year
- Simple Implementation of a Transformer in the new framework MLX by Apple ☆19 Nov 18, 2024 Updated last year
- Systematic evaluation framework that automatically rates overthinking behavior in large language models. ☆96 May 16, 2025 Updated 9 months ago
- ☆137 Mar 20, 2025 Updated 10 months ago
- One click away from a locally downloaded, fine-tuned model, hosted on hugging face, with inference built in. In two hours. ☆24 Nov 9, 2025 Updated 3 months ago
- This demo showcases different approaches to handling the delay during RAG (Retrieval-Augmented Generation) lookups in a voice-enabled AI … ☆20 Jan 23, 2025 Updated last year
- ☆14 Jan 4, 2026 Updated last month
- Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of … ☆14 Jul 28, 2025 Updated 6 months ago
- A Voice Activity Detector rust library using the Silero VAD model. ☆62 Aug 4, 2025 Updated 6 months ago
- ☆86 Feb 1, 2024 Updated 2 years ago
- Sphynx Hallucination Induction ☆53 Jan 31, 2025 Updated last year
- Customize, control, and enhance LLM generation with logits processors, featuring visualization capabilities to inspect and understand sta… ☆44 Jan 8, 2026 Updated last month
- This repository is an implementation of inferring the PaliGemma Vision Language Model on Android using Hugging Face-Gradio Client API for… ☆20 Oct 10, 2024 Updated last year
- Sky LiveKit Agent Perplexica is a local, free solution integrating LiveKit with advanced internet search. It uses a local Perplexica inst… ☆28 Feb 6, 2025 Updated last year