Oxen-AI / GRPO-With-Cargo-FeedbackLinks

This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback

☆100

Alternatives and similar repositories for GRPO-With-Cargo-Feedback

Users that are interested in GRPO-With-Cargo-Feedback are comparing it to the libraries listed below

Sorting:

beowolx / rensa
High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…
☆192Updated 2 weeks ago
atoma-network / atoma-infer
Fast serverless LLM inference, in Rust.
☆88Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆103Updated 4 months ago
chenwanqq / candle-llava
implement llava using candle
☆15Updated last year
LaurentMazare / mamba.rs
☆130Updated last year
Vaibhavs10 / fast-llm.rs
☆138Updated last year
brendanhogan / picoDeepResearch
☆64Updated 2 months ago
Dan-wanna-M / kbnf
A high-performance constrained decoding engine based on context free grammar in Rust
☆54Updated 2 months ago
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆57Updated 3 months ago
dottxt-ai / outlines-core
Faster structured generation
☆238Updated 2 months ago
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆38Updated 2 years ago
ShelbyJenkins / llm_client
The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes
☆219Updated 2 weeks ago
oxideai / mlx-rs
Unofficial Rust bindings to Apple's mlx framework
☆173Updated this week
huggingface / kernel-builder
👷 Build compute kernels
☆87Updated this week
jimexist / surya-rs
Rust implementation of Surya
☆58Updated 5 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
fbilhaut / gline-rs
Inference engine for GLiNER models, in Rust
☆64Updated last month
Noveum / ai-gateway
Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly a…
☆73Updated 2 months ago
graniet / kheish
Kheish: A multi-role LLM agent for tasks like code auditing, file searching, and more seamlessly leveraging RAG and extensible modules.
☆141Updated 7 months ago
haizelabs / j1-micro
j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.
☆94Updated 2 weeks ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
PrimeIntellect-ai / genesys
☆130Updated 4 months ago
krypticmouse / DSRs
A DSPy rewrite to(not port) Rust
☆48Updated this week
pixelspark / poly
A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust
☆80Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
Narsil / hf-chat
☆26Updated 7 months ago
N8python / mlx-pretrain
A simple MLX implementation for pretraining LLMs on Apple Silicon.
☆83Updated 3 months ago
jlscheerer / xtr-warp
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆152Updated 3 months ago
N8python / n8loom
A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.
☆72Updated 5 months ago
gnp / minbpe-rs
Port of Andrej Karpathy's minbpe to Rust
☆25Updated last year