A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆42 · Apr 4, 2025 · Updated last year
Alternatives and similar repositories for grpo_code
Users who are interested in grpo_code are comparing it to the libraries listed below.
- Train your own SOTA deductive reasoning model ☆109 · Mar 6, 2025 · Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc… ☆10 · Oct 29, 2025 · Updated 5 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes ☆49 · Aug 7, 2025 · Updated 8 months ago
- Paleograph is a browser-based 3D viewer built for archaeologists who need to explore tabular measurement datasets. ☆52 · Dec 11, 2025 · Updated 4 months ago
- Training tiny models to prove hard theorems ☆72 · Mar 5, 2026 · Updated last month
- ☆34 · Nov 11, 2025 · Updated 5 months ago
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings ☆117 · Jul 27, 2025 · Updated 8 months ago
- ☆18 · Mar 21, 2025 · Updated last year
- ☆21 · Jun 8, 2025 · Updated 10 months ago
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and … ☆13 · Apr 24, 2025 · Updated 11 months ago
- Generate debug symbols on the fly for your RE needs ☆17 · Jul 19, 2022 · Updated 3 years ago
- Retail Search with AI ☆14 · Feb 14, 2026 · Updated 2 months ago
- ☆27 · Nov 13, 2025 · Updated 5 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025) ☆46 · Nov 24, 2025 · Updated 4 months ago
- React CodeGen using GPT ☆12 · Feb 11, 2024 · Updated 2 years ago
- ☆14 · Sep 6, 2022 · Updated 3 years ago
- Historical Language Model for London - A specialized LLM trained on 1500-1850 historical English text ☆29 · Nov 1, 2025 · Updated 5 months ago
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi… ☆10 · Nov 3, 2023 · Updated 2 years ago
- A starter kit for evaluating benchmarks on the 🤗 Hub ☆16 · Updated this week
- This is our final-year Bachelor's project. We try to avoid congestion on two levels, i.e., Intersection level and Infrastructure to Vehicl… ☆10 · Oct 12, 2020 · Updated 5 years ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun… ☆15 · Updated this week
- ☆67 · Mar 25, 2026 · Updated 2 weeks ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models". ☆54 · Dec 28, 2025 · Updated 3 months ago
- Display PDFs in your RAG app ☆21 · Feb 24, 2025 · Updated last year
- Official Code Release for "Training a Generally Curious Agent" ☆45 · May 18, 2025 · Updated 10 months ago
- ☆12 · Jan 3, 2022 · Updated 4 years ago
- Create, edit and convert AI character files for CharacterAI, Pygmalion, Text Generation, KoboldAI and TavernAI ☆23 · Dec 4, 2023 · Updated 2 years ago
- A team of AI agents that answer document related questions (RAG alternative) ☆13 · Apr 16, 2025 · Updated 11 months ago
- Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models. ☆18 · Oct 14, 2023 · Updated 2 years ago
- Deep reinforcement learning in autonomous driving ☆12 · Aug 25, 2021 · Updated 4 years ago
- ☆12 · Feb 23, 2025 · Updated last year
- LogicApps workflows for historic and ongoing document export from SharePoint Online to Azure Storage, for ingesting into AI Search. ☆11 · Jan 8, 2025 · Updated last year
- ☆16 · Nov 13, 2024 · Updated last year
- ☆32 · Jan 17, 2025 · Updated last year
- Power BI AI CV Analysis for Recruitment: Automating Candidate Matching with OpenAI ☆18 · Nov 17, 2024 · Updated last year
- Azure AI Visual Search toolkit ☆15 · Oct 25, 2022 · Updated 3 years ago
- Demo tutorial on programming an autonomous Python bot that plays the GeoGuessr game, using different Vision LLMs with LangChain ☆14 · Oct 22, 2024 · Updated last year
- Wafer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- ☆22 · Oct 22, 2023 · Updated 2 years ago