A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆42Apr 4, 2025Updated last year
Alternatives and similar repositories for grpo_code
Users that are interested in grpo_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆50Aug 7, 2025Updated 10 months ago
- ☆35Nov 11, 2025Updated 7 months ago
- Training tiny models to prove hard theorems☆77Mar 5, 2026Updated 3 months ago
- Pytorch code for NeurIPS 2025 paper "Accurate and Efficient Low-Rank Model Merging in Core Space"☆40Feb 2, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Jun 8, 2025Updated last year
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated last year
- Retail Search with AI☆14Feb 14, 2026Updated 4 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆47Nov 24, 2025Updated 6 months ago
- ☆28Nov 13, 2025Updated 7 months ago
- React CodeGen using GPT☆12Feb 11, 2024Updated 2 years ago
- Code for the SofT-GRPO algorithm on the LLM soft-thinking reasoning pattern.☆52Jan 2, 2026Updated 5 months ago
- Historical Language Model for London - A specialized LLM trained on 1500-1850 historical English text☆32Nov 1, 2025Updated 7 months ago
- OpenHFT is a python application that enables retail traders to deploy quantitative trading strategies for indian markets☆13Aug 15, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A starter kit for evaluating benchmarks on the 🤗 Hub☆16Apr 8, 2026Updated 2 months ago
- Extracting-Data-from-PDFs-with-Local-LLM☆16Nov 1, 2024Updated last year
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆18Apr 14, 2026Updated 2 months ago
- ☆21Sep 6, 2021Updated 4 years ago
- Official Code Release for "Training a Generally Curious Agent"☆47May 18, 2025Updated last year
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆58Dec 28, 2025Updated 5 months ago
- A team of AI agents that answer document related questions (RAG alternative)☆14Apr 16, 2025Updated last year
- Fuel innovation and advance language models with HomoScriptor: A vibrant, community-driven dataset for fine-tuning large language models.☆18Oct 14, 2023Updated 2 years ago
- ☆26Mar 30, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Frontier Open-Source Text-to-Speech☆40Dec 16, 2025Updated 5 months ago
- LogicApps workflows for historic and ongoing document export from SharePoint Online to Azure Storage, for ingesting into AI Search.☆11Jan 8, 2025Updated last year
- Get insights from your research papers with LlamaExtract☆29Aug 8, 2025Updated 10 months ago
- ☆16Nov 13, 2024Updated last year
- Contextrie curates what each agent sees so tasks stay sharp from step one to step one thousand.☆58Apr 3, 2026Updated 2 months ago
- Instruct-tune LLaMA on consumer hardware and qa inference with llama_index☆24Apr 2, 2023Updated 3 years ago
- ☆34Jan 17, 2025Updated last year
- Power BI AI CV Analysis for Recruitment: Automating Candidate Matching with OpenAI☆18Nov 17, 2024Updated last year
- Azure AI Visual Search toolkit☆15Oct 25, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Demo tutorial on how to program in Python an autonomous bot that plays the GeoGuessr game, using different Vision LLMs with LangChain☆14Oct 22, 2024Updated last year
- Reproduction of paper: AutoAugment: Learning Augmentation Strategies from Data☆16Jun 7, 2020Updated 6 years ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 3 years ago
- A quick glimpse in the Swedish government's remisser☆15Apr 25, 2024Updated 2 years ago
- Sample application demonstrates how to use of Vanilla AI Agents framework to build a basic call center in the context of a generic TelCo …☆20Mar 26, 2026Updated 2 months ago
- NeuroBLAST v3 architecture code☆37Jan 6, 2026Updated 5 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆105Feb 26, 2026Updated 3 months ago