A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆42 · Apr 4, 2025 · Updated last year
Alternatives and similar repositories for grpo_code
Users who are interested in grpo_code are comparing it to the libraries listed below.
- Train your own SOTA deductive reasoning model ☆110 · Mar 6, 2025 · Updated last year
- ☆35 · Nov 11, 2025 · Updated 6 months ago
- Training tiny models to prove hard theorems ☆77 · Mar 5, 2026 · Updated 2 months ago
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings ☆117 · Jul 27, 2025 · Updated 9 months ago
- Evaluate Transformers from the Hub 🔥 ☆14 · Apr 3, 2026 · Updated last month
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and … ☆13 · Apr 24, 2025 · Updated last year
- Generate debug symbols on the fly for your RE needs ☆17 · Apr 30, 2026 · Updated 3 weeks ago
- ☆27 · Nov 13, 2025 · Updated 6 months ago
- React CodeGen using GPT ☆12 · Feb 11, 2024 · Updated 2 years ago
- Extract Chinese/English QA Data from WikiHow pages. ☆16 · May 21, 2023 · Updated 3 years ago
- Historical Language Model for London - A specialized LLM trained on 1500-1850 historical English text ☆30 · Nov 1, 2025 · Updated 6 months ago
- OpenHFT is a python application that enables retail traders to deploy quantitative trading strategies for indian markets ☆13 · Aug 15, 2025 · Updated 9 months ago
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi… ☆10 · Nov 3, 2023 · Updated 2 years ago
- line-drawer recreates a given image by only drawing it by simple straight lines. Implementation inspired by linify.me and written in Pyth… ☆14 · Aug 31, 2023 · Updated 2 years ago
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun… ☆18 · Apr 14, 2026 · Updated last month
- ☆71 · Mar 25, 2026 · Updated 2 months ago
- ☆15 · Nov 11, 2023 · Updated 2 years ago
- Official Code Release for "Training a Generally Curious Agent" ☆47 · May 18, 2025 · Updated last year
- A complete waste of time ☆15 · Dec 11, 2022 · Updated 3 years ago
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models". ☆56 · Dec 28, 2025 · Updated 4 months ago
- ☆10 · Sep 25, 2024 · Updated last year
- A team of AI agents that answer document related questions (RAG alternative) ☆14 · Apr 16, 2025 · Updated last year
- Version tracking for all public Fabric json schemas ☆14 · Jan 27, 2026 · Updated 3 months ago
- Generate discrete random variates from a set of dynamically weighted elements in Solidity using a forest of trees data structure, based o… ☆12 · Apr 3, 2023 · Updated 3 years ago
- Source code and instructions for LAB 910 - Declarative Agents: Build Agents for Microsoft 365 Copilot ☆15 · Mar 26, 2025 · Updated last year
- ☆12 · Feb 23, 2025 · Updated last year
- ☆16 · Nov 13, 2024 · Updated last year
- Get insights from your research papers with LlamaExtract ☆30 · Aug 8, 2025 · Updated 9 months ago
- ☆34 · Jan 17, 2025 · Updated last year
- Power BI AI CV Analysis for Recruitment: Automating Candidate Matching with OpenAI ☆18 · Nov 17, 2024 · Updated last year
- Demo tutorial on how to program in Python an autonomous bot that plays the GeoGuessr game, using different Vision LLMs with LangChain ☆14 · Oct 22, 2024 · Updated last year
- Waffer-thin FlaskGPT on Vercel. ☆12 · Jun 1, 2023 · Updated 2 years ago
- ☆22 · Oct 22, 2023 · Updated 2 years ago
- A Streamlit app to add structured tags to a dataset card ☆22 · Jun 30, 2022 · Updated 3 years ago
- NeuroBLAST v3 architecture code ☆37 · Jan 6, 2026 · Updated 4 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆103 · Feb 26, 2026 · Updated 3 months ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers ☆36 · Apr 18, 2025 · Updated last year
- An MCP server for Microsoft Azure pricing that goes beyond the Azure Pricing Calculator, with programmatic cost estimates plus FinOps fea… ☆53 · May 17, 2026 · Updated last week
- ☆41 · Apr 30, 2025 · Updated last year