A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆41Apr 4, 2025Updated 11 months ago
Alternatives and similar repositories for grpo_code
Users that are interested in grpo_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Train your own SOTA deductive reasoning model☆108Mar 6, 2025Updated last year
- A locally trained model of Stoney Nakoda has been developed and released. You can access the working model here or train your own instanc…☆10Oct 29, 2025Updated 4 months ago
- 训练自己的中文 Embedding 模型☆29Jan 6, 2025Updated last year
- Training tiny models to prove hard theorems☆64Mar 5, 2026Updated 2 weeks ago
- Evaluate Transformers from the Hub 🔥☆14Nov 27, 2023Updated 2 years ago
- ☆41Apr 30, 2025Updated 10 months ago
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 11 months ago
- Retail Search with AI☆14Feb 14, 2026Updated last month
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆45Nov 24, 2025Updated 4 months ago
- React CodeGen using GPT☆12Feb 11, 2024Updated 2 years ago
- AzureAIOBalancer is a Terraform repository for automating the deployment of a load-balanced Azure OpenAI environment across multiple regi…☆10Nov 3, 2023Updated 2 years ago
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- AZ AI DevContainer: Prebuilt AI Developer DevContainer/Codespace Environment including Python, Jupyter, Infra as Code deployment, AI Foun…☆14Mar 14, 2026Updated last week
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆53Dec 28, 2025Updated 2 months ago
- ☆21Sep 6, 2021Updated 4 years ago
- Display PDFs in your RAG app☆21Feb 24, 2025Updated last year
- Official Code Release for "Training a Generally Curious Agent"☆45May 18, 2025Updated 10 months ago
- ☆10Sep 25, 2024Updated last year
- Modular task agnostic training pipeline using LFM2 from Liquid AI with unsloth.☆16Sep 13, 2025Updated 6 months ago
- Source code and instructions for LAB 910 - Declarative Agents: Build Agents for Microsoft 365 Copilot☆14Mar 26, 2025Updated 11 months ago
- LogicApps workflows for historic and ongoing document export from SharePoint Online to Azure Storage, for ingesting into AI Search.☆11Jan 8, 2025Updated last year
- ☆16Nov 13, 2024Updated last year
- looping☆20Jun 10, 2025Updated 9 months ago
- ☆11Updated this week
- ☆32Jan 17, 2025Updated last year
- Azure AI Visual Search toolkit☆15Oct 25, 2022Updated 3 years ago
- NeuroBLAST v3 architecture code☆36Jan 6, 2026Updated 2 months ago
- ☆42Jul 5, 2025Updated 8 months ago
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- This is a visual editor for langgraph workflow. It helps to quickly design and debug the workflow from scratch.☆44Jun 11, 2024Updated last year
- A Streamlit app to add structured tags to a dataset card☆22Jun 30, 2022Updated 3 years ago
- A quick glimpse in the Swedish government's remisser☆15Apr 25, 2024Updated last year
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers☆36Apr 18, 2025Updated 11 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆97Feb 26, 2026Updated 3 weeks ago
- Evaluating Reward Models in Multilingual Settings (ACL Main '25)☆41May 16, 2025Updated 10 months ago
- ☆45Nov 14, 2025Updated 4 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 8 months ago
- ☆39Aug 1, 2025Updated 7 months ago
- use raspberry pi to get real-time mentions(weibo), the mentions will be as the commands to control arduino.☆43May 21, 2013Updated 12 years ago