A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.
☆41Apr 4, 2025Updated 11 months ago
Alternatives and similar repositories for grpo_code
Users that are interested in grpo_code are comparing it to the libraries listed below
Sorting:
- Train your own SOTA deductive reasoning model☆107Mar 6, 2025Updated 11 months ago
- ☆41Apr 30, 2025Updated 10 months ago
- Training tiny models to prove hard theorems☆29Feb 15, 2026Updated 2 weeks ago
- ☆21Sep 6, 2021Updated 4 years ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆43Aug 7, 2025Updated 6 months ago
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Jul 1, 2024Updated last year
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆34Apr 5, 2025Updated 10 months ago
- ☆23Updated this week
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- ☆39Aug 1, 2025Updated 7 months ago
- Agentic Learning Powered by AWorld☆90Feb 13, 2026Updated 2 weeks ago
- ☆36Feb 20, 2024Updated 2 years ago
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings☆115Jul 27, 2025Updated 7 months ago
- 参考《上海交通大学生存手册》开源☆16Sep 25, 2024Updated last year
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 2 weeks ago
- ☆28Dec 4, 2025Updated 2 months ago
- Repository of IPBench☆19Jan 4, 2026Updated 2 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- Python Telegraph api.☆15Mar 22, 2025Updated 11 months ago
- 知予人工智能:从学习者到研究者☆13Jan 20, 2025Updated last year
- Official Python SDK for SwastikAI☆11Nov 15, 2024Updated last year
- Use the knowledge graph generated by GraphRAG as the external knowledge base for the Dify workflow.☆21Jun 4, 2025Updated 8 months ago
- Extract annotated misspellings from MIMIC-III.☆13Dec 17, 2020Updated 5 years ago
- ☆34Nov 11, 2025Updated 3 months ago
- A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gp…☆16Mar 11, 2025Updated 11 months ago
- 🤖AI Agents for Financial Trading💰: LLM-Driven Stock Prediction & Investment Recommendation System☆13Apr 14, 2025Updated 10 months ago
- ☆11Jul 17, 2023Updated 2 years ago
- GBM implementation on Legate☆14Jan 28, 2026Updated last month
- LangReact 是一个配置化的 Planning Agent 应用开发工具,通过配置、插件,能快速为你的 GPT 应用提供 Planning 功能。☆12Apr 23, 2024Updated last year
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 10 months ago
- Applescripts for controlling Spotify☆23Oct 20, 2016Updated 9 years ago
- ☆12Jun 28, 2024Updated last year
- ☆10Dec 29, 2023Updated 2 years ago
- This is a fork from Ryan Carson's AI Dev Tasks repository, with some code cleanup and refactoring to enable support for PostgreSQL databa…☆15Sep 8, 2025Updated 5 months ago
- A small framework to benchmark forecasting models via backtesting☆13Nov 25, 2023Updated 2 years ago
- A universal skills runtime framework SDK for building, deploying, and executing modular capabilities across diverse environments.☆27Updated this week