anakin87 / qwen-scheduler-grpo
Train a Language Model with GRPO to create a schedule from a list of events and priorities
☆263 · Apr 29, 2025 · Updated 9 months ago
Alternatives and similar repositories for qwen-scheduler-grpo
Users who are interested in qwen-scheduler-grpo are comparing it to the libraries listed below:
- Countdown Game Distill&RL ☆47 · Sep 5, 2025 · Updated 5 months ago
- An end-to-end voice assistant running entirely on Apple Silicon. ☆62 · Nov 29, 2025 · Updated 2 months ago
- Simple repository for training small reasoning models ☆49 · Feb 6, 2025 · Updated last year
- MCP Server for checking domain name availability using WHOIS and DNS via stdio. ☆22 · Oct 24, 2025 · Updated 3 months ago
- This plugin allows the Cheshire Cat to use tools written in the R language ☆10 · Dec 23, 2024 · Updated last year
- The official repository of MM-R5 ☆28 · Jun 22, 2025 · Updated 7 months ago
- ReAct AI Agent from Scratch using DeepSeek: Handling Memory & Tools without Frameworks ☆35 · Feb 18, 2025 · Updated 11 months ago
- Code for calculating the MBTR molecule/crystal structure representation. (https://doi.org/10.1088/2632-2153/aca005) ☆13 · Nov 15, 2022 · Updated 3 years ago
- An MCP-based development documentation server, designed for the documentation of all kinds of development frameworks ☆49 · Mar 31, 2025 · Updated 10 months ago
- AI recipe & grocery list generator ☆14 · Jan 25, 2025 · Updated last year
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes ☆13 · Jul 1, 2025 · Updated 7 months ago
- win32 native frontend for llama-cli ☆12 · Nov 2, 2024 · Updated last year
- Control drones with natural language ☆169 · Jan 23, 2026 · Updated 3 weeks ago
- ☆94 · Jul 7, 2025 · Updated 7 months ago
- Local DeepSearch (Advantage: Low Threshold): an implementation of Agentic RAG based on DeepSeek-R1 API and Tavily API ☆17 · Jun 21, 2025 · Updated 7 months ago
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want) ☆402 · Jan 26, 2026 · Updated 3 weeks ago
- LettuceDetect is a hallucination detection framework for RAG applications. ☆531 · Sep 9, 2025 · Updated 5 months ago
- Auto Thinking Mode switch for Qwen3 in Open webui ☆70 · May 8, 2025 · Updated 9 months ago
- A CI/CD tool that automatically captures code changes, generates mobile-optimized HTML diffs, uploads them to cloud storage, and sends no… ☆27 · Sep 9, 2025 · Updated 5 months ago
- Fine-tuning embedding models. ☆14 · Nov 25, 2024 · Updated last year
- (WIP) Open Knowledge Layer (OpenKL): The Advance Knowledge and Memory for Personal Agents. ☆82 · Sep 19, 2025 · Updated 4 months ago
- Free AI coding in chatbots ☆1,334 · Updated this week
- RAG template for enterprise or organization ☆35 · Jan 2, 2025 · Updated last year
- ☆45 · May 4, 2025 · Updated 9 months ago
- ☆20 · May 20, 2025 · Updated 8 months ago
- A no-brainer solution to turning your Obsidian PKM into a Zola site. Forked. ☆22 · Jul 5, 2025 · Updated 7 months ago
- RWKV-based Text-to-Speech implementation in Rust ☆26 · Oct 14, 2025 · Updated 4 months ago
- ☆247 · Jun 6, 2025 · Updated 8 months ago
- ☆108 · Jan 25, 2026 · Updated 3 weeks ago
- Moss: A voice assistant using LLM and Langchain which can control your home assistant and chat more. ☆31 · Jul 12, 2025 · Updated 7 months ago
- ☆20 · Mar 25, 2025 · Updated 10 months ago
- The objective of this project is to demonstrate how to fine-tune deepseek-r1-distill-llama-8b. ☆16 · Feb 19, 2025 · Updated 11 months ago
- Mentis: A powerful multi-agent orchestration framework built on LangGraph. ☆294 · May 16, 2025 · Updated 9 months ago
- ☆28 · Oct 22, 2024 · Updated last year
- A virtual agent for your virtual books📚 ☆48 · May 18, 2025 · Updated 8 months ago
- ☆762 · Dec 23, 2025 · Updated last month
- Library for model distillation ☆162 · Sep 6, 2025 · Updated 5 months ago
- Functionally, a combination of a Claude Code web UI and frp that simplifies configuration and deployment ☆63 · Nov 7, 2025 · Updated 3 months ago
- YOLOLite: lightweight YOLO in PyTorch. ONNX export + CPU inference (Raspberry Pi friendly). ☆57 · Feb 2, 2026 · Updated 2 weeks ago