☆13Mar 16, 2025Updated 11 months ago
Alternatives and similar repositories for GRPO-R1
Users that are interested in GRPO-R1 are comparing it to the libraries listed below
Sorting:
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Jun 16, 2025Updated 8 months ago
- 🤗 HF Downloader (Hugging Face Downloader) 📦 A user-friendly GUI tool for downloading Hugging Face resources with enhanced connectivity…☆13Jan 5, 2025Updated last year
- AI Demo 项目,一个专门为希望学习和探索人工智能(AI)技术的开发者准备的实战案例集合。☆25Jan 3, 2026Updated last month
- llms related stuff , including code, docs☆13Feb 25, 2025Updated last year
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆34Jul 7, 2024Updated last year
- KDD2024-WhoIsWho-Top3☆16Jun 17, 2024Updated last year
- Submission Guide + Discussion Board for AI Singapore Global Challenge for Safe and Secure LLMs (Track 1A).☆16Jul 4, 2024Updated last year
- 为 AstrBot 提供一种 Deepresearch 方案☆25Aug 5, 2025Updated 6 months ago
- ☆11Updated this week
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- 大模型API企业网关,公司内部API管理,分发聚和系统,支持将多种大模型转换成统一的OpenAI兼容接口,尤其对国内开源模型deepseek,qwen,kimi,glm 提供特别支持 可供个人或者企业内部大模型API统一管理和渠道分发使用(key管理与二次分发),长期更新,支…☆36Sep 12, 2025Updated 5 months ago
- [Findings of EMNLP22] From Mimicking to Integrating: Knowledge Integration for Pre-Trained Language Models☆19Mar 16, 2023Updated 2 years ago
- QGEval: A Benchmark for Question Generation Evaluation☆19Nov 7, 2024Updated last year
- This repo offers advanced tutorials for LLMs, BERT-based models, and multimodal models, covering fine-tuning, quantization, vocabulary ex…☆24May 5, 2025Updated 9 months ago
- Dify DSL collection收集Dify工作流文件DSL,这里很多文件并不是本人原创,而是收集而来,感谢原作者。目前我是初学github,后面会加入大量原创内容☆25Jul 13, 2025Updated 7 months ago
- ☆19Jun 25, 2024Updated last year
- 服务端:Git开源项目Dify,前端:Dify-web。是我自己结合AI开发的一个web应 用,目前支持:流式输出、文字转语音、markdown、上下文持续对话。其它功能接口待补充,欢迎一起学习交流完善!☆42Jul 18, 2025Updated 7 months ago
- It is a simple demo of chatDB workflow in dify.☆24Dec 7, 2024Updated last year
- 使用bert进行中文方面级情感识别。☆25Jun 26, 2023Updated 2 years ago
- BLOOM 模型的指令微调☆24Jun 15, 2023Updated 2 years ago
- 🌈 NERpy: Implementation of Named Entity Recognition using Python. 命名实体识别工具,支持BertSoftmax、BertSpan等模型,开箱即用。☆117Feb 19, 2024Updated 2 years ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- ☆22Feb 14, 2026Updated 2 weeks ago
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- 基于 Dify 构建的高级搜索工具☆31Aug 22, 2024Updated last year
- 大语言模型应用:RAG、NL2SQL、聊天机器人、预训练、MOE混合专家模型、微调训练、强化学习、天池数据竞赛☆75Feb 10, 2025Updated last year
- TianGong-AI-Unstructure☆71Feb 4, 2026Updated 3 weeks ago
- 基于python的BM25文本匹配算法实现☆34Apr 17, 2022Updated 3 years ago
- 基于Qwen2+SFT+DPO的医疗问答系统,项目中使用了自定义的 SFTTrainer/DPOTrainer/TRPOTrainer用于训练,其次,项目还调用各种知识库工具(neo4j, milvus, LDA, 等)进行自动化训练数据生成。另外,使用 vllm 用于推理…☆61Jan 4, 2026Updated last month
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- ☆28Dec 4, 2025Updated 2 months ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- 一个基于FastAPI和React的智能体系统,支持多智能体管理、mcp管理、知识库、聊天对话等功能。An intelligent agent system based on FastAPI and React, supporting multi-agent managem…☆21Jan 25, 2026Updated last month
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆27Feb 13, 2026Updated 2 weeks ago
- Y-Agent Studio 是一个面向 企业级应用 的Agent开发套,Y-Agent是其中的核心模块。 包含了:支持智能体编排、RAG、流程日志、单元测试、流程测试、语料生产等垂直领域非常需要的功能。 智能体编排可以在同一个流程中,同时支持多智能体协作和流程混合编排…☆25Oct 4, 2025Updated 4 months ago
- ☆31Feb 10, 2026Updated 2 weeks ago
- 智能体设计与工作流编排赋能的数据库系统学习平台. Database system learning platform empowered by agent design and workflow orchestration☆51Aug 17, 2025Updated 6 months ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆37Aug 5, 2024Updated last year