shareAI-lab / alignment-handbook-cn
Chinese edition of the hf-alignment-handbook: a complete set of LLM training tutorials covering SFT, DPO, ORPO, and CPT.
☆14Updated last year
Alternatives and similar repositories for alignment-handbook-cn
Users who are interested in alignment-handbook-cn are comparing it to the libraries listed below
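- The simplest reproduction of R1 results on small models, illustrating the most important essence of O1-like models and DeepSeek R1. Think is all your need. Uses experiments to support the claim that, for strong reasoning ability, the process content of thinking is the core of AGI/ASI.☆44Updated 10 months ago
- MCP DeepResearch Server: a deep research server built on LangGraph + Ollama + Tavily, with support for asynchronous execution, timeout control, and progress push☆32Updated 5 months ago
- Chinese tutorial on LLM agents; code repository for the blog☆53Updated last month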
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 7 months ago
- 🤖 AI debate system based on AutoGen | 🗣️ Supports Chinese interaction | 🔄 Multi-agent collaboration | 📝 Automatically records the debate process☆19Updated last year
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- MetaSearch: an implementation of LLM deep research (deepsearch) functionality☆34Updated 3 months ago
- ☆13Updated 8 months ago
- Let's raise an AI cat with its own dedicated memory!☆10Updated last year
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆17Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆60Updated 6 months ago
- Intelligent voice intercom based on LLM-generated content☆10Updated last year
- ☆15Updated last year
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- ☆17Updated 4 months ago
- [2025 - Shanghai AI Laboratory InternLM (书生) training camp: top-ten and outstanding projects]☆39Updated 2 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
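- A Langchain-based RAG knowledge base system for academic papers☆17Updated last year
- A web UI for MCP: the client can connect to multiple SSE servers, and LLMs such as OpenAI, DeepSeek, and Qwen are supported. Also includes complete, simple weather-query examples of an agent built over stdio and SSE☆37Updated 6 months ago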
- A simple and well-tailored LLM application framework that enables you to seamlessly integrate LLM capabilities in the most "Code-Centric"…☆69Updated this week
- ☆95Updated last year
- ☆10Updated this week
- A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset; a mindmap can be output using …☆39Updated 3 weeks ago
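- An AI agent for real-time trending-topic and public-opinion analysis on the BiliBili website☆84Updated last year
- ✨🦋 illufly (幻蝶) - a self-evolving agent based on memory distillation and document retrieval☆75Updated 3 weeks ago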
- ☆16Updated last year
- ☆45Updated 7 months ago
- A minimal LLM sales agent framework for sales agent fast deployment and benchmark. Support OpenAI models, Claude, HuggingFace models, Gem…☆19Updated last year
- An image search engine and management system built on multimodal vector models and visual multimodal models, providing accurate text-to-text, text-to-image, and image-to-image retrieval.☆75Updated 2 months ago
- Automatically tracks hot GitHub repos, updated daily☆49Updated this week