shareAI-lab / alignment-handbook-cnLinks
中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.
☆13Updated last year
Alternatives and similar repositories for alignment-handbook-cn
Users that are interested in alignment-handbook-cn are comparing it to the libraries listed below
Sorting:
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 8 months ago
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆35Updated last month
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆59Updated 5 months ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆47Updated 2 weeks ago
- Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM☆17Updated 8 months ago
- Xtuner Factory☆34Updated last year
- ☆15Updated last year
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Updated 4 months ago
- 一个面向多模态大模型训 练的智能数据集构建与评估平台☆129Updated last month
- ☆43Updated 5 months ago
- ☆95Updated 10 months ago
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Updated last year
- Yet Another Papers With Code☆35Updated last month
- ☆29Updated last month
- ☆13Updated 7 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 5 months ago
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆74Updated 4 months ago
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆28Updated last year
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated 11 months ago
- 一个简单的恰到好处LLM应用框架,能够让你以最“Code Center“的方式无缝集成LLM能力。LLM As Function, Prompt As Code☆69Updated this week
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆54Updated this week
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- Real-time video understanding and interaction through text,audio,image and video with large multi-modal model. 利用多模态大模型的实时视频理解和交互框架,通过文本…☆25Updated last year
- Tracking the hot Github repos and update daily 每 天自动追踪Github热门项目☆49Updated last week
- Examples for QinYan GLMs☆13Updated last year
- MLLM @ Game☆14Updated 5 months ago
- Zero-human, cold-start construction of long-chain agents in professional domains☆43Updated 6 months ago
- MetaSearch:llm深度研究(deepsearch)功能方案实现☆32Updated 2 months ago