shareAI-lab / alignment-handbook-cnLinks
中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.
☆14Updated last year
Alternatives and similar repositories for alignment-handbook-cn
Users that are interested in alignment-handbook-cn are comparing it to the libraries listed below
Sorting:
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐 证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 11 months ago
- Built on the robust XTuner backend framework, XTuner Chat GUI offers a user-friendly platform for quick and efficient local model inferen…☆13Updated last year
- 大模型智能体Agent中文教程,博客代码仓库☆54Updated 2 months ago
- 一起来养一只拥有专属记忆的AI猫猫吧!☆10Updated last year
- 🤖 基于AutoGen的AI辩论系统 | 🗣️ 支持中文交互 | 🔄 多智能体协作 | 📝 自动记录辩论过程 🤖 AI Debate System based on AutoGen | 🗣️ Chinese Interaction | 🔄 Multi-Age…☆19Updated last year
- An open-source chat text to control actions agentic workflow framework/showcase powered by Agently AI application development framework.☆29Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Updated last year
- In this fast-paced world, we all need a little something to spice up life. Whether you need a glass of sweet talk to lift your spirits or…☆60Updated 7 months ago
- Xtuner Factory☆35Updated last year
- ☆13Updated 9 months ago
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 8 months ago
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆56Updated this week
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆32Updated 6 months ago
- vllm混合推理扩展插件,支持多NUMA混合推理,单卡推理Qwen3-Next模型可达1000+ prefill☆31Updated 2 months ago
- SwanLab Official Documentation | SwanLab官方文档☆22Updated this week
- Recursive Abstractive Processing for Tree-Organized Retrieval☆10Updated last year
- ✨🦋 illufly - 【幻蝶】基于记忆蒸馏、资料检索的自我进化智能体☆75Updated last month
- Examples for QinYan GLMs☆13Updated last year
- 目前各大高校领域将各种信息分布在不同的部门信息门户下,存在典型的信息孤岛问题,各个部门信息没有形成互通。当前,老师和学生存在很多有关本校相关文件、政策和活动等众多方面智能问答的统一入口的需求,例如财务处、人事处、学工处、教务处、图书馆等存在各种政策和文件规定,目前在校师生都…☆34Updated last year
- AgileGen: Empowering Agile-Based Generative Software Development through Human-AI Teamwork (accepted by ACM TOSEM)☆23Updated last year
- ☆15Updated last year
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆40Updated 3 months ago
- Train deepseek r1-like reasoning LLM with ease | 轻松训练1个deepseek r1类的推理LLM☆17Updated 10 months ago
- ☆17Updated 5 months ago
- 训练自己的中文 Embedding 模型☆27Updated last year
- Finetune Llama 3, Mistral & Gemma LLMs 2-5x faster with 80% less memory☆29Updated last year
- MetaSearch:llm深度研究(deepsearch)功能方案实现☆35Updated 4 months ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆33Updated last year
- 本项目借助飞桨平台,构建起一套创新的多模型协同系统,实现 PDF 文件到 Markdown 文件的高效、精准转换。☆27Updated 9 months ago
- ☆96Updated last year