从零到一实现一个 miniLLM~(动手学习LLM)
☆79Apr 30, 2024Updated 2 years ago
Alternatives and similar repositories for LLMs-101
Users that are interested in LLMs-101 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 从0到1构建一个MiniLLM (pretrain+sft+dpo实践中)☆549Mar 23, 2025Updated last year
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆502May 1, 2025Updated last year
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- 🎓 降低中文学术写作 AIGC 检测率的 Claude Code Skill | Reduce AIGC detection rate for Chinese academic writing☆79Feb 24, 2026Updated 2 months ago
- 不用搭建环境,解压即用,4G显存可用☆11Mar 1, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chinese license plate recognition☆27Nov 13, 2021Updated 4 years ago
- ☆11Nov 18, 2024Updated last year
- ☆19Feb 25, 2024Updated 2 years ago
- 「PyTorch」A deep matching model library for recommendations & advertising. It's easy to train models and to export representation vectors …☆91Jun 12, 2022Updated 3 years ago
- 数字人+大模型☆26Nov 7, 2023Updated 2 years ago
- 本项目从零开始构建并优化了一个千万参数级别的大规模预训练语言模型,涵盖预训练、有监督微调(SFT)和R1推理蒸馏三个阶段。项目采用自定义Transformer架构(包括RMSNorm、分组注意力、多Query机制、SwiGLU激活和RoPE位置编码),实现高效的长文本处理和…☆21Mar 10, 2025Updated last year
- 用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.☆2,913May 21, 2024Updated last year
- collection of ppt/pdf from security conferences☆19Feb 25, 2026Updated 2 months ago
- ☆61Mar 20, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 从0开始,将chatgpt的技术路线跑一遍。☆280Sep 5, 2024Updated last year
- 从零实现一个小参数量中文大语言模型。☆1,011Aug 22, 2024Updated last year
- ☆17Apr 16, 2021Updated 5 years ago
- The MongoDB Database☆22Dec 7, 2016Updated 9 years ago
- ☆13Jan 23, 2025Updated last year
- cracked prompt of famous coding agent and autodev☆24Mar 19, 2026Updated last month
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,704Apr 20, 2024Updated 2 years ago
- ☆255Oct 26, 2025Updated 6 months ago
- a autodl environment for native finetune stable diffusion.☆11Dec 7, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- AFFNet-Unofficial Implementation☆15Aug 23, 2023Updated 2 years ago
- 用Numpy复现可训练的LLaMa3☆34Jul 5, 2024Updated last year
- 一款数据标注工具(仿照百度在线标注平台)☆13Jul 5, 2021Updated 4 years ago
- LLM+RAG for QA☆23Jan 15, 2024Updated 2 years ago
- Implements a minimalistic version of Stable Cascade training☆13Oct 24, 2024Updated last year
- Using Low-rank adaptation to quickly fine-tune diffusion models.☆11Mar 14, 2023Updated 3 years ago
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- ☆13Oct 23, 2018Updated 7 years ago
- Linking field-based and airborne data☆11Apr 5, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- introduce AI infra knowledges. 人工智能系统基础架构知识库☆16Jun 4, 2023Updated 2 years ago
- Psy-Insight: Mental Health Oriented Interpretable Multi-turn Bilingual Counseling Dataset for Large Language Model Finetuning☆23Jan 4, 2026Updated 4 months ago
- ☆27Feb 1, 2025Updated last year
- ☆16Jun 8, 2023Updated 2 years ago
- This repository collects recent top papers about causal inference for recommendation. We will keep updating the paper list weekly.☆16May 3, 2022Updated 4 years ago
- ☆87Apr 9, 2024Updated 2 years ago
- LTX-Video-Trainer-GUI 是为LTX视频lora模型训练提供的GUI工具,支持通过简单的界面训练 LoRA 模型用于视频生成。本训练器提供了直观的 GUI 界面,使用户能够轻松设置和启动训练流程,无需编写复杂代码。☆13Jul 18, 2025Updated 9 months ago