SwanHubX / SwanLabLinks
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultralytics / MMEngine / Keras etc.
☆2,597Updated this week
Alternatives and similar repositories for SwanLab
Users that are interested in SwanLab are comparing it to the libraries listed below
Sorting:
- 复现大模型相关算法及一些学习记录☆2,166Updated last week
- 🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆4,663Updated 4 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆1,654Updated this week
- ☆1,050Updated this week
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆863Updated 3 weeks ago
- 从零实现一个 llama3 中文版☆962Updated last year
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆584Updated 6 months ago
- 从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!☆1,653Updated 5 months ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆3,552Updated 2 weeks ago
- 🌟100+ 原创 LLM / RL 原理图📚,《大模型算法》作者巨献!💥(100+ LLM/RL Algorithm Maps )☆1,334Updated last month
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆754Updated 6 months ago
- 个人构建MoE大模型:从预训练到DPO的完整实践☆1,127Updated this week
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆3,727Updated 2 weeks ago
- 从零实现一个小参数量中文大语言模型。☆813Updated last year
- 制作懂人情世故的大语言模型 | 涵盖提示词工程、RAG、Agent、LLM微调教程☆1,526Updated 4 months ago
- Distributed RL System for LLM Reasoning☆2,569Updated last week
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,601Updated last year
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (…☆9,792Updated this week
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆3,947Updated 2 weeks ago
- DeepSeek 系列工作解读、扩展和复现。☆675Updated 5 months ago
- 利用HuggingFace的官方下载工具从镜像网站进行高速下载。☆1,200Updated 11 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,396Updated 5 months ago
- 中文翻译的 Hands-On-Large-Language-Models (hands-on-llms),动手学习大模型☆1,332Updated 2 months ago
- 大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算 法面试"、"大模型应用基础"☆1,255Updated last month
- A comprehensive library for implementing LLMs, including a unified training pipeline and comprehensive model evaluation.☆836Updated 3 months ago
- 每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈☆4,211Updated 3 months ago
- Transformers 库快速入门教程☆1,657Updated 11 months ago
- LLM全栈优质资源汇总☆629Updated 2 months ago
- 中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3☆1,939Updated 11 months ago
- Chat-甄嬛是利用《甄嬛传》剧本中所有关于甄嬛的台词和语句,基于ChatGLM2进行LoRA微调得到的模仿甄嬛语气的聊天语言模型。☆732Updated 3 months ago