SwanHubX / SwanLabLinks
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / Swift / Ultralytics / veRL / MMEngine / Keras etc.
☆2,037Updated this week
Alternatives and similar repositories for SwanLab
Users that are interested in SwanLab are comparing it to the libraries listed below
Sorting:
- 🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆3,865Updated 2 months ago
- 复现大模型相关算法及一些学习记录☆1,739Updated last week
- 《大模型白盒子构建指南》:一个全手搓的Tiny-Universe☆3,150Updated last month
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆781Updated this week
- 从无名小卒到大模型(LLM)大英雄~ 欢迎关注后续!!!☆1,424Updated 2 months ago
- 《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣☆3,724Updated 2 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆1,203Updated this week
- 从零实现一个 llama3 中文版☆913Updated last year
- 每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈☆3,351Updated 3 weeks ago
- minimal-cost for training 0.5B R1-Zero☆743Updated last month
- ☆958Updated last month
- 📚 从零开始的大语言模型原理与实践教程☆3,942Updated this week
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆2,764Updated this week
- Align Anything: Training All-modality Model with Feedback☆4,085Updated 3 weeks ago
- ✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction☆2,334Updated 3 months ago
- Reproduce R1 Zero on Logic Puzzle☆2,361Updated 3 months ago
- DeepSeek 系列工作解读、扩展和复现。☆658Updated 2 months ago
- Distributed RL System for LLM Reasoning☆1,837Updated this week
- 🧑🚀 全世界最好的LLM资料总结(视频生成、Agent、辅助编程、数据处理、模型训练、模型推理、o1 模型、MCP、小语言模型、视觉语言模型) | Summary of the world's best LLM resources.☆5,491Updated last week
- 从零实现一个小参数量中文大语言模型。☆709Updated 10 months ago
- A very simple GRPO implement for reproducing r1-like LLM thinking.☆1,130Updated 2 months ago
- A powerful tool for creating fine-tuning datasets for LLM☆8,785Updated last week
- LLM&VLM Tutorial☆1,833Updated last month
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆703Updated 3 months ago
- Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-…☆8,249Updated this week
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,838Updated 5 months ago
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…☆3,209Updated 2 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆685Updated 2 months ago
- Practice to LLM.☆1,442Updated last month
- 这是一个从头训练大语言模型的项目,包括预训练、微调和直接偏好优化,模型拥有1B参数,支持中英文。☆437Updated 4 months ago