cnstark / gputaskerLinks
An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star
☆197Updated 3 years ago
Alternatives and similar repositories for gputasker
Users that are interested in gputasker are comparing it to the libraries listed below
Sorting:
- 📊 A simple command-line utility for querying and monitoring GPU status☆91Updated 3 years ago
- Download papers and supplemental materials from open-access paper website, such as AAAI, AAMAS, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, IC…☆293Updated 2 months ago
- 一款便捷的抢占显卡脚本☆393Updated last month
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆129Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆187Updated 2 years ago
- DeepSpeed Tutorial☆106Updated last year
- The record of what I‘ve been through. Now moved to Notion. See link below☆103Updated last year
- ☆183Updated 2 weeks ago
- Cool Papers - Immersive Paper Discovery☆703Updated 5 months ago
- Yet another PyTorch Trainer and some core components for deep learning.☆222Updated last year
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆200Updated 6 months ago
- 多模态 MM +Chat 合集☆282Updated 5 months ago
- A pupil in the computer world.(Felix Fu)☆254Updated last year
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 2 years ago
- ☆74Updated last week
- huggingface mirror download☆589Updated 10 months ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆371Updated last year
- 本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。☆30Updated last month
- How to use wandb?☆694Updated 2 years ago
- 实验室服务器管理☆29Updated 2 years ago
- ☆262Updated 11 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training☆405Updated 6 months ago
- Simple tutorials on Pytorch DDP training☆286Updated 3 years ago
- 青稞Talk☆190Updated 3 weeks ago
- ☆172Updated this week
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..☆81Updated 4 months ago
- 主要记录大语言大模型(LLMs) 算法(应用)工程师多模态相关知识☆266Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆154Updated last month
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆71Updated 2 years ago
- ☆79Updated 2 years ago