cnstark / gputaskerLinks
An awesome gpu tasks scheduler. 轻量好用的GPU机群任务调度工具。觉得有用可以点个star
☆196Updated 3 years ago
Alternatives and similar repositories for gputasker
Users that are interested in gputasker are comparing it to the libraries listed below
Sorting:
- 📊 A simple command-line utility for querying and monitoring GPU status☆91Updated 3 years ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆185Updated 2 years ago
- 一款便捷的抢占显卡脚本☆392Updated last month
- pytorch单精度、半精度、混合精度、单卡、多卡(DP / DDP)、FSDP、DeepSpeed模型训练代码,并对比不同方法的训练速度以及GPU内存的使用☆129Updated last year
- The record of what I‘ve been through. Now moved to Notion. See link below☆102Updated last year
- DeepSpeed Tutorial☆105Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments.☆92Updated 3 years ago
- Yet another PyTorch Trainer and some core components for deep learning.☆222Updated last year
- ☆261Updated 10 months ago
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025]☆198Updated 6 months ago
- Cool Papers - Immersive Paper Discovery☆694Updated 4 months ago
- The pure and clear PyTorch Distributed Training Framework.☆275Updated 2 years ago
- Download papers and supplemental materials from open-access paper website, such as AAAI, AAMAS, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, IC…☆291Updated last month
- mindspore implementation of transformers☆68Updated 2 years ago
- ☆181Updated last week
- Python debug configuration generator for vscode☆29Updated 4 years ago
- Pure Pytorch Docker Images.☆481Updated last year
- ☆72Updated this week
- ☆28Updated 2 years ago
- 青稞Talk☆189Updated this week
- ☆172Updated this week
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探 讨以及实现与大模型相关的各种技术、原理和应用。☆367Updated last year
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆152Updated last month
- How to use wandb?☆692Updated 2 years ago
- ☆79Updated 2 years ago
- Simple tutorials on Pytorch DDP training☆286Updated 3 years ago
- Efficient Mixture of Experts for LLM Paper List☆156Updated 3 months ago
- ☆61Updated last year
- 本项目提供了基于910B的huggingface LLM模型的Tensor Parallel(TP)部署教程,同时也可以作为一份极简的TP学习代码。☆30Updated 2 weeks ago
- 更纯粹、更高压缩率的Tokenizer☆488Updated last year