cnstark / gputasker
An awesome GPU task scheduler. A lightweight, easy-to-use task scheduling tool for GPU clusters. If you find it useful, feel free to give it a star.
☆191 Updated 3 years ago
Alternatives and similar repositories for gputasker
Users who are interested in gputasker are comparing it to the libraries listed below.
- 📊 A simple command-line utility for querying and monitoring GPU status ☆92 Updated 2 years ago
- The record of what I've been through. ☆101 Updated 8 months ago
- DeepSpeed tutorial, annotated examples, and study notes (efficient training of large models) ☆178 Updated 2 years ago
- A convenient script for grabbing GPUs ☆364 Updated 8 months ago
- DeepSpeed Tutorial ☆102 Updated last year
- Cool Papers - Immersive Paper Discovery ☆627 Updated last month
- PyTorch model training code for single precision, half precision, mixed precision, single GPU, multi-GPU (DP / DDP), FSDP, and DeepSpeed, with a comparison of training speed and GPU memory usage across the methods ☆122 Updated last year
- Download papers and supplemental materials from open-access paper website, such as AAAI, AAMAS, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, IC… ☆277 Updated this week
- Yet another PyTorch Trainer and some core components for deep learning. ☆223 Updated last year
- Notes mainly covering multimodal knowledge for large language model (LLM) algorithm (application) engineers ☆246 Updated last year
- The pure and clear PyTorch Distributed Training Framework. ☆274 Updated last year
- A light-weight script for maintaining a LOT of machine learning experiments. ☆92 Updated 2 years ago
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails. ☆79 Updated 3 weeks ago
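- ☆63 Updated last month
- 青稞Talk ☆148 Updated 3 weeks ago
- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models. ☆344 Updated last year
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory [COLM2025] ☆189 Updated 2 months ago
- A collection of multimodal (MM) + Chat resources ☆276 Updated last month
- Learn large models through diagrams ☆320 Updated last year
- ☆262 Updated 7 months ago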
- Efficient Mixture of Experts for LLM Paper List ☆132 Updated 2 weeks ago
- Lion and Adam optimization comparison ☆64 Updated 2 years ago
- ☆174 Updated this week
- ☆79 Updated last year
- ☆61 Updated last year
- Simple tutorials on Pytorch DDP training ☆281 Updated 3 years ago
- ☆212 Updated 11 months ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆407 Updated 2 months ago
- How to use wandb? ☆678 Updated 2 years ago
- ☆203 Updated 5 months ago