XuehaiPan / nvitopLinks
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
ā5,728Updated last week
Alternatives and similar repositories for nvitop
Users that are interested in nvitop are comparing it to the libraries listed below
Sorting:
- š A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iā¦ā8,941Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction.ā9,729Updated this week
- Fast and memory-efficient exact attentionā18,340Updated last week
- š A simple command-line utility for querying and monitoring GPU statusā4,237Updated 3 months ago
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcommā9,336Updated 3 weeks ago
- Multi-GPU CUDA stress testā1,793Updated 11 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)ā9,042Updated 2 weeks ago
- View model summaries in PyTorch!ā2,824Updated this week
- Accessible large language models via k-bit quantization for PyTorch.ā7,230Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (Nā¦ā4,655Updated 3 months ago
- Development repository for the Triton language and compilerā16,198Updated this week
- Simple, safe way to store and distribute tensorsā3,354Updated 2 weeks ago
- PyTorch extensions for high performance and large scale training.ā3,339Updated 2 months ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.ā2,700Updated last month
- Transformer related optimization, including BERT, GPTā6,248Updated last year
- Machine learning metrics for distributed, scalable PyTorch applications.ā2,312Updated this week
- š¦ Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorchā2,144Updated 7 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"ā12,251Updated 7 months ago
- š Geometric Computer Vision Library for Spatial AIā10,614Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorchā8,724Updated this week
- A concise but complete full-attention transformer with a set of promising experimental features from various papersā5,461Updated this week
- An open source implementation of CLIP.ā12,176Updated last month
- Foundation Architecture for (M)LLMsā3,094Updated last year
- High-speed Large Language Model Serving for Local Deploymentā8,235Updated 5 months ago
- ā4,084Updated last year
- Ongoing research training transformer models at scaleā12,910Updated this week
- NumPy & SciPy for GPUā10,345Updated this week
- Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.ā1,824Updated last month
- FFCV: Fast Forward Computer Vision (and other ML workloads!)ā2,945Updated last year
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.ā4,096Updated 11 months ago