XuehaiPan / nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
☆5,839 · Updated this week
Alternatives and similar repositories for nvitop
Users who are interested in nvitop are comparing it to the repositories listed below
- A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,010 · Updated this week
- Fast and memory-efficient exact attention ☆18,776 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,821 · Updated last week
- Accessible large language models via k-bit quantization for PyTorch. ☆7,450 · Updated this week
- View model summaries in PyTorch! ☆2,830 · Updated last week
- A simple command-line utility for querying and monitoring GPU status ☆4,242 · Updated 4 months ago
- Development repository for the Triton language and compiler ☆16,484 · Updated this week
- Build and run containers leveraging NVIDIA GPUs ☆3,515 · Updated this week
- Ongoing research training transformer models at scale ☆13,130 · Updated this week
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,101 · Updated last month
- Multi-GPU CUDA stress test ☆1,831 · Updated 11 months ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆19,252 · Updated this week
- Simple, safe way to store and distribute tensors ☆3,380 · Updated last week
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm ☆9,426 · Updated last month
- Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,016 · Updated this week
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,742 · Updated last month
- Transformer related optimization, including BERT, GPT ☆6,267 · Updated last year
- PyTorch extensions for high performance and large scale training. ☆3,352 · Updated 3 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆12,511 · Updated 7 months ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,955 · Updated last year
- The Fast Cross-Platform Package Manager ☆7,575 · Updated this week
- Large Language Model Text Generation Inference ☆10,413 · Updated last week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,670 · Updated 2 weeks ago
- Train transformer language models with reinforcement learning. ☆14,989 · Updated this week
- PyTorch native post-training library ☆5,399 · Updated this week
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆3,413 · Updated this week
- An open source implementation of CLIP. ☆12,360 · Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla… ☆2,627 · Updated this week
- Machine learning metrics for distributed, scalable PyTorch applications. ☆2,325 · Updated this week
- A tool for enriching the output of nvidia-smi. ☆566 · Updated last year