XuehaiPan / nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
☆5,664 Updated 3 weeks ago
Alternatives and similar repositories for nvitop
Users who are interested in nvitop compare it to the libraries listed below.
- Fast and memory-efficient exact attention ☆18,043 Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,641 Updated this week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆8,875 Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆7,150 Updated last week
- Development repository for the Triton language and compiler ☆15,939 Updated this week
- Simple, safe way to store and distribute tensors ☆3,324 Updated this week
- 📊 A simple command-line utility for querying and monitoring GPU status ☆4,220 Updated 2 months ago
- Multi-GPU CUDA stress test ☆1,741 Updated 10 months ago
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch ☆2,143 Updated 7 months ago
- PyTorch extensions for high performance and large scale training. ☆3,335 Updated 2 months ago
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,336 Updated 8 months ago
- Transformer related optimization, including BERT, GPT ☆6,219 Updated last year
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,658 Updated last week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,647 Updated 2 months ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆18,861 Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆8,690 Updated last week
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm ☆9,252 Updated 2 months ago
- Ongoing research training transformer models at scale ☆12,641 Updated last week
- An open source implementation of CLIP. ☆12,001 Updated 2 weeks ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆13,744 Updated last week
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,942 Updated last year
- NumPy & SciPy for GPU ☆10,293 Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla… ☆2,507 Updated last week
- Training and serving large-scale neural networks with auto parallelization. ☆3,137 Updated last year
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,001 Updated this week
- [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection" ☆8,323 Updated 10 months ago
- Tensor library for machine learning ☆12,712 Updated this week
- View model summaries in PyTorch! ☆2,810 Updated 2 weeks ago
- Large Language Model Text Generation Inference ☆10,249 Updated this week
- Train transformer language models with reinforcement learning. ☆14,366 Updated this week