XuehaiPan / nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
☆6,187 · Updated last week
Alternatives and similar repositories for nvitop
Users who are interested in nvitop are comparing it to the libraries listed below.
- A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,199 · Updated this week
- Fast and memory-efficient exact attention ☆19,864 · Updated this week
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm ☆9,647 · Updated 3 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,215 · Updated 2 months ago
- Development repository for the Triton language and compiler ☆17,154 · Updated this week
- A simple command-line utility for querying and monitoring GPU status ☆4,260 · Updated 6 months ago
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆9,989 · Updated last week
- Simple, safe way to store and distribute tensors ☆3,470 · Updated 2 weeks ago
- Transformer related optimization, including BERT, GPT ☆6,320 · Updated last year
- Multi-GPU CUDA stress test ☆1,931 · Updated last year
- Accessible large language models via k-bit quantization for PyTorch. ☆7,647 · Updated last week
- The Fast Cross-Platform Package Manager ☆7,677 · Updated last week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,682 · Updated 3 weeks ago
- SGLang is a fast serving framework for large language models and vision language models. ☆18,662 · Updated this week
- View model summaries in PyTorch! ☆2,865 · Updated last week
- PyTorch extensions for high performance and large scale training. ☆3,380 · Updated 5 months ago
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,841 · Updated 3 months ago
- Train transformer language models with reinforcement learning. ☆15,818 · Updated this week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra ☆8,559 · Updated 2 weeks ago
- Ongoing research training transformer models at scale ☆13,755 · Updated last week
- Solve puzzles. Learn CUDA. ☆11,530 · Updated last year
- llama3 implementation one matrix multiplication at a time ☆15,172 · Updated last year
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch ☆8,813 · Updated last week
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆3,168 · Updated 4 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" ☆12,789 · Updated 9 months ago
- Mamba SSM architecture ☆16,046 · Updated this week
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)… ☆14,009 · Updated this week
- A tool for enriching the output of nvidia-smi. ☆569 · Updated last year
- Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, 17+ clouds, o… ☆8,807 · Updated this week
- ☆4,100 · Updated last year