XuehaiPan / nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
☆6,556 Updated last week
Alternatives and similar repositories for nvitop
Users who are interested in nvitop compare it to the libraries listed below.
- Multi-GPU CUDA stress test ☆2,093 Updated 3 months ago
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm ☆10,119 Updated this week
- Fast and memory-efficient exact attention ☆22,113 Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆10,326 Updated last week
- View model summaries in PyTorch! ☆2,901 Updated 2 weeks ago
- 📊 A simple command-line utility for querying and monitoring GPU status ☆4,339 Updated 10 months ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,486 Updated this week
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N… ☆4,702 Updated last month
- Development repository for the Triton language and compiler ☆18,387 Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆7,939 Updated 3 weeks ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,395 Updated 2 weeks ago
- A conda-forge distribution. ☆9,313 Updated this week
- Transformer related optimization, including BERT, GPT ☆6,392 Updated last year
- PyTorch extensions for high performance and large scale training. ☆3,397 Updated 9 months ago
- Simple, safe way to store and distribute tensors ☆3,619 Updated last week
- Python bindings for llama.cpp ☆9,971 Updated 5 months ago
- The Fast Cross-Platform Package Manager ☆7,908 Updated 2 weeks ago
- ☆4,112 Updated last year
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. ☆2,981 Updated last week
- Machine learning metrics for distributed, scalable PyTorch applications. ☆2,405 Updated last week
- Build and run containers leveraging NVIDIA GPUs ☆4,074 Updated this week
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆5,800 Updated this week
- Inference Llama 2 in one file of pure C ☆19,162 Updated last year
- Ongoing research training transformer models at scale ☆15,162 Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H… ☆3,152 Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models. ☆23,439 Updated this week
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch ☆2,184 Updated last year
- Tensor library for machine learning ☆13,923 Updated this week
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration ☆3,436 Updated 6 months ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. ☆3,355 Updated 8 months ago