XuehaiPan / nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
★4,828 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for nvitop
- A simple command-line utility for querying and monitoring GPU status · ★4,066 · Updated 3 months ago
- GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm · ★8,235 · Updated 2 weeks ago
- Fast and memory-efficient exact attention · ★14,279 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. · ★8,660 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. · ★6,299 · Updated this week
- A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… · ★7,958 · Updated this week
- Multi-GPU CUDA stress test · ★1,435 · Updated 3 months ago
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" · ★10,776 · Updated 3 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence · ★9,943 · Updated this week
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best… · ★12,672 · Updated this week
- View model summaries in PyTorch! · ★2,603 · Updated this week
- Transformer related optimization, including BERT, GPT · ★5,890 · Updated 7 months ago
- Mamba SSM architecture · ★13,239 · Updated 2 weeks ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. · ★16,471 · Updated this week
- PyTorch extensions for high performance and large scale training. · ★3,195 · Updated last week
- ★4,035 · Updated 5 months ago
- Build and run containers leveraging NVIDIA GPUs · ★2,472 · Updated this week
- An open source implementation of CLIP. · ★10,344 · Updated last week
- FFCV: Fast Forward Computer Vision (and other ML workloads!) · ★2,867 · Updated 5 months ago
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. · ★20,286 · Updated 3 months ago
- Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more. · ★2,339 · Updated 2 months ago
- Machine learning metrics for distributed, scalable PyTorch applications. · ★2,137 · Updated this week
- Simple, safe way to store and distribute tensors · ★2,900 · Updated 2 weeks ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs… · ★1,979 · Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch · ★8,415 · Updated 2 weeks ago
- Ongoing research training transformer models at scale · ★10,595 · Updated this week
- Official PyTorch Implementation of "Scalable Diffusion Models with Transformers" · ★6,369 · Updated 5 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) · ★8,524 · Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm. · ★4,497 · Updated last month
- A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch. · ★2,328 · Updated last month