Projects:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆26,822Updated this week
- World's fastest and most advanced password recovery utility☆20,894Updated last month
- Build and run Docker containers leveraging NVIDIA GPUs☆17,189Updated 9 months ago
- Instant neural graphics primitives: lightning fast NeRF and more☆15,810Updated 5 months ago
- kaldi-asr/kaldi is the official location of the Kaldi project.☆14,115Updated this week
- Open3D: A Modern Library for 3D Data Processing☆11,220Updated this week
- NumPy aware dynamic Python compiler using LLVM☆9,787Updated 3 weeks ago
- CUDA on ??? GPUs☆8,932Updated this week
- cuDF - GPU DataFrame Library☆8,259Updated this week
- NumPy & SciPy for GPU☆8,124Updated this week
- A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other ma…☆7,992Updated this week
- Containers for machine learning☆7,804Updated this week
- Modular ZK(Zero Knowledge) backend accelerated by GPU☆7,777Updated this week
- Go package for computer vision using OpenCV 4 and beyond. Includes support for DNN, CUDA, and OpenCV Contrib.☆6,578Updated this week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆6,087Updated last month
- A flexible framework of neural networks for deep learning☆5,885Updated last year
- OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.☆5,864Updated last week
- Solve puzzles. Learn CUDA.☆5,703Updated 2 weeks ago
- CUDA Templates for Linear Algebra Subroutines☆5,359Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆5,121Updated this week
- [ARCHIVED] The C++ parallel algorithms library. See https://github.com/NVIDIA/cccl☆4,907Updated 7 months ago
- An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.☆4,617Updated last week
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,617Updated this week
- ArrayFire: a general purpose GPU library.☆4,525Updated last week
- ALIEN is a CUDA-powered artificial life simulation program.☆4,462Updated this week
- A PyTorch Library for Accelerating 3D Deep Learning Research☆4,428Updated 3 weeks ago
- cuML - RAPIDS Machine Learning Library☆4,158Updated this week
- HIP: C++ Heterogeneous-Compute Interface for Portability☆3,690Updated this week
- Lightning fast C++/CUDA neural network framework☆3,666Updated 3 weeks ago
- Fast inference engine for Transformer models☆3,218Updated this week
- LightSeq: A High Performance Library for Sequence Processing and Generation☆3,169Updated last year
- Single C file, Realtime CPU/GPU Profiler with Remote Web Viewer☆3,100Updated 3 weeks ago
- Jittor is a high-performance deep learning framework based on JIT compiling and meta-operators.☆3,070Updated last week
- A GPU-powered real-time analytics storage and query engine.☆3,020Updated 2 months ago
- HeavyDB (formerly OmniSciDB)☆2,937Updated last week
- A retargetable MLIR-based machine learning compiler and runtime toolkit.☆2,559Updated this week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,499Updated this week
- Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors☆2,432Updated 6 months ago
- A data-parallel functional programming language☆2,366Updated this week
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,325Updated last week