NVIDIA / aistoreLinks

AIStore: scalable storage for AI applications

☆1,546

Alternatives and similar repositories for aistore

Users that are interested in aistore are comparing it to the libraries listed below

Sorting:

NVIDIA / deepops
Tools for building GPU clusters
☆1,369Updated last week
pytorch / gloo
Collective communications library with various primitives for multi-machine training.
☆1,323Updated 3 weeks ago
pytorch / torchx
TorchX is a universal job launcher for PyTorch applications. TorchX is designed to have fast iteration time for training/research and sup…
☆369Updated last week
rapidsai / cuvs
cuVS - a library for vector search and clustering on the GPU
☆455Updated this week
rapidsai / raft
RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-a…
☆907Updated this week
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆826Updated last week
pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆959Updated last week
google / tensorstore
Library for reading and writing large multi-dimensional arrays.
☆1,421Updated last week
NVIDIA / DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
☆542Updated 2 months ago
triton-inference-server / model_analyzer
Triton Model Analyzer is a CLI tool to help with better understanding of the compute and memory requirements of the Triton Inference Serv…
☆479Updated last month
pytorch / elastic
PyTorch elastic training
☆728Updated 3 years ago
dmlc / dlpack
common in-memory tensor structure
☆1,026Updated last month
pytorch / torcharrow
High performance model preprocessing library on PyTorch
☆649Updated last year
NVIDIA / enroot
A simple yet powerful tool to turn traditional container/OS images into unprivileged sandboxes.
☆788Updated 6 months ago
rapidsai / rmm
RAPIDS Memory Manager
☆594Updated last week
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,053Updated last year
tensorflow / runtime
A performant and modular runtime for TensorFlow
☆758Updated 2 months ago
triton-inference-server / pytriton
PyTriton is a Flask/FastAPI-like interface that simplifies Triton's deployment in Python environments.
☆804Updated last week
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,848Updated 3 weeks ago
AI-Hypercomputer / JetStream
JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs wel…
☆354Updated last month
run-ai / genv
GPU environment and cluster management with LLM support
☆613Updated last year
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,337Updated 2 months ago
NVIDIA / libnvidia-container
NVIDIA container runtime library
☆982Updated 2 weeks ago
NVIDIA / pyxis
Container plugin for Slurm Workload Manager
☆354Updated 8 months ago
ray-project / kuberay
A toolkit to run Ray applications on Kubernetes
☆1,864Updated this week
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…
☆2,529Updated last week
triton-inference-server / model_navigator
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
☆206Updated 2 months ago
NVIDIA / nccl-tests
NCCL Tests
☆1,171Updated last month
uber / fiber
Distributed Computing for AI Made Simple
☆1,044Updated 2 years ago
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,399Updated this week