NVIDIA / tensorflow
An Open Source Machine Learning Framework for Everyone
☆1,144 · Updated 8 months ago
Alternatives and similar repositories for tensorflow
Users who are interested in tensorflow compare it to the libraries listed below.
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,768 · Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla… ☆2,450 · Updated last week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter… ☆14,298 · Updated 9 months ago
- Optimized primitives for collective multi-GPU communication ☆3,761 · Updated last week
- TensorFlow/TensorRT integration ☆740 · Updated last year
- CUDA Core Compute Libraries ☆1,669 · Updated this week
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it ☆572 · Updated last week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆11,675 · Updated 2 weeks ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,510 · Updated 2 weeks ago
- CUDA Python: Performance meets Productivity ☆2,719 · Updated this week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆871 · Updated 5 months ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters. ☆818 · Updated this week
- AIStore: scalable storage for AI applications ☆1,519 · Updated this week
- Build and run containers leveraging NVIDIA GPUs ☆3,282 · Updated this week
- Transformer related optimization, including BERT, GPT ☆6,179 · Updated last year
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R… ☆2,424 · Updated this week
- CUDA Library Samples ☆1,961 · Updated last week
- Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Inte… ☆717 · Updated this week
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training ☆1,006 · Updated 2 months ago
- ONNX-TensorRT: TensorRT backend for ONNX ☆3,084 · Updated 3 weeks ago
- NCCL Tests ☆1,127 · Updated this week
- Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native. ☆505 · Updated last month
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆388 · Updated last week
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆3,207 · Updated this week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear… ☆5,417 · Updated last week
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,112 · Updated last week
- CUDA Templates for Linear Algebra Subroutines ☆7,631 · Updated this week
- NVIDIA container runtime library ☆961 · Updated last week
- PyTorch extensions for high performance and large scale training. ☆3,328 · Updated last month
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,613 · Updated this week