NVIDIA / tensorflow
An Open Source Machine Learning Framework for Everyone
☆1,145 Updated 9 months ago
Alternatives and similar repositories for tensorflow
Users who are interested in tensorflow are comparing it to the libraries listed below.
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it ☆582 Updated 2 weeks ago
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla… ☆2,507 Updated last week
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,785 Updated this week
- TensorFlow/TensorRT integration ☆743 Updated last year
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance. ☆957 Updated 3 weeks ago
- CUDA Library Samples ☆1,993 Updated last week
- The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem. ☆1,565 Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,385 Updated this week
- Optimized primitives for collective multi-GPU communication ☆3,814 Updated last week
- A unified library of state-of-the-art model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. … ☆1,006 Updated last week
- Enabling PyTorch on XLA Devices (e.g. Google TPU) ☆2,626 Updated this week
- Build and run containers leveraging NVIDIA GPUs ☆3,365 Updated this week
- CUDA Core Compute Libraries ☆1,711 Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆11,773 Updated this week
- ONNX Optimizer ☆725 Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala. ☆629 Updated last week
- NCCL Tests ☆1,156 Updated 3 weeks ago
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆402 Updated last month
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,053 Updated last year
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description. ☆989 Updated 9 months ago
- Transformer related optimization, including BERT, GPT ☆6,219 Updated last year
- NVIDIA container runtime library ☆974 Updated this week
- This repository contains tutorials and examples for Triton Inference Server ☆724 Updated 2 weeks ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters. ☆821 Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser") ☆339 Updated this week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit ☆7,655 Updated last month
- A machine learning compiler for GPUs, CPUs, and ML accelerators ☆3,294 Updated this week
- CUDA Python: Performance meets Productivity ☆2,790 Updated this week
- common in-memory tensor structure ☆1,019 Updated 2 weeks ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads. ☆873 Updated 5 months ago