NVIDIA / tensorflow
An Open Source Machine Learning Framework for Everyone
☆1,152 Updated last week
Alternatives and similar repositories for tensorflow
Users interested in tensorflow are comparing it to the libraries listed below.
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it ☆600 Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla… ☆2,602 Updated this week
- Optimized primitives for collective multi-GPU communication ☆3,923 Updated 2 weeks ago
- CUDA Library Samples ☆2,047 Updated 3 weeks ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,824 Updated this week
- TensorFlow/TensorRT integration ☆743 Updated last year
- Build and run containers leveraging NVIDIA GPUs ☆3,515 Updated this week
- CUDA Core Compute Libraries ☆1,834 Updated this week
- Reference implementations of MLPerf™ inference benchmarks ☆1,432 Updated this week
- The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou… ☆434 Updated this week
- NCCL Tests ☆1,209 Updated 2 weeks ago
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl ☆1,768 Updated last year
- oneAPI Deep Neural Network Library (oneDNN) ☆3,859 Updated this week
- FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/ ☆1,416 Updated this week
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters. ☆845 Updated this week
- Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala. ☆639 Updated this week
- Transformer related optimization, including BERT, GPT ☆6,267 Updated last year
- A benchmark framework for Tensorflow ☆1,150 Updated last year
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,551 Updated 2 months ago
- TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance. ☆971 Updated this week
- Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native. ☆508 Updated 3 months ago
- AIStore: scalable storage for AI applications ☆1,569 Updated this week
- CUDA Kernel Benchmarking Library ☆696 Updated this week
- NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs ☆558 Updated 3 months ago
- A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology ☆1,169 Updated 2 months ago
- CUDA Python: Performance meets Productivity ☆2,881 Updated this week
- Samples for Intel® oneAPI Toolkits ☆1,064 Updated this week
- A profiling and performance analysis tool for machine learning ☆407 Updated this week
- A Python-level JIT compiler designed to make unmodified PyTorch programs faster. ☆1,056 Updated last year
- CUDA Templates for Linear Algebra Subroutines ☆8,182 Updated last week