NVIDIA / tensorflow

An Open Source Machine Learning Framework for Everyone

☆1,152

Alternatives and similar repositories for tensorflow

Users who are interested in tensorflow are comparing it to the libraries listed below.

NVIDIA / cudnn-frontend
cudnn_frontend provides a C++ wrapper for the cuDNN backend API and samples showing how to use it
☆600 Updated this week
NVIDIA / TransformerEngine
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Bla…
☆2,602 Updated this week
NVIDIA / nccl
Optimized primitives for collective multi-GPU communication
☆3,923 Updated 2 weeks ago
NVIDIA / CUDALibrarySamples
CUDA Library Samples
☆2,047 Updated 3 weeks ago
pytorch / TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
☆2,824 Updated this week
tensorflow / tensorrt
TensorFlow/TensorRT integration
☆743 Updated last year
NVIDIA / nvidia-container-toolkit
Build and run containers leveraging NVIDIA GPUs
☆3,515 Updated this week
NVIDIA / cccl
CUDA Core Compute Libraries
☆1,834 Updated this week
mlcommons / inference
Reference implementations of MLPerf™ inference benchmarks
☆1,432 Updated this week
NVIDIA / NVTX
The NVIDIA® Tools Extension SDK (NVTX) is a C-based Application Programming Interface (API) for annotating events, code ranges, and resou…
☆434 Updated this week
NVIDIA / nccl-tests
NCCL Tests
☆1,209 Updated 2 weeks ago
NVIDIA / cub
[ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl
☆1,768 Updated last year
uxlfoundation / oneDNN
oneAPI Deep Neural Network Library (oneDNN)
☆3,859 Updated this week
pytorch / FBGEMM
FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/
☆1,416 Updated this week
pytorch / kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
☆845 Updated this week
triton-inference-server / client
Triton Python, C++ and Java client libraries, and gRPC-generated client examples for Go, Java and Scala.
☆639 Updated this week
NVIDIA / FasterTransformer
Transformer-related optimization, including BERT and GPT
☆6,267 Updated last year
tensorflow / benchmarks
A benchmark framework for TensorFlow
☆1,150 Updated last year
CVCUDA / CV-CUDA
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
☆2,551 Updated 2 months ago
pytorch / benchmark
TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.
☆971 Updated this week
NVIDIA / NeMo-Framework-Launcher
Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.
☆508 Updated 3 months ago
NVIDIA / aistore
AIStore: scalable storage for AI applications
☆1,569 Updated this week
NVIDIA / nvbench
CUDA Kernel Benchmarking Library
☆696 Updated this week
NVIDIA / DCGM
NVIDIA Data Center GPU Manager (DCGM) is a project for gathering telemetry and measuring the health of NVIDIA GPUs
☆558 Updated 3 months ago
NVIDIA / gdrcopy
A fast GPU memory copy library based on NVIDIA GPUDirect RDMA technology
☆1,169 Updated 2 months ago
NVIDIA / cuda-python
CUDA Python: Performance meets Productivity
☆2,881 Updated this week
oneapi-src / oneAPI-samples
Samples for Intel® oneAPI Toolkits
☆1,064 Updated this week
openxla / xprof
A profiling and performance analysis tool for machine learning
☆407 Updated this week
pytorch / torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
☆1,056 Updated last year
NVIDIA / cutlass
CUDA Templates for Linear Algebra Subroutines
☆8,182 Updated last week