An Open Source Machine Learning Framework for Everyone
☆1,150Jul 31, 2025Updated 7 months ago
Alternatives and similar repositories for tensorflow
Users that are interested in tensorflow are comparing it to the libraries listed below
Sorting:
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter…☆14,735Aug 12, 2024Updated last year
- AIStore: scalable storage for AI applications☆1,766Updated this week
- Build and run Docker containers leveraging NVIDIA GPUs☆17,498Dec 6, 2023Updated 2 years ago
- NVIDIA Linux open GPU kernel module source☆16,743Updated this week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,723Updated this week
- Build and run containers leveraging NVIDIA GPUs☆4,088Updated this week
- A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")☆379Updated this week
- A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep lear…☆5,637Updated this week
- Samples for CUDA Developers which demonstrates features in CUDA Toolkit☆8,870Jan 6, 2026Updated last month
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,947Updated this week
- Optimized primitives for collective multi-GPU communication☆4,474Updated this week
- A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit and 4-bit floating point (FP8 and FP4) precision on H…☆3,176Updated this week
- CUDA Library Samples☆2,324Feb 21, 2026Updated last week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…☆16,807Updated this week
- [ARCHIVED] The C++ Standard Library for your entire system. See https://github.com/NVIDIA/cccl☆2,308Feb 7, 2024Updated 2 years ago
- Ongoing research training transformer models at scale☆15,461Updated this week
- Transformer related optimization, including BERT, GPT☆6,394Mar 27, 2024Updated last year
- TensorFlow/TensorRT integration☆743Nov 30, 2023Updated 2 years ago
- cudnn_frontend provides a c++ wrapper for the cudnn backend API and samples on how to use it☆686Feb 18, 2026Updated last week
- CUDA Templates and Python DSLs for High-Performance Linear Algebra☆9,315Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator☆19,389Updated this week
- A high-performance framework for training wide-and-deep recommender systems on heterogeneous cluster☆161Apr 20, 2024Updated last year
- HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training☆1,045Sep 15, 2025Updated 5 months ago
- The Triton Inference Server provides an optimized cloud and edge inferencing solution.☆10,393Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,926Updated this week
- A TensorFlow Extension: GPU performance tools for TensorFlow.☆26Jul 27, 2023Updated 2 years ago
- oneAPI Deep Neural Network Library (oneDNN)☆3,956Updated this week
- HierarchicalKV is a part of NVIDIA Merlin and provides hierarchical key-value storage to meet RecSys requirements. The key capability of…☆194Updated this week
- Development repository for the Triton language and compiler☆18,501Updated this week
- Python bindings for NVTX☆67Jun 9, 2023Updated 2 years ago
- C++ and Python support for the CUDA Quantum programming model for heterogeneous quantum-classical workflows☆939Updated this week
- NVIDIA xorg.conf configurator☆41Feb 17, 2026Updated last week
- [ARCHIVED] Cooperative primitives for CUDA C++. See https://github.com/NVIDIA/cccl☆1,820Oct 9, 2023Updated 2 years ago
- CUDA Core Compute Libraries☆2,182Updated this week
- Tensors and Dynamic neural networks in Python with strong GPU acceleration☆97,688Updated this week
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,517Feb 20, 2026Updated last week
- TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizat…☆12,938Updated this week
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 5 months ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆918Dec 30, 2024Updated last year