Documentation for StreamExecutor open source proposal
☆83Mar 28, 2016Updated 10 years ago
Alternatives and similar repositories for streamexecutordoc
Users that are interested in streamexecutordoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆371Oct 23, 2017Updated 8 years ago
- tensorflow源码阅读笔记☆191Sep 18, 2018Updated 7 years ago
- Documentation for the entire CGRAFlow☆19Sep 17, 2021Updated 4 years ago
- ☆24Jul 31, 2017Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Mar 28, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Minimal numerical computation library with TensorFlow APIs☆304Jan 2, 2019Updated 7 years ago
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- ☆23Apr 25, 2023Updated 3 years ago
- GPU-specialized parameter server for GPU machine learning.☆102Apr 5, 2018Updated 8 years ago
- TF2 implementation of DLRM (inherited and modified from openrec's initial implementation)☆15Jul 14, 2020Updated 5 years ago
- ☆423Feb 24, 2026Updated 2 months ago
- Haskell binding for Menoh DNN inference library☆12Nov 30, 2018Updated 7 years ago
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 5 years ago
- A performant and modular runtime for TensorFlow☆754Sep 4, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DLPack for Tensorflow☆35Apr 13, 2020Updated 6 years ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆100Nov 19, 2021Updated 4 years ago
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆10May 25, 2015Updated 10 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆925Dec 30, 2024Updated last year
- OFI Programmer's Guide☆52Dec 29, 2022Updated 3 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆743Jan 26, 2023Updated 3 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Apr 22, 2016Updated 10 years ago
- TensorFlow and TVM integration☆36Apr 27, 2020Updated 6 years ago
- A benchmark framework for Tensorflow☆1,147Oct 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Heterogeneous Active Messages C++ library☆21Nov 8, 2019Updated 6 years ago
- Collective communications library with various primitives for multi-machine training.☆1,425Apr 21, 2026Updated 3 weeks ago
- TVM stack: exploring the incredible explosion of deep-learning frameworks and how to bring them together☆65May 22, 2018Updated 7 years ago
- Voice from TUNA☆18Dec 10, 2018Updated 7 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,000Sep 19, 2024Updated last year
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆76Dec 11, 2020Updated 5 years ago
- ☆605Apr 6, 2018Updated 8 years ago
- TensorFlow kernels for probing memory☆15Mar 2, 2017Updated 9 years ago
- heterogeneity-aware-lowering-and-optimization☆258Jan 20, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 2016华为codecraft算法大赛 (dfs+pruning)☆12Mar 6, 2017Updated 9 years ago
- ☆12Feb 5, 2023Updated 3 years ago
- ☆10Jul 22, 2023Updated 2 years ago
- Chaos: Scale-out Graph Processing from Secondary Storage☆52Mar 14, 2016Updated 10 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆299Nov 28, 2018Updated 7 years ago
- It is open source ebook about TensorFlow kernel and implementation mechanism.☆2,889May 5, 2023Updated 3 years ago
- An MLIR frontend for tensor expressions☆24Sep 5, 2020Updated 5 years ago