Documentation for StreamExecutor open source proposal
☆83Mar 28, 2016Updated 10 years ago
Alternatives and similar repositories for streamexecutordoc
Users that are interested in streamexecutordoc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆371Oct 23, 2017Updated 8 years ago
- tensorflow源码阅读笔记☆191Sep 18, 2018Updated 7 years ago
- Documentation for the entire CGRAFlow☆19Sep 17, 2021Updated 4 years ago
- ☆24Jul 31, 2017Updated 8 years ago
- Simple example of implementing a new Tensorflow operation and its gradient in C++.☆56Mar 28, 2019Updated 7 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆21Oct 15, 2015Updated 10 years ago
- This repo is used to assess NSL's scientific research assistants.☆17Jul 7, 2025Updated 11 months ago
- Minimal numerical computation library with TensorFlow APIs☆304Jan 2, 2019Updated 7 years ago
- An IR for efficiently simulating distributed ML computation.☆33Jan 13, 2024Updated 2 years ago
- Compiler toolkit for neuFlow.☆27Jul 7, 2013Updated 12 years ago
- GPU-specialized parameter server for GPU machine learning.☆102Apr 5, 2018Updated 8 years ago
- TF2 implementation of DLRM (inherited and modified from openrec's initial implementation)☆15Jul 14, 2020Updated 5 years ago
- Haskell binding for Menoh DNN inference library☆12Nov 30, 2018Updated 7 years ago
- 滴滴云推理服务的 HTTP 客户端示例代码☆21Nov 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 6 years ago
- A performant and modular runtime for TensorFlow☆753Sep 4, 2025Updated 9 months ago
- Enhanced networking support for TensorFlow. Maintained by SIG-networking.☆100Nov 19, 2021Updated 4 years ago
- A fast multi-producer, multi-consumer lock-free concurrent queue for C++11☆10May 25, 2015Updated 11 years ago
- Installation scripts for CUDA, cuDNN, TensorFlow, Caffe, etc. on Ubuntu machines☆24Aug 1, 2021Updated 4 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆928Dec 30, 2024Updated last year
- Reed-Solomon Erasure Coding in Haskell☆23Jan 22, 2017Updated 9 years ago
- OFI Programmer's Guide☆52Dec 29, 2022Updated 3 years ago
- The Tensor Algebra SuperOptimizer for Deep Learning☆743Jan 26, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Proof-of-Concept CNN in Halide☆22Aug 4, 2016Updated 9 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Apr 22, 2016Updated 10 years ago
- TensorFlow and TVM integration☆36Apr 27, 2020Updated 6 years ago
- Just save my record on github...☆27Feb 7, 2021Updated 5 years ago
- A benchmark framework for Tensorflow☆1,147Oct 6, 2023Updated 2 years ago
- Heterogeneous Active Messages C++ library☆21Nov 8, 2019Updated 6 years ago
- Collective communications library with various primitives for multi-machine training.☆1,429Apr 21, 2026Updated last month
- CUDA Waste is a wrapper for emulation of CUDA programs on Windows☆15Feb 17, 2016Updated 10 years ago
- Voice from TUNA☆18Dec 10, 2018Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,001Sep 19, 2024Updated last year
- An Efficient Pipelined Data Parallel Approach for Training Large Model☆76Dec 11, 2020Updated 5 years ago
- ☆605Apr 6, 2018Updated 8 years ago
- heterogeneity-aware-lowering-and-optimization☆258Jan 20, 2024Updated 2 years ago
- ☆12Feb 5, 2023Updated 3 years ago
- A self-contained computer stack hobby project☆15Dec 23, 2016Updated 9 years ago
- Chaos: Scale-out Graph Processing from Secondary Storage☆52Mar 14, 2016Updated 10 years ago