baidu-research / catamountLinks
Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute requirements
☆14Updated 4 years ago
Alternatives and similar repositories for catamount
Users that are interested in catamount are comparing it to the libraries listed below
Sorting:
- Generating Families of Practical Fast Matrix Multiplication Algorithms☆12Updated 7 years ago
- ☆10Updated 2 years ago
- ☆14Updated 6 years ago
- Cairo lua bindings with extensions for torch☆15Updated 8 years ago
- stochs: fast stochastic solvers for machine learning in C++ and Cython☆26Updated 2 years ago
- GPU Automatically Tuned Linear Algebra Software☆28Updated 9 years ago
- Examples of building probabilistic models with MXNet linear algebra operators☆23Updated 7 years ago
- Python bindings for libNVVM☆37Updated 11 years ago
- Scientific library for high-precision computations and research☆49Updated 7 years ago
- The "CUDA templates" are a collection of C++ template classes and functions which provide a consistent interface to NVIDIA's "Compute Uni…☆27Updated 13 years ago
- Fork of magma to include more BLAS☆28Updated 8 years ago
- A CUDA implementation of the Tsetlin Machine based on bitwise operators☆26Updated 5 years ago
- Deep learning with a multiplication budget☆47Updated 6 years ago
- Input-aware cuBLAS/clBLAS implementation for better performance☆17Updated 2 years ago
- nGraph™ Backend for ONNX☆42Updated 2 years ago
- Code examples for CUDA and OpenACC☆34Updated 9 months ago
- python experiment management toolset☆15Updated 5 years ago
- OpenCL porting of the GROMACS molecular simulation toolkit☆25Updated 9 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets).☆94Updated 8 years ago
- A Cython interface to FLANN☆24Updated 4 years ago
- Alchemist: an Apache Spark<->MPI interface☆26Updated 7 years ago
- Orio is an open-source extensible framework for the definition of domain-specific languages and generation of optimized code for multiple…☆37Updated 3 years ago
- TTC: A high-performance Compiler for Tensor Transpositions☆20Updated 7 years ago
- A platform for online learning that curtails data latency and saves you cost.☆47Updated 3 years ago
- akid is a python package written for doing research in Neural Network.☆14Updated 2 years ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations☆17Updated 5 years ago
- A visualization tool to show a TensorFlow's graph like TensorBoard☆44Updated 4 years ago
- The Operator Vectorization Library, or OVL, is a python productivity library for defining high performance custom operators for the Tenso…☆68Updated 8 years ago
- MCMC for the Dark Energy Spectroscopic Instrument☆13Updated 9 years ago
- npcomp - An aspirational MLIR based numpy compiler☆51Updated 4 years ago