a c++/cuda template library for tensor lazy evaluation
☆166May 8, 2023Updated 2 years ago
Alternatives and similar repositories for mtensor
Users that are interested in mtensor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Offload Eigen operations to GPUs☆20Feb 3, 2022Updated 4 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆16Nov 27, 2012Updated 13 years ago
- C++ library for tensors☆13Dec 6, 2019Updated 6 years ago
- a simple general program language☆100Feb 2, 2026Updated last month
- SEgo library for stereo relative pose estimation from line and point feature triplets☆10Jan 22, 2019Updated 7 years ago
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Feb 2, 2026Updated last month
- 🚀 A parallelized numerical solver for magnetohydrodynamics (MHD) equations, developed using finite volume methods to simulate 🌌 jets in…☆13Jan 18, 2025Updated last year
- Julian macros for wrapping ccall☆14May 27, 2021Updated 4 years ago
- NumPy-compatible multidimensional arrays in C++☆163Mar 1, 2026Updated 3 weeks ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU☆17Feb 17, 2021Updated 5 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,121Aug 4, 2019Updated 6 years ago
- Hybrid CPU and GPU real-time dynamic digital image correlation engine and application☆10Apr 26, 2023Updated 2 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Parametric Integer Programming Library☆15Jan 23, 2024Updated 2 years ago
- A structure from motion implemention in C++ and accelerated using CUDA☆48Oct 12, 2019Updated 6 years ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆438Nov 7, 2025Updated 4 months ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- A pytorch pretrained model of MnasNet☆21Dec 3, 2019Updated 6 years ago
- Minimal runtime core of Caffe, Forward only, GPU support and Memory efficiency.☆375Jul 15, 2020Updated 5 years ago
- TensorRT-7 Network Lib 包括常用目标检测、关键点检测、人脸检测、OCR等 可训练自己数据☆533Jul 17, 2021Updated 4 years ago
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago
- 🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.☆738Apr 23, 2023Updated 2 years ago
- Extend DSO to a stereo system by scale optimization☆48Nov 20, 2020Updated 5 years ago
- Caffe 源码注释☆15Aug 15, 2017Updated 8 years ago
- C++ library for tensor computations☆36Apr 27, 2023Updated 2 years ago
- ☆10Dec 3, 2022Updated 3 years ago
- Implementation of LIBELAS in cuda.☆71Mar 27, 2017Updated 8 years ago
- A primitive library for neural network☆1,367Nov 24, 2024Updated last year
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,003Sep 19, 2024Updated last year
- Concurrent CPU-GPU Programming using Task Models☆106Dec 19, 2019Updated 6 years ago
- A lightweight high performance tensor algebra framework for modern C++☆835Jul 8, 2025Updated 8 months ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- RISC-V GPGPU☆36Mar 6, 2020Updated 6 years ago
- Projected Overrelaxed Jacobi (JORProx) and Gauss-Seidel (SORProx) GPU implementations.☆13Jan 14, 2019Updated 7 years ago
- Masked Face Image Augmentation Tool for Dataset 300W-LP with 6D Head Pose Information.☆12Aug 12, 2022Updated 3 years ago
- ☆32Oct 26, 2020Updated 5 years ago
- Replacement for Fujitsu Cortex-M3/M4 serial programming mode☆13Sep 4, 2017Updated 8 years ago
- Code for Depth-wise Separable Convolutions: Performance Investigations☆19Jan 28, 2020Updated 6 years ago