a c++/cuda template library for tensor lazy evaluation
☆165May 8, 2023Updated 3 years ago
Alternatives and similar repositories for mtensor
Users that are interested in mtensor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TiledKernel is a code generation library based on macro kernels and memory hierarchy graph data structure.☆19May 12, 2024Updated last year
- Offload Eigen operations to GPUs☆20Feb 3, 2022Updated 4 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine☆16Nov 27, 2012Updated 13 years ago
- C++ library for tensors☆13Dec 6, 2019Updated 6 years ago
- a simple general program language☆99Feb 2, 2026Updated 3 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- C++ tensors with broadcasting and lazy computing☆3,731Updated this week
- Subscribe Loomo published image messages and process☆10Oct 22, 2017Updated 8 years ago
- a tensor computing compiler based tile programming for gpu, cpu or tpu☆45Feb 2, 2026Updated 3 months ago
- Julian macros for wrapping ccall☆14May 27, 2021Updated 4 years ago
- Set of basic classes (vector, matrix, images and memory array) for CPU and GPU☆17Feb 17, 2021Updated 5 years ago
- Quickly warp 3D images on the GPU using CUDA. Works with C and Python.☆25Apr 16, 2021Updated 5 years ago
- ☆19Jan 12, 2021Updated 5 years ago
- Matrix Shadow:Lightweight CPU/GPU Matrix and Tensor Template Library in C++/CUDA for (Deep) Machine Learning☆1,118Aug 4, 2019Updated 6 years ago
- Hybrid CPU and GPU real-time dynamic digital image correlation engine and application☆10Apr 26, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- KMeans clustering in Eigen.☆26Apr 15, 2016Updated 10 years ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Parametric Integer Programming Library☆15Jan 23, 2024Updated 2 years ago
- A structure from motion implemention in C++ and accelerated using CUDA☆48Oct 12, 2019Updated 6 years ago
- Portable and vendor neutral framework for parallel programming on heterogeneous platforms.☆440Nov 7, 2025Updated 6 months ago
- A pytorch pretrained model of MnasNet☆21Dec 3, 2019Updated 6 years ago
- Minimal runtime core of Caffe, Forward only, GPU support and Memory efficiency.☆375Jul 15, 2020Updated 5 years ago
- Face-to-Parameter Translation for Game Character Auto-Creation. ICCV 2019☆16Apr 3, 2020Updated 6 years ago
- TensorRT-7 Network Lib 包括常用目标检测、关键点检测、人脸检测、OCR等 可训练自己数据☆537Jul 17, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Caffe Computation Graph Optimization.☆29Jan 7, 2020Updated 6 years ago
- 🔥 (yolov3 yolov4 yolov5 unet ...)A mini pytorch inference framework which inspired from darknet.☆737Apr 23, 2023Updated 3 years ago
- Extend DSO to a stereo system by scale optimization☆48Nov 20, 2020Updated 5 years ago
- Caffe 源码注释☆15Aug 15, 2017Updated 8 years ago
- LDSO 注释☆24Nov 6, 2019Updated 6 years ago
- C++ library for tensor computations☆37Apr 27, 2023Updated 3 years ago
- This is a cross-platform, CUDA-based C++ library for general-purpose, unconstrained nonlinear optimization on the GPU. It implements the …☆139Apr 3, 2020Updated 6 years ago
- ☆10Dec 3, 2022Updated 3 years ago
- Implementation of LIBELAS in cuda.☆71Mar 27, 2017Updated 9 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Jun 24, 2021Updated 4 years ago
- Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for mu…☆17Nov 15, 2010Updated 15 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,000Sep 19, 2024Updated last year
- Concurrent CPU-GPU Programming using Task Models☆109Dec 19, 2019Updated 6 years ago
- A lightweight high performance tensor algebra framework for modern C++☆837Jul 8, 2025Updated 10 months ago
- Minimal Deep Learning library is written in Python/Cython/C++ and Numpy/CUDA/cuDNN.☆102Feb 23, 2018Updated 8 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year