jrk / halide-python-tutorials
Fredo's version of the Halide introductory tutorials in Python, for 6.815/6.865
☆25 · Updated 10 years ago
Related projects
Alternatives and complementary repositories for halide-python-tutorials
- A portable high-level API with CUDA or OpenCL back-end ☆54 · Updated 7 years ago
- A Halide journey taken for pleasure; this repo will hopefully serve as a collection of Halide imaging functions that are useful to the commu… ☆15 · Updated 9 years ago
- Case Studies for Halide performance against C++ and OpenCL ☆37 · Updated 11 years ago
- Skeletonide is a parallel implementation of the Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni… ☆12 · Updated 4 years ago
- Parallel network flows using OpenMP and CUDA. ☆27 · Updated 5 years ago
- GPU Automatically Tuned Linear Algebra Software ☆28 · Updated 9 years ago
- CNNs in Halide ☆23 · Updated 9 years ago
- Proof-of-Concept CNN in Halide ☆21 · Updated 8 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it. ☆17 · Updated 6 years ago
- A GPU-ready drop-in replacement for numpy. ☆32 · Updated 2 years ago
- Python Binding to NVRTC ☆79 · Updated last month
- TTC: A high-performance Compiler for Tensor Transpositions ☆20 · Updated 7 years ago
- HDF5 C++ wrapper ☆38 · Updated 2 months ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Updated 7 years ago
- ☆14 · Updated 2 years ago
- IPython / Jupyter integration for pybind11 ☆66 · Updated 7 years ago
- Whippletree, a novel approach to scheduling dynamic, irregular workloads on the GPU ☆21 · Updated 8 years ago
- Python wrapper for the OpenCL FFT library clFFT ☆54 · Updated 4 months ago
- CMake module collection ☆30 · Updated 9 years ago
- ☆14 · Updated 5 years ago
- Domain-specific language for IIR filters ☆15 · Updated 8 years ago
- ☆102 · Updated 5 years ago
- Communication-Minimizing 2D Convolution in GPU Registers ☆30 · Updated 11 years ago
- FluidNet re-written with ATen tensor lib ☆51 · Updated 5 years ago
- Scientific library for high-precision computations and research ☆50 · Updated 7 years ago
- Generalized Histograms for CUDA-capable GPUs ☆43 · Updated 9 years ago
- Multi-core CPU implementation of deep learning for 2D and 3D sliding window convolutional networks (ConvNets). ☆94 · Updated 7 years ago
- tensor4 - PyTorch to C++ converter using a lightweight templated tensor library ☆28 · Updated 4 years ago
- Code examples for the CUDA workshop ☆35 · Updated 2 years ago