CUDA and OpenMP implementations of C2R/R2C inplace transposition
☆48Feb 10, 2015Updated 11 years ago
Alternatives and similar repositories for inplace
Users that are interested in inplace are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Full-speed Array of Structures access☆177Apr 25, 2023Updated 2 years ago
- ☆11Dec 5, 2018Updated 7 years ago
- ARPACK ported to JavaScript. Seriously!☆13Nov 6, 2015Updated 10 years ago
- An experimental method JIT for CPython 3☆29May 18, 2016Updated 9 years ago
- Oh My Fast Postgres!☆11Feb 4, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Improved performance for TensorFlow on Intel hardware.☆13Jun 25, 2018Updated 7 years ago
- ☆11Aug 8, 2021Updated 4 years ago
- Fast multidimensional algorithms☆18Feb 8, 2020Updated 6 years ago
- ☆13May 6, 2023Updated 2 years ago
- Tensor Contraction Code Generator☆39Aug 14, 2017Updated 8 years ago
- The SparseX sparse kernel optimization library☆43Jan 16, 2019Updated 7 years ago
- A collection of bit manipulation routines for C++☆21Jul 24, 2013Updated 12 years ago
- A fast implementation of the ECMA-182 CRC64 checksum using the CLMUL instruction set☆15Nov 1, 2016Updated 9 years ago
- A minimum demo for PyTorch distributed extension functionality for collectives.☆15Jul 29, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Chainer extension for K-FAC☆20Jun 16, 2019Updated 6 years ago
- Vikunja is a performance portable algorithm library that defines functions operating on ranges of elements for a variety of purposes . It…☆16Oct 10, 2023Updated 2 years ago
- ☆21Jan 21, 2026Updated 2 months ago
- Accelerating CNN's convolution operation on GPUs by using memory-efficient data access patterns.☆14Dec 8, 2017Updated 8 years ago
- Open-source stochastic GW software☆13Apr 28, 2025Updated 11 months ago
- High-Performance Tensor Transpose library☆205May 13, 2023Updated 2 years ago
- ☆17Jul 24, 2023Updated 2 years ago
- Quantum Computing for Nuclear Physics☆13Jan 9, 2026Updated 3 months ago
- Library for fast image convolution in neural networks on Intel Architecture☆30Jun 25, 2017Updated 8 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- MPI accelerator-integrated communication extensions☆40Apr 4, 2023Updated 3 years ago
- ☆11Mar 13, 2021Updated 5 years ago
- Library and accelerator backend☆15Updated this week
- CUDA Tensor Transpose (cuTT) library☆54Aug 10, 2017Updated 8 years ago
- ☆18Apr 8, 2022Updated 4 years ago
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…☆12Jun 24, 2024Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆185Dec 12, 2022Updated 3 years ago
- High Performance Computing for Weather and Climate☆42Feb 3, 2026Updated 2 months ago
- Programming Questions (July 2013)☆11Apr 4, 2015Updated 11 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A vectorizable multi-dimensional iterator for C++ using the Coroutines TS☆12Jun 5, 2022Updated 3 years ago
- GPUDirect Async implementation of HPGMG-FV CUDA☆11May 11, 2018Updated 7 years ago
- finding set bits in large bitmaps☆15Nov 30, 2015Updated 10 years ago
- a heterogeneous multiGPU level-3 BLAS library☆46Dec 9, 2019Updated 6 years ago
- ☆16Jul 29, 2022Updated 3 years ago
- Collection of full, mini, proxy, and benchmark apps.☆11Feb 14, 2020Updated 6 years ago
- ☆31Oct 27, 2023Updated 2 years ago