ducha-aiki / fast_atan2
Realization of the atan2 approximations faster than standard function.
☆26Updated 11 years ago
Alternatives and similar repositories for fast_atan2
Users that are interested in fast_atan2 are comparing it to the libraries listed below
Sorting:
- SIMD optimised library for matrix inversion of 2x2, 3x3, and 4x4 matrices.☆93Updated 9 years ago
- PFFFT: a pretty fast FFT with Bluestein's FFT Algorithm(Non Power of Two)☆25Updated 6 years ago
- Automatically exported from code.google.com/p/math-neon☆40Updated 10 years ago
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW☆58Updated 12 years ago
- A header only library implementing common mathematical functions using SIMD intrinsics☆105Updated 3 months ago
- Just my local copy of math-neon with build script☆93Updated 6 years ago
- Parallel Tasking Library (PTL) - Lightweight C++11 mutilthreading tasking system featuring thread-pool, task-groups, and lock-free task q…☆46Updated 6 months ago
- Implement ARM NEON intrinsics in C++☆21Updated last year
- The repository targets the OpenCL gemm function performance optimization. It compares several libraries clBLAS, clBLAST, MIOpenGemm, Inte…☆17Updated 6 years ago
- FastAC - Amir Said's Arithmetic and Huffman coding library, example code, and documentation☆29Updated 3 years ago
- SSE2 implementations of sin, cos, exp, log, tan, cot, atan, atan2☆93Updated 8 years ago
- Example of how to use CUDA with CMake >= 3.8☆70Updated last year
- CPP20 implementation of a 16-bit floating-point type mimicking most of the IEEE 754 behavior. Single file and header-only.☆41Updated last year
- how to design cpu gemm on x86 with avx256, that can beat openblas.☆70Updated 6 years ago
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆17Updated this week
- GEMM☆10Updated last year
- Collection of easy, well-documented and useful OpenCL examples in C++.☆77Updated 3 years ago
- step-by-step tutorial for optimizing a Gaussian image smoothing function☆24Updated 4 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- A small and fast Discrete Fourier Transform library☆48Updated last year
- Portable 128-bit SIMD intrinsics☆58Updated last year
- C99/C++ header-only library for division via fixed-point multiplication by inverse☆51Updated last year
- A single header-only C++ library for automatic / algorithmic differentiation.☆12Updated 2 years ago
- Image processing library for learning purpose☆54Updated 5 months ago
- Verilog Implementation of the Census Transform Stereo Vision algorithm☆28Updated 2 years ago
- PyTorch -> ONNX -> TVM for autotuning☆24Updated 5 years ago
- IEEE 754-based c++ half-precision floating point library forked from http://half.sourceforge.net☆23Updated 3 years ago
- Fast C++ library to resize images, downscale (supersampling), upscale (Lanczos), gamma correction☆35Updated 9 years ago
- Fast Fourier Transform C++ Header/MPI Transpose for FFTW3 with Implicitly Dealiased Convolutions☆132Updated this week
- Simple example of using Vulkan for GPGPU computing☆54Updated 6 years ago