Highly optimized FFT library based on CUDA (as fast as cuFFT, and sometimes faster)
☆19Jun 13, 2017Updated 8 years ago
Alternatives and similar repositories for xfft
Users that are interested in xfft are comparing it to the libraries listed below.
- Optimized half precision gemm assembly kernels (deprecated due to ROCm)☆47Jun 16, 2017Updated 8 years ago
- Partial source code of deepcore v0.7☆27Jun 28, 2020Updated 5 years ago
- assembler for NVIDIA FERMI. Imported from Google Code☆77Mar 22, 2015Updated 11 years ago
- Matrix-Vector Multiplication Using Shared and Coalesced Memory Access☆16Apr 9, 2013Updated 13 years ago
- CUDA FFT convolution☆16Mar 18, 2015Updated 11 years ago
- DeepID2 face verification based on Caffe.☆15Mar 31, 2018Updated 8 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Sep 21, 2013Updated 12 years ago
- This repo generates handwritten math equation images that may be applied to any use☆14Jan 5, 2021Updated 5 years ago
- ROCm Command Line Profiler - Updates moved to https://github.com/GPUOpen-Tools/RCP☆10Aug 24, 2017Updated 8 years ago
- This repository contains a SystemVerilog implementation of a parametrized Round Robin arbiter with three instantiation options☆13Jan 28, 2024Updated 2 years ago
- Programming Assignment Project for the Information Visualization Course at the University of Chinese Academy of Sciences☆12Mar 10, 2017Updated 9 years ago
- An implementation of http://www.cs.huji.ac.il/~danix/ShadowRemoval/☆14May 11, 2017Updated 9 years ago
- Native systemd integration for Go (golang) programs☆12May 1, 2017Updated 9 years ago
- A fork of the main Verilator project for development work. The changes here are in preparation for committing back to the main project.☆18Nov 26, 2014Updated 11 years ago
- Plot pixels on a 320x200 256c canvas☆11Jan 8, 2024Updated 2 years ago
- Build Your Own Bundle-A Neural Combinatorial Optimization Method (BYOB)☆13Apr 27, 2022Updated 4 years ago
- HCC Sample Applications☆13Jan 3, 2017Updated 9 years ago
- Assembler for NVIDIA Maxwell architecture☆1,067Jan 3, 2023Updated 3 years ago
- record metrics and logs☆10Apr 2, 2018Updated 8 years ago
- Statically-typed localization messages.☆10Oct 11, 2020Updated 5 years ago
- Package vxlan implements marshaling and unmarshaling of Virtual eXtensible Local Area Network (VXLAN) frames, as described in RFC 7348. …☆12Apr 20, 2016Updated 10 years ago
- This is a tuned sparse matrix dense vector multiplication (SpMV) library☆23Mar 21, 2016Updated 10 years ago
- Detect facial landmarks with mini-caffe☆18Feb 23, 2017Updated 9 years ago
- Caffe triplet loss implementation☆17May 12, 2016Updated 10 years ago
- AnyDSL traversal code☆15Feb 18, 2019Updated 7 years ago
- CLRadeonExtender (GCN assembler, Radeon assembler) mirror☆103Feb 16, 2025Updated last year
- Simple example showing how to use DGMA in OpenCL☆13Feb 11, 2016Updated 10 years ago
- ☆17Mar 13, 2015Updated 11 years ago
- ☆10Aug 4, 2022Updated 3 years ago
- ☆13Sep 28, 2022Updated 3 years ago
- Distributed machine learning platform☆13Aug 20, 2015Updated 10 years ago
- An assembler/compiler for AMD’s GCN (Graphics Core Next architecture) assembly language☆42Jan 17, 2023Updated 3 years ago
- Graph4NLP is the library for the easy use of Graph Neural Networks for Natural Language Processing☆15Feb 12, 2021Updated 5 years ago
- ☆16Dec 15, 2020Updated 5 years ago
- ROCm - AMDGPU Compute Application Binary Interface☆41Mar 19, 2022Updated 4 years ago
- Context2Bundle: Diversified Personalized Bundle Recommendation☆12Feb 22, 2018Updated 8 years ago
- A Script oriented compiler☆27Feb 13, 2017Updated 9 years ago
- Golang android library☆13Dec 20, 2015Updated 10 years ago
- Tomasulo Simulator written in React as the project for Computer Architecture course, Spring 2019, Tsinghua University☆11Jun 9, 2019Updated 6 years ago