Highly optimized FFT library based on CUDA (as fast as cuFFT, and sometimes faster)
☆19 · Jun 13, 2017 · Updated 9 years ago
Alternatives and similar repositories for xfft
Users that are interested in xfft are comparing it to the libraries listed below.
- flexible-gemm conv of deepcore ☆17 · Dec 2, 2019 · Updated 6 years ago
- Optimized half-precision GEMM assembly kernels (deprecated due to ROCm) ☆47 · Jun 16, 2017 · Updated 8 years ago
- Partial source code of deepcore v0.7 ☆27 · Jun 28, 2020 · Updated 5 years ago
- Assembler for NVIDIA Fermi. Imported from Google Code ☆77 · Mar 22, 2015 · Updated 11 years ago
- Matrix-Vector Multiplication Using Shared and Coalesced Memory Access ☆16 · Apr 9, 2013 · Updated 13 years ago
- ☆35 · Dec 2, 2016 · Updated 9 years ago
- Communication-Minimizing 2D Convolution in GPU Registers ☆30 · Sep 21, 2013 · Updated 12 years ago
- This repo generates handwritten math equation images that can be used for any purpose ☆14 · Jan 5, 2021 · Updated 5 years ago
- ROCm Command Line Profiler - Update: moved to https://github.com/GPUOpen-Tools/RCP ☆10 · Aug 24, 2017 · Updated 8 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU ☆85 · Oct 8, 2019 · Updated 6 years ago
- CoreOS NVIDIA kernel module Dockerfile ☆15 · Aug 20, 2015 · Updated 10 years ago
- Programming assignment project for the Information Visualization course at the University of Chinese Academy of Sciences ☆12 · Mar 10, 2017 · Updated 9 years ago
- An implementation of http://www.cs.huji.ac.il/~danix/ShadowRemoval/ ☆14 · May 11, 2017 · Updated 9 years ago
- HCC Sample Applications ☆13 · Jan 3, 2017 · Updated 9 years ago
- Assembler for NVIDIA Maxwell architecture ☆1,070 · Jan 3, 2023 · Updated 3 years ago
- Documents and source code related to a Hybrid HPL run for IU's BR2 machine ☆16 · Nov 27, 2012 · Updated 13 years ago
- A tuned sparse matrix-dense vector multiplication (SpMV) library ☆23 · Mar 21, 2016 · Updated 10 years ago
- Detect facial landmarks with mini-caffe ☆18 · Feb 23, 2017 · Updated 9 years ago
- CLRadeonExtender (GCN assembler, Radeon assembler) mirror ☆103 · Feb 16, 2025 · Updated last year
- ☆10 · Aug 4, 2022 · Updated 3 years ago
- An implementation of the Pregel graph processing system on the Spark cluster computing framework. Merged into Spark; please see: ☆11 · Apr 9, 2011 · Updated 15 years ago
- Distributed machine learning platform ☆13 · Aug 20, 2015 · Updated 10 years ago
- An assembler/compiler for AMD’s GCN (Graphics Core Next architecture) assembly language ☆42 · Jan 17, 2023 · Updated 3 years ago
- Best CPU/GPU sparse solver for large sparse matrices ☆21 · Oct 5, 2021 · Updated 4 years ago
- An AVX/AVX2/x64/pure-Go implementation of the ChaCha20 stream cipher for Golang. [Deprecated]. ☆11 · Mar 15, 2018 · Updated 8 years ago
- A p2p gossip protocol for requesting artifacts. ☆11 · May 5, 2015 · Updated 11 years ago
- A pure-Go implementation of the Axolotl Ratchet, extracted from pond ☆21 · Feb 8, 2017 · Updated 9 years ago
- A lightweight userland implementation of the UDP/IPv4 stack designed to plug into the netmap framework. The 's' stands for speed. ☆10 · Dec 11, 2024 · Updated last year
- Golang Wayland protocol implementation ☆13 · Oct 17, 2015 · Updated 10 years ago
- Golang DKIM Verifier ☆11 · Sep 3, 2025 · Updated 9 months ago
- Golang implementation of the Alipay Web, WAP, and mobile payment APIs ☆10 · Dec 1, 2016 · Updated 9 years ago
- A macrospin simulation tool for nanoparticles ☆12 · Dec 4, 2024 · Updated last year
- Implementation of "Neural Similarity Learning" (NeurIPS'19). ☆33 · Aug 23, 2020 · Updated 5 years ago
- Implementation of Deep Residual Learning / Residual Network for MSR paper http://arxiv.org/abs/1512.03385 ☆36 · Jan 5, 2016 · Updated 10 years ago
- Example of the DynASM library ☆22 · Jul 11, 2011 · Updated 14 years ago
- ☆11 · Apr 2, 2021 · Updated 5 years ago
- Package tftp implements a TFTP server, as described in RFC 1350. MIT Licensed. ☆11 · Jun 19, 2015 · Updated 10 years ago
- A Modular System for Flexible, High-Performance Traffic http://www.ict-mplane.eu/ ☆24 · Oct 4, 2018 · Updated 7 years ago
- A lightweight hypervisor for forensics ☆12 · Sep 1, 2015 · Updated 10 years ago