XiuYuLi/xfft

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XiuYuLi/xfft)

XiuYuLi / xfft

High optimized fft library based on CUDA(the same fast as cufft and faster some times)

☆19

Alternatives and similar repositories for xfft

Users that are interested in xfft are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

XiuYuLi / flexible-gemm
View on GitHub
flexible-gemm conv of deepcore
☆17Dec 2, 2019Updated 6 years ago
hyln9 / GCNGEMM
View on GitHub
Optimized half precision gemm assembly kernels (deprecated due to ROCm)
☆47Jun 16, 2017Updated 9 years ago
hyqneuron / asfermi
View on GitHub
assembler for NVIDIA FERMI. Imported from Google Code
☆77Mar 22, 2015Updated 11 years ago
wuqianliang / deepid2_caffe
View on GitHub
deepid2 face verification base on caffe.
☆15Mar 31, 2018Updated 8 years ago
yogesh-desai / TiledMatrixMultiplicationInCUDA
View on GitHub
TILED Matrix Multiplication in CUDA using Shared Memory. An efficient and fast way.
☆23Nov 16, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
rocmarchive / ROCm-Profiler
View on GitHub
ROCm Command Line Profiler - Updated moved to https://github.com/GPUOpen-Tools/RCP
☆10Aug 24, 2017Updated 8 years ago
tom-urkin / Round-Robin
View on GitHub
This repository contains a SystemVerilog implementation of a parametrized Round Robin arbiter with three instantiation options
☆13Jan 28, 2024Updated 2 years ago
Avalanche-io / coreos-nvidia
View on GitHub
CoreOS, Nvidia kernel module Dockerfile
☆15Aug 20, 2015Updated 10 years ago
PAA-NCIC / PPoPP2017_artifact
View on GitHub
Third party assembler and GEMM library for NVIDIA Kepler GPU
☆86Oct 8, 2019Updated 6 years ago
johnnyjana730 / UCPR
View on GitHub
UCPR: User-Centric Path Reasoning towards Explainable Recommendation, SIGIR 2021
☆13Jun 18, 2022Updated 4 years ago
hijiangtao / infovis-ucas
View on GitHub
Programming Assignment Project for Information Visualization Course on University of Chinese Academy of Sciences
☆12Mar 10, 2017Updated 9 years ago
Orcuslc / ShadowRemoval
View on GitHub
An implementation of http://www.cs.huji.ac.il/~danix/ShadowRemoval/
☆14May 11, 2017Updated 9 years ago
TI-LPRF-Software / CC2640_SimpleEddystoneBeacon
View on GitHub
A connectable broadcaster sample application compatible with Google Eddystone protocol. Runs with TI BLE-Stack 2.1.
☆12Sep 2, 2015Updated 10 years ago
progschj / reziretsar
View on GitHub
A simple triangle rasterizer
☆17Jun 21, 2014Updated 12 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AdamSturge / Real-Time-Collision-Detection
View on GitHub
A place for me to store my code while reading through Real Time Collision Detection by Christer Ericson
☆10Jul 15, 2021Updated 5 years ago
ROCm / HCC-Example-Application
View on GitHub
HCC Sample Applications
☆13Jan 3, 2017Updated 9 years ago
sgatev / g11n
View on GitHub
Statically-typed localization messages.
☆10Oct 11, 2020Updated 5 years ago
NervanaSystems / maxas
View on GitHub
Assembler for NVIDIA Maxwell architecture
☆1,074Jan 3, 2023Updated 3 years ago
vmware-archive / blackbox
View on GitHub
record metrics and logs
☆10Apr 2, 2018Updated 8 years ago
jinhkim / ObjView
View on GitHub
3d model viewer for android. Displays 3D meshes that you can interact with in real time.
☆18Jan 14, 2015Updated 11 years ago
luoyetx / mini-caffe-example
View on GitHub
detect facial landmark with mini-caffe
☆18Feb 23, 2017Updated 9 years ago
pva701 / caffe
View on GitHub
Caffe triplet loss implementation
☆17May 12, 2016Updated 10 years ago
fcitx / fcitx-fbterm
View on GitHub
Fbterm support for fcitx
☆23Dec 18, 2012Updated 13 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
gpu-pdl-nudt / GeRelion
View on GitHub
GPU-enhanced parallel implementation of single particle cryo-EM image processing
☆12Oct 2, 2017Updated 8 years ago
CLRX / CLRX-mirror
View on GitHub
CLRadeonExtender (GCN assembler, Radeon assembler) mirror
☆103Feb 16, 2025Updated last year
intel / mklnn
View on GitHub
☆10Aug 4, 2022Updated 3 years ago
melver / mc2lib
View on GitHub
Memory consistency model checking and test generation library.
☆15Oct 14, 2016Updated 9 years ago
rohith10 / ForwardPlus-InstantRadiosity
View on GitHub
☆24May 12, 2014Updated 12 years ago
ankurdave / bagel
View on GitHub
An implementation of the Pregel graph processing system on the Spark cluster computing framework. Merged into Spark; please see:
☆11Apr 9, 2011Updated 15 years ago
wmbest2 / android
View on GitHub
Golang android library
☆13Dec 20, 2015Updated 10 years ago
closest-git / GSS
View on GitHub
best CPU/GPU sparse solver for large sparse matrices
☆21Oct 5, 2021Updated 4 years ago
ROCm / ROCm-ComputeABI-Doc
View on GitHub
ROCm - AMDGPU Compute Application Binary Interface
☆41Mar 19, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
geru-scotland / ThreadPoolLib
View on GitHub
A simple but efficient C++ thread/worker pool library for asynchronous task management.
☆10Jul 11, 2023Updated 3 years ago
SunsetQuest / Asm4GCN
View on GitHub
an assembler/compiler for AMD’s GCN (Generation Core Next Architecture) Assembly Language
☆42Jan 17, 2023Updated 3 years ago
axsh / openvdc
View on GitHub
Extendable Tiny Datacenter Hypervisor on top of Mesos architecture. Wakame-vdc v2 Project.
☆12Oct 9, 2018Updated 7 years ago
xxkkrr / FPAN
View on GitHub
☆16Dec 15, 2020Updated 5 years ago
sippy / libsinet
View on GitHub
A lightweight user land implementation of the UDP/IPv4 stack designed to plug into the netmap framework. The 's' stands for speed.
☆10Dec 11, 2024Updated last year
wubinzzu / Context2Bundle
View on GitHub
Context2Bundle: Diversified Personalized Bundle Recommendation
☆12Feb 22, 2018Updated 8 years ago
KodyKantor / p2p-gossip
View on GitHub
A p2p gossip protocol for requesting artifacts.
☆11May 5, 2015Updated 11 years ago