BoyuanFeng/APNN-TC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BoyuanFeng/APNN-TC)

BoyuanFeng / APNN-TC

☆20

Alternatives and similar repositories for APNN-TC

Users that are interested in APNN-TC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

apuaaChen / vectorSparse
View on GitHub
☆32Aug 24, 2022Updated 3 years ago
pnnl / TCBNN
View on GitHub
☆39Jul 25, 2022Updated 4 years ago
JamesTheZ / VersaPipe
View on GitHub
A framework for pipelined computing on GPU
☆30Jul 17, 2019Updated 7 years ago
xxcclong / GNN-Computing
View on GitHub
Artifact for PPoPP20 "Understanding and Bridging the Gaps in Current GNN Performance Optimizations"
☆42Nov 16, 2021Updated 4 years ago
gilshm / sparq
View on GitHub
Post-training sparsity-aware quantization
☆34Feb 26, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
YukeWang96 / TC-GNN_ATC23
View on GitHub
Artifact for USENIX ATC'23: TC-GNN: Bridging Sparse GNN Computation and Dense Tensor Cores on GPUs.
☆58Oct 16, 2023Updated 2 years ago
GVProf / GVProf
View on GitHub
GVProf: A Value Profiler for GPU-based Clusters
☆54Mar 24, 2024Updated 2 years ago
YukeWang96 / QGTC_PPoPP22
View on GitHub
Artifact for PPoPP22 QGTC: Accelerating Quantized GNN via GPU Tensor Core.
☆30Feb 12, 2022Updated 4 years ago
yondonfu / sol-baby-jubjub
View on GitHub
Solidity implementation of the baby jubjub curve
☆21Apr 30, 2024Updated 2 years ago
SNUCP / fast-ksw
View on GitHub
POC implementation of "Accelerating HE Operations Using Key Decomposition"[KLSS23]
☆19Jun 11, 2025Updated last year
ThisisBillhe / BiViT
View on GitHub
The official implementation of BiViT: Extremely Compressed Binary Vision Transformers
☆16Jun 18, 2023Updated 3 years ago
wahibium / KFF
View on GitHub
Scalable GPU Kernel Fission/Fusion Transformation for Memory-Bound Kernels
☆14Aug 26, 2015Updated 10 years ago
VITA-Group / Linearity-Grafting
View on GitHub
[ICML 2022] "Linearity Grafting: Relaxed Neuron Pruning Helps Certifiable Robustness" by Tianlong Chen*, Huan Zhang*, Zhenyu Zhang, Shiyu…
☆16Jun 22, 2022Updated 4 years ago
daadaada / turingas
View on GitHub
Assembler for NVIDIA Volta and Turing GPUs
☆246Jan 13, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
getianao / ngAP
View on GitHub
ngAP's artifact for ASPLOS'24
☆25Jul 29, 2025Updated 11 months ago
ziplab / QLLM
View on GitHub
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆31Mar 12, 2024Updated 2 years ago
huqinghao / PalQuant
View on GitHub
☆12Aug 26, 2022Updated 3 years ago
czkkkkkk / gccl
View on GitHub
☆13Jan 23, 2021Updated 5 years ago
segmind / cral
View on GitHub
Open Source Deep Learning Computer Vision (DLCV) Library
☆16Nov 26, 2020Updated 5 years ago
ThisisBillhe / torch_quantizer
View on GitHub
torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.
☆25Mar 29, 2024Updated 2 years ago
ziplab / QTool
View on GitHub
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆73Oct 7, 2021Updated 4 years ago
sjtu-epcc / Tacker
View on GitHub
Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS
☆33Feb 10, 2025Updated last year
sderek / CUDAAdvisor
View on GitHub
CUDAAdvisor: a GPU profiling tool
☆53Aug 24, 2018Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
google-research / sputnik
View on GitHub
A library of GPU kernels for sparse matrix operations.
☆289Nov 24, 2020Updated 5 years ago
quiver-team / quiver-feature
View on GitHub
High performance RDMA-based distributed feature collection component for training GNN model on EXTREMELY large graph
☆55Jul 3, 2022Updated 4 years ago
ModelTC / Outlier_Suppression_Plus
View on GitHub
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆52Oct 21, 2023Updated 2 years ago
lixiuhong / batched_gemm
View on GitHub
☆40Feb 28, 2020Updated 6 years ago
deJQK / FracBits
View on GitHub
Neural Network Quantization With Fractional Bit-widths
☆11Feb 19, 2021Updated 5 years ago
KlugerLab / deepcytof
View on GitHub
☆19Mar 15, 2017Updated 9 years ago
Tianshi-Xu / PrivCirNet
View on GitHub
[NeurIPS'24] Official implement of "PrivCirNet: Efficient Private Inference via Block Circulant Transformation"
☆14Feb 26, 2026Updated 4 months ago
Cornell-RelaxML / Hyperdimensional-Computing
View on GitHub
Official implementation for the paper "Understanding Hyperdimensional Computing for Parallel Single-Pass Learning"
☆25Jun 10, 2023Updated 3 years ago
linnanwang / superneurons-release
View on GitHub
this is the release repository of superneurons
☆54Feb 13, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Res2Net / Res2Net-fcanet
View on GitHub
Res2Net for Interactive Image Segmentation using fcanet (CVPR 2020)
☆12May 13, 2020Updated 6 years ago
marsupialtail / sparsednn
View on GitHub
Fast sparse deep learning on CPUs
☆55Sep 28, 2022Updated 3 years ago
DohyeongKi / better-homomorphic-sine-evaluation
View on GitHub
☆12May 24, 2022Updated 4 years ago
hgyhungry / ge-spmm
View on GitHub
☆115Jul 3, 2021Updated 5 years ago
SuperScientificSoftwareLaboratory / TileSpGEMM
View on GitHub
Source code of the PPoPP '22 paper: "TileSpGEMM: A Tiled Algorithm for Parallel Sparse General Matrix-Matrix Multiplication on GPUs" by Y…
☆48May 22, 2024Updated 2 years ago
PointCloudYC / SQN_tensorflow
View on GitHub
TensorFlow re-implementation of SQN for weakly supervised segmentation on point clouds.
☆14Apr 16, 2026Updated 3 months ago
vancemiller / CUDA-preemption
View on GitHub
Experiments evaluating preemption on the NVIDIA Pascal architecture
☆16Nov 10, 2016Updated 9 years ago