UofT-EcoSystem / Minuet
[EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs
☆75Updated 10 months ago
Alternatives and similar repositories for Minuet:
Users who are interested in Minuet are comparing it to the libraries listed below:
- CUda Matrix Multiply library.☆75Updated last month
- [CVPR'23] FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer☆130Updated last year
- sparse convolution lib. derived from spconv☆55Updated 4 years ago
- Python C++ Code Manager☆14Updated 6 months ago
- A neural network training interface based on PyTorch, with a focus on flexibility☆62Updated last year
- Code for the ICCV'23 paper: Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection☆28Updated last year
- ☆45Updated 4 years ago
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆108Updated 3 weeks ago
- The CPU implementation of bucket-based farthest point sampling; achieves a 7-81x speedup over the conventional implementation☆21Updated last year
- The four major frameworks for 3D point cloud sparse acceleration, which are currently mainstream, are compared. These include MIT-HAN-LAB…☆26Updated 2 months ago
- PyTorch implementation of our paper MaxQ: Multi-Axis Query for N:M Sparsity Network, accepted by CVPR 2024.☆36Updated last year
- ICLR2024: LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection.☆76Updated 7 months ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago
- The GPU implementation of bucket-based farthest point sampling; achieves a 3-4x speedup over the conventional implementation☆13Updated last year
- A fast and memory-efficient library for sparse transformers with varying token numbers (e.g., 3D point clouds).☆163Updated last year
- [ECCV 2024] Occupancy as Set of Points☆88Updated 9 months ago
- The codes for RFNet: Recurrent Forward Network for Dense Point Cloud Completion☆21Updated 3 years ago
- The codes for ECCV'22: Resolution-free Point Cloud Sampling Network with Data Distillation☆17Updated 2 years ago
- SGEMM optimization with CUDA, step by step☆18Updated last year
- This project is the official implementation of our accepted ICLR 2021 paper BiPointNet: Binary Neural Network for Point Clouds.☆75Updated 4 years ago
- Patch convolution to avoid large GPU memory usage of Conv2D☆86Updated 2 months ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆68Updated 11 months ago
- Artifacts of EVT ASPLOS'24☆23Updated last year
- (NeurIPS 2024) LiT: Unifying LiDAR "Languages" with LiDAR Translator☆19Updated 3 months ago
- ☆22Updated 6 months ago
- ⚡️Write HGEMM from scratch using Tensor Cores with WMMA, MMA and CuTe API, Achieve Peak⚡️ Performance.☆70Updated 3 weeks ago
- tutorial for writing custom pytorch cpp+cuda kernel, applied on volume rendering (NeRF)☆27Updated last year
- High-res 3D Occupancy Dataset for Unified 3D Scene Understanding.☆24Updated 9 months ago
- [ICRA 2025] Official implementation for "TrackOcc: Camera-based 4D Panoptic Occupancy Tracking"☆27Updated last week
- Quantized Attention on GPU☆45Updated 5 months ago