horizon-research / Efficient-Deep-Learning-for-Point-Clouds
☆45 Updated 4 years ago
Alternatives and similar repositories for Efficient-Deep-Learning-for-Point-Clouds:
Users who are interested in Efficient-Deep-Learning-for-Point-Clouds are comparing it to the libraries listed below:
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity ☆108 Updated 3 weeks ago
- ☆19 Updated 4 years ago
- A general framework for optimizing DNN dataflow on systolic array ☆34 Updated 4 years ago
- Official implementation of "Searching for Winograd-aware Quantized Networks" (MLSys'20) ☆27 Updated last year
- Simulator for BitFusion ☆99 Updated 4 years ago
- [EuroSys'24] Minuet: Accelerating 3D Sparse Convolutions on GPUs ☆75 Updated 10 months ago
- ☆33 Updated 3 years ago
- ☆34 Updated 4 years ago
- ☆70 Updated 5 years ago
- ☆25 Updated 3 years ago
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi… ☆25 Updated 2 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆52 Updated last week
- QuickEst repository: Quick Estimation of Quality of Results ☆26 Updated 6 years ago
- Provides the code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators" by Luk… ☆17 Updated 5 years ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture… ☆23 Updated 2 years ago
- PyTorch implementation of DiracDeltaNet from paper Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs ☆31 Updated 5 years ago
- Training with Block Minifloat number representation ☆14 Updated 3 years ago
- Benchmark for matrix multiplications between dense and block sparse (BSR) matrix in TVM, blocksparse (Gray et al.) and cuSparse. ☆24 Updated 4 years ago
- A Comprehensive Model-Based Analysis Framework for High Level Synthesis of Real Applications ☆34 Updated 4 years ago
- Approximate layers - TensorFlow extension ☆27 Updated last week
- Algorithm-hardware Co-design for Deformable Convolution ☆24 Updated 4 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning" ☆25 Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆15 Updated 3 years ago
- ☆19 Updated 5 years ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/) ☆64 Updated 3 years ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications. ☆13 Updated 2 months ago
- A Generic Distributed Auto-Tuning Infrastructure ☆22 Updated 3 years ago
- Static Block Floating Point Quantization for CNN ☆32 Updated 3 years ago
- pytorch fixed point training tool/framework ☆34 Updated 4 years ago
- C++ RTL simulator for EIE (https://arxiv.org/abs/1602.01528) ☆22 Updated 4 years ago