Mengjintao / FastCNN
☆18 Updated 2 years ago
Alternatives and similar repositories for FastCNN:
Users that are interested in FastCNN are comparing it to the libraries listed below
- Yet another Polyhedra Compiler for DeepLearning ☆19 Updated last year
- ☆14 Updated 2 years ago
- A demo of how to write a high-performance convolution that runs on Apple silicon ☆52 Updated 2 years ago
- symmetric int8 gemm ☆66 Updated 4 years ago
- study of cutlass ☆20 Updated 2 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer ☆88 Updated 11 months ago
- ☆17 Updated 9 months ago
- My learning notes about AI, including Machine Learning and Deep Learning. ☆18 Updated 5 years ago
- ☆94 Updated 3 years ago
- ☆30 Updated last year
- play gemm with tvm ☆85 Updated last year
- ☆11 Updated 11 months ago
- OneFlow->ONNX ☆42 Updated last year
- flexible-gemm conv of deepcore ☆17 Updated 5 years ago
- ICML2017 MEC: Memory-efficient Convolution for Deep Neural Network, C++ implementation (unofficial) ☆17 Updated 5 years ago
- ☆69 Updated last year
- Benchmark scripts for TVM ☆73 Updated 2 years ago
- An external memory allocator example for PyTorch. ☆14 Updated 3 years ago
- ☆19 Updated 3 years ago
- A set of examples around MegEngine ☆31 Updated last year
- Tencent NCNN with added CUDA support ☆68 Updated 4 years ago
- ☆38 Updated 4 years ago
- ☆33 Updated 3 months ago
- TQT's pytorch implementation. ☆21 Updated 3 years ago
- Fast NPU-aware Neural Architecture Search ☆22 Updated 3 years ago
- ☆25 Updated 9 months ago
- quantize aware training package for NCNN on pytorch ☆70 Updated 3 years ago
- arm-neon ☆89 Updated 5 months ago
- CVFusion is an open-source deep learning compiler to fuse the OpenCV operators. ☆28 Updated 2 years ago
- An easy way to run, test, benchmark and tune OpenCL kernel files ☆23 Updated last year