CVFusion is an open-source deep learning compiler to fuse the OpenCV operators.
☆33Aug 31, 2022Updated 3 years ago
Alternatives and similar repositories for CVFusion
Users that are interested in CVFusion are comparing it to the libraries listed below
Sorting:
- ☆74Oct 31, 2024Updated last year
- ☆17Jan 1, 2024Updated 2 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆13Nov 23, 2024Updated last year
- mmdetection -> TVM☆15Aug 22, 2020Updated 5 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- Yet another Polyhedra Compiler for DeepLearning☆19Apr 14, 2023Updated 2 years ago
- 🐱 ncnn int8 模型量化评估☆14Oct 10, 2022Updated 3 years ago
- ☆23Apr 25, 2023Updated 2 years ago
- a single-header math library☆17Nov 7, 2025Updated 4 months ago
- MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI tr…☆71Updated this week
- GPTQ inference TVM kernel☆40Apr 25, 2024Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆40Nov 22, 2022Updated 3 years ago
- An external memory allocator example for PyTorch.☆16Aug 10, 2025Updated 7 months ago
- DenseShuffleNet for Semantic Segmentation using Caffe for Cityscapes and Mapillary Vistas Dataset☆10Mar 21, 2018Updated 7 years ago
- ☆16Mar 24, 2025Updated 11 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆95Feb 20, 2026Updated last month
- A Triton JIT runtime and ffi provider in C++☆32Updated this week
- The docs repository of Pulsar2 which is AXera's SoC 2rd AI toolchain. Such as AX650A, AX650N☆17Feb 12, 2026Updated last month
- Simple examples of using bazel to cross compile AI applicaions for armv7hf devices.☆25Mar 17, 2022Updated 4 years ago
- ☆150Jan 9, 2025Updated last year
- The Gstreamer hardware encoder/decoder plugins for Rockchip platform☆13Oct 8, 2023Updated 2 years ago
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago
- ☆11Dec 26, 2025Updated 2 months ago
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Official resporitory for "IPDPS' 24 QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices".☆20Feb 23, 2024Updated 2 years ago
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago
- This sample shows how to use the oneAPI Video Processing Library (oneVPL) to perform a single and multi-source video decode and preproces…☆15Jun 15, 2023Updated 2 years ago
- ☆11Dec 16, 2021Updated 4 years ago
- High Performan Ai Model Web Server. Mainly support computer vision model. Quickly establish your own ai-model server. https://github.com/…☆44May 13, 2025Updated 10 months ago
- ☆52Sep 30, 2022Updated 3 years ago
- 使用onnxruntime部署夜间雾霾图像的可见度增强,包含C++和Python两个版本的程序☆13Feb 17, 2024Updated 2 years ago
- Document the demo and a series of documents for learning the diffusion model.☆41Jun 29, 2023Updated 2 years ago
- HunyuanDiT with TensorRT and libtorch☆17May 22, 2024Updated last year
- ☆60Nov 21, 2024Updated last year
- DDK for Rockchip NPU☆69Dec 29, 2020Updated 5 years ago
- Transmit and receive programs for Arduino Feather M0 LoRa module to transmit one way data over LoRa☆13Feb 25, 2026Updated 3 weeks ago
- ☆14Jun 30, 2021Updated 4 years ago
- ☆10Jun 14, 2023Updated 2 years ago
- ☆38Oct 12, 2024Updated last year