octoml / deformable-attention-kernel
TVMScript kernel for deformable attention
☆25 · Dec 15, 2021 · Updated 4 years ago
Alternatives and similar repositories for deformable-attention-kernel
Users who are interested in deformable-attention-kernel are comparing it to the libraries listed below
- [CVPR 2023] RILS: Masked Visual Reconstruction in Language Semantic Space (https://arxiv.org/abs/2301.06958) ☆44 · Sep 5, 2023 · Updated 2 years ago
- ☆12 · Mar 13, 2023 · Updated 2 years ago
- DQN-MxNet-Gluon ☆23 · Nov 12, 2017 · Updated 8 years ago
- OneFlow->ONNX ☆43 · Apr 19, 2023 · Updated 2 years ago
- Depicts the GPU memory footprint during DNN training in PyTorch ☆11 · Nov 17, 2022 · Updated 3 years ago
- ☆25 · Jun 24, 2021 · Updated 4 years ago
- [CVPR 2023] Official implementation of "SAP-DETR: Bridging the Gap between Salient Points and Queries-Based Transformer Detector for Fast… ☆30 · May 28, 2023 · Updated 2 years ago
- ☆13 · Mar 27, 2023 · Updated 2 years ago
- A demo of how to write a high-performance convolution that runs on Apple silicon ☆57 · Feb 8, 2022 · Updated 4 years ago
- An external memory allocator example for PyTorch. ☆16 · Aug 10, 2025 · Updated 6 months ago
- ☆16 · Nov 21, 2017 · Updated 8 years ago
- AutoTorch, A HPO Toolkit ☆60 · May 25, 2020 · Updated 5 years ago
- A fork of tvm/unity ☆14 · Aug 12, 2023 · Updated 2 years ago
- ☆17 · Jan 1, 2024 · Updated 2 years ago
- A demo and a series of documents for learning about diffusion models. ☆42 · Jun 29, 2023 · Updated 2 years ago
- A codebase & model zoo for pretrained backbone based on MegEngine. ☆32 · Mar 6, 2023 · Updated 2 years ago
- [ICCV2023] NoiseDet: Learning from Noisy Data for Semi-Supervised 3D Object Detection ☆21 · Feb 5, 2023 · Updated 3 years ago
- ☆250 · Jul 27, 2025 · Updated 6 months ago
- Ahead of Time (AOT) Triton Math Library ☆88 · Updated this week
- Call ncnn from Fortran ☆18 · Dec 18, 2022 · Updated 3 years ago
- Useful dotfiles, including vim, zsh, tmux and vscode ☆19 · Jan 17, 2026 · Updated 3 weeks ago
- ☆19 · Jan 27, 2021 · Updated 5 years ago
- MXNet Gluon Synchronized Batch Normalization Preview ☆77 · Jul 16, 2018 · Updated 7 years ago
- ☆46 · Nov 25, 2024 · Updated last year
- The C++ version of Seq2Seq with ncnn ☆23 · Jun 27, 2021 · Updated 4 years ago
- An object detection codebase based on MegEngine. ☆28 · Dec 14, 2022 · Updated 3 years ago
- ☆23 · Apr 25, 2023 · Updated 2 years ago
- Implementation of "Regularizing Neural Networks via Minimizing Hyperspherical Energy" (CVPR'20). ☆24 · Jun 23, 2020 · Updated 5 years ago
- Distributed DataLoader for PyTorch Based on Ray ☆25 · Nov 5, 2021 · Updated 4 years ago
- Python package of rocm-smi-lib ☆24 · Dec 15, 2025 · Updated last month
- BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training ☆400 · Oct 23, 2024 · Updated last year
- Implementation of Enhancing Your Trained DETRs with Box Refinement ☆60 · Jul 26, 2023 · Updated 2 years ago
- LiBai(李白): A Toolbox for Large-Scale Distributed Parallel Training ☆405 · Jul 31, 2025 · Updated 6 months ago
- ☆60 · Apr 18, 2022 · Updated 3 years ago
- Standalone Flash Attention v2 kernel without libtorch dependency ☆114 · Sep 10, 2024 · Updated last year
- [AAAI 2023] DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding ☆57 · Nov 28, 2022 · Updated 3 years ago
- Auto-tuning momentum SGD optimizer ☆23 · Jul 14, 2017 · Updated 8 years ago
- Dive into Deep Learning Compiler ☆644 · Jun 19, 2022 · Updated 3 years ago
- Implement ARM NEON intrinsics in C++ ☆22 · May 14, 2024 · Updated last year