hertasecurity / gpu-nms
This repository contains the CUDA implementation of the paper "Work-efficient Parallel Non-Maximum Suppression Kernels".
☆13Updated 4 years ago
Alternatives and similar repositories for gpu-nms:
Users that are interested in gpu-nms are comparing it to the libraries listed below
- Useful plugins for Pytorch1.5+ and TensorRT7/8.☆37Updated 3 years ago
- ☆25Updated 3 years ago
- TensorRT plugin forDCNv2 layer in ONNX model☆60Updated 4 years ago
- Simple demo of tensorrt plugin☆43Updated 3 years ago
- YOLOV3-Tiny TensorRT6.0 13个类别☆32Updated 4 years ago
- TensorRT implementation of "RepVGG: Making VGG-style ConvNets Great Again"☆76Updated 4 years ago
- Useful tensorrt plugin. For pytorch and mmdetection model conversion.☆165Updated 5 months ago
- Implement yolov5 with Tensorrt C++ api, and integrate batchedNMSPlugin. A Python wrapper is also provided.☆50Updated 3 years ago
- Benchmark inference speed of CNNs with various quantization methods in Pytorch+TensorRT with Jetson Nano/Xavier☆55Updated last year
- ☆59Updated 4 years ago
- some idea about object detection☆18Updated 4 years ago
- Based of paper "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"☆63Updated 4 years ago
- CUDA implementation of NMS for PyTorch☆85Updated 5 years ago
- nanodet int8 量化,实测推理2ms一帧!☆37Updated 3 years ago
- convert torch module to tensorrt network or tvm function☆88Updated 5 years ago
- Parallel CUDA implementation of NON maximum Suppression☆79Updated 4 years ago
- 基于TensorRT7实现DCNv2插件☆48Updated 3 years ago
- This repository provides a sample to run yolov3 on int8 mode in tensorRT☆26Updated 5 years ago
- centernet, mobilenetv2, centerface☆52Updated 5 years ago
- OrienMask: Real-time Instance Segmentation with Discriminative Orientation Maps☆50Updated 3 years ago
- Darknet implementation of CenterNet☆29Updated 5 years ago
- ☆79Updated 4 years ago
- This repository has been moved. The new location is in https://github.com/TexasInstruments/edgeai-tensorlab☆71Updated 11 months ago
- FFBNET : LIGHTWEIGHT BACKBONE FOR OBJECT DETECTION BASED FEATURE FUSION BLOCK☆13Updated 5 years ago
- A resnet18 version of CenterNet(objects as points)☆124Updated 3 years ago
- Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression (AAAI 2020)☆43Updated 3 years ago
- SOLOv2 on onnx & tensorRT☆54Updated 3 years ago
- ☆27Updated last year
- useful cuda code .☆42Updated 3 years ago
- A fully cuda implementation of DCNv2(deformable convolution) forward. Without dependent of cuTorch(THC).☆10Updated 5 years ago