Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
☆92May 1, 2025Updated 10 months ago
Alternatives and similar repositories for GeneralistYOLO
Users that are interested in GeneralistYOLO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official PyTorch Implementation of Bucketed Ranking-based Losses for Efficient Training of Object Detectors [ECCV2024]☆26Apr 27, 2025Updated 11 months ago
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆32Sep 29, 2024Updated last year
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆20Mar 3, 2025Updated last year
- Yolo26 model supports android deployment.☆30Jan 21, 2026Updated 2 months ago
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications☆10Apr 2, 2024Updated last year
- ☆14May 16, 2023Updated 2 years ago
- Spatial Transformer Network YOLO Model for Agricultural Object Detection☆18Sep 18, 2024Updated last year
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆52Sep 8, 2022Updated 3 years ago
- ppstructure deploy by ncnn☆36Jul 16, 2024Updated last year
- Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking☆86Jun 9, 2025Updated 9 months ago
- Un-offical PyTorch Implementation of "Class-Balanced Distillation for Long-Tailed Visual Recognition" paper.☆17Oct 31, 2021Updated 4 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- C++ implementation of "Mobile Vision Transformer-based Visual Object Tracking" (BMVC2023) and "Separable Self and Mixed Attention Transf…☆12Apr 23, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆15Oct 31, 2022Updated 3 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- Real-time CenterNet based object detection on fused IR/Depth images from Kinect sensor. Works on NVIDIA Jetson.☆19Jan 5, 2021Updated 5 years ago
- A Deeplearn Model to rec table in photo with ncnn. 一个深度学习模型用于检测图片中的表格 画像内のテーブルを検出するためのディープラーニング モデル☆20Mar 2, 2025Updated last year
- Generate bird's-eye views of conference proceedings.☆24Jul 17, 2025Updated 8 months ago
- Multi-Sensor Time Synchronisation System☆20Jan 5, 2025Updated last year
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆60Nov 10, 2025Updated 4 months ago
- An official implementation for APNet: Urban-level Scene Segmentation of Aerial Images and Point Clouds☆10Feb 7, 2024Updated 2 years ago
- SAM and lama inpaint,包含QT的GUI交互界面,实现了交互式可实时显示结果的画点、画框进行SAM,然后通过进行Inpaint,具体操作看readme里的视频。☆53Jan 30, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆384Nov 17, 2025Updated 4 months ago
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆31Nov 13, 2025Updated 4 months ago
- RF-DETR C++ tensorrt : Real-Time End-to-End Object Detection☆71Dec 25, 2025Updated 3 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding☆212Oct 15, 2025Updated 5 months ago
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- An MIT License of YOLOv9, YOLOv7, YOLO-RD☆1,636Mar 16, 2026Updated 2 weeks ago
- 用于学习GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- Part of 5th place solution for Peking University/Baidu - Autonomous Driving on Kaggle (https://www.kaggle.com/c/pku-autonomous-driving).☆23Sep 11, 2020Updated 5 years ago
- Learning from Noisy Anchors for One-stage Object Detection☆27Apr 14, 2021Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Anchor Assignment and Sampling Heuristics in Deep Object Detection: A Review☆11Aug 2, 2022Updated 3 years ago
- ☆32Mar 25, 2024Updated 2 years ago
- ☆16Mar 24, 2025Updated last year
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆25Dec 2, 2024Updated last year
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- XDoG(Extended Difference of Gaussians)アルゴリズムを用いた線画抽出のサンプルです。☆15Jan 28, 2021Updated 5 years ago
- Gstreamer based Edge AI reference application☆12Feb 26, 2024Updated 2 years ago