Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
☆92May 1, 2025Updated 11 months ago
Alternatives and similar repositories for GeneralistYOLO
Users who are interested in GeneralistYOLO are comparing it to the repositories listed below.
- Official PyTorch Implementation of Bucketed Ranking-based Losses for Efficient Training of Object Detectors [ECCV2024]☆26Apr 27, 2025Updated 11 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆132Apr 24, 2025Updated 11 months ago
- CLIP and SigLIP models optimized with TensorRT with a Transformers-like API☆33Sep 29, 2024Updated last year
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆20Mar 3, 2025Updated last year
- HunyuanDiT with TensorRT and libtorch☆18May 22, 2024Updated last year
- Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications☆10Apr 2, 2024Updated 2 years ago
- Yolo26 model with support for Android deployment.☆35Jan 21, 2026Updated 2 months ago
- ☆14May 16, 2023Updated 2 years ago
- Spatial Transformer Network YOLO Model for Agricultural Object Detection☆18Sep 18, 2024Updated last year
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆53Sep 8, 2022Updated 3 years ago
- ppstructure deployment with ncnn☆37Jul 16, 2024Updated last year
- Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking☆88Jun 9, 2025Updated 10 months ago
- NeuSyRE: A Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment☆23Mar 10, 2024Updated 2 years ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆54Updated this week
- ☆15Oct 31, 2022Updated 3 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- A VINS algorithm combining stereo fisheye images, cubemaps, line features, dense mapping, and loop closure☆36May 11, 2023Updated 2 years ago
- C++ implementation of "Mobile Vision Transformer-based Visual Object Tracking" (BMVC2023) and "Separable Self and Mixed Attention Transf…☆13Apr 23, 2024Updated last year
- Real-time CenterNet based object detection on fused IR/Depth images from Kinect sensor. Works on NVIDIA Jetson.☆19Jan 5, 2021Updated 5 years ago
- Generate bird's-eye views of conference proceedings.☆24Jul 17, 2025Updated 9 months ago
- Multi-Sensor Time Synchronisation System☆20Jan 5, 2025Updated last year
- Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX☆34Nov 13, 2022Updated 3 years ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆62Nov 10, 2025Updated 5 months ago
- ☆19Apr 6, 2026Updated last week
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆31Nov 13, 2025Updated 5 months ago
- ☆387Nov 17, 2025Updated 5 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding☆213Oct 15, 2025Updated 6 months ago
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- Export HDR movie from Scene Capture 2D on Unreal Engine 4☆12Sep 1, 2015Updated 10 years ago
- YOLOv8 fall detection with a PySide6 GUI; manually choose either a camera or a file as the detection input. See the video clip.☆33May 6, 2025Updated 11 months ago
- An MIT-licensed implementation of YOLOv9, YOLOv7, YOLO-RD☆1,652Mar 16, 2026Updated last month
- For learning GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- Part of 5th place solution for Peking University/Baidu - Autonomous Driving on Kaggle (https://www.kaggle.com/c/pku-autonomous-driving).☆23Sep 11, 2020Updated 5 years ago
- ☆10Dec 21, 2020Updated 5 years ago
- ☆13Apr 10, 2022Updated 4 years ago
- ☆16Mar 24, 2025Updated last year
- OW-OVD: Unified Open World and Open Vocabulary Object Detection (CVPR 2025)☆28Dec 2, 2024Updated last year
- This is a C++ implementation of cocoapi bbox evaluation code.☆11Dec 9, 2021Updated 4 years ago
- Gstreamer based Edge AI reference application☆12Feb 26, 2024Updated 2 years ago