Generalist YOLO: Towards Real-Time End-to-End Multi-Task Visual Language Models
☆93May 1, 2025Updated last year
Alternatives and similar repositories for GeneralistYOLO
Users that are interested in GeneralistYOLO are comparing it to the libraries listed below.
- Official PyTorch Implementation of Bucketed Ranking-based Losses for Efficient Training of Object Detectors [ECCV2024]☆27Apr 27, 2025Updated last year
- A cutlass cute implementation of headdim-64 flashattentionv2 TensorRT plugin for LightGlue. Run on Jetson Orin NX 8GB with TensorRT 8.5.…☆20Mar 3, 2025Updated last year
- Acuitylite is an end-to-end neural network deployment tool☆22Nov 4, 2025Updated 7 months ago
- Parallel LiDAR Point Cloud Preprocessing for Autonomous Driving Applications☆10Apr 2, 2024Updated 2 years ago
- ☆14May 16, 2023Updated 3 years ago
- A very simple tool that compresses the overall size of the ONNX model by aggregating duplicate constant values as much as possible.☆53Sep 8, 2022Updated 3 years ago
- Yolo26 model supports android deployment.☆40Jan 21, 2026Updated 4 months ago
- Object tracking pipelines complete with RF-DETR, YOLOv9, YOLO-NAS, YOLOv8, and YOLOv7 detection and BYTETracker tracking☆88Jun 9, 2025Updated last year
- A tool to convert a TensorRT engine/plan to a fake ONNX model☆41Nov 22, 2022Updated 3 years ago
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆55Jun 4, 2026Updated 2 weeks ago
- a VINS algorithm with a combination of stereo fisheye images, cubemap, line features, dense mapping and loop closure☆37May 11, 2023Updated 3 years ago
- C++ implementation of "Mobile Vision Transformer-based Visual Object Tracking" (BMVC2023) and "Separable Self and Mixed Attention Transf…☆13Apr 23, 2024Updated 2 years ago
- Real-time CenterNet based object detection on fused IR/Depth images from Kinect sensor. Works on NVIDIA Jetson.☆19Jan 5, 2021Updated 5 years ago
- Multi-Sensor Time Synchronisation System☆20Jan 5, 2025Updated last year
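- SAM and LaMa inpainting with a Qt GUI: draw points or boxes interactively to run SAM with results shown in real time, then inpaint the selection. See the video in the README for detailed usage.☆54Jan 30, 2024Updated 2 years ago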
- Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX☆34Nov 13, 2022Updated 3 years ago
- [ACM MM 2024] Hierarchical Multimodal Fine-grained Modulation for Visual Grounding.☆65Nov 10, 2025Updated 7 months ago
- Integrating Dual Coordinate Attention with Adaptive Kernel Based Convolution Network for Medicinal Flower Identification☆18Nov 5, 2025Updated 7 months ago
- RF-DETR C++ tensorrt : Real-Time End-to-End Object Detection☆78Dec 25, 2025Updated 5 months ago
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding☆214Oct 15, 2025Updated 8 months ago
- Knock your images before you get stressed.☆11Jan 9, 2022Updated 4 years ago
- Export HDR movie from Scene Capture 2D on Unreal Engine 4☆12Sep 1, 2015Updated 10 years ago
- For learning GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- An MIT License of YOLOv9, YOLOv7, YOLO-RD☆1,693Mar 16, 2026Updated 3 months ago
- Part of 5th place solution for Peking University/Baidu - Autonomous Driving on Kaggle (https://www.kaggle.com/c/pku-autonomous-driving).☆23Sep 11, 2020Updated 5 years ago
- ☆10Dec 21, 2020Updated 5 years ago
- Learning from Noisy Anchors for One-stage Object Detection☆27Apr 14, 2021Updated 5 years ago
- ☆13Apr 10, 2022Updated 4 years ago
- Anchor Assignment and Sampling Heuristics in Deep Object Detection: A Review☆11Aug 2, 2022Updated 3 years ago
- TAPFormer is a model that fuses images and events for high-frame-rate tracking of any point (pixel).☆44Updated this week
- ☆16Mar 24, 2025Updated last year
- CMake wrapper for installing LibTorch (PyTorch C++ API)☆11Jan 13, 2026Updated 5 months ago
- A sample of line-art extraction using the XDoG (Extended Difference of Gaussians) algorithm.☆15Jan 28, 2021Updated 5 years ago
- Tengine Pipe is a helper tool for quickly producing demos☆11Jul 15, 2021Updated 4 years ago
- A plugin-oriented framework for video structuring. Developers in China can add WeChat zhzhi78 to join the discussion group.☆18May 28, 2024Updated 2 years ago
- ☆11Feb 4, 2024Updated 2 years ago
- Implementing realtime photometric-stereo using a monochromatic Point Grey Chameleon camera and 8 leds.☆66Apr 28, 2017Updated 9 years ago
- Multi-Person Tracking in Tour Guide Robot☆10Aug 23, 2022Updated 3 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.☆15Nov 15, 2022Updated 3 years ago