Mozhgan91 / LEOView external linksLinks
LEO: A powerful Hybrid Multimodal LLM
☆19Jan 18, 2025Updated last year
Alternatives and similar repositories for LEO
Users that are interested in LEO are comparing it to the libraries listed below
Sorting:
- ☆10Apr 7, 2025Updated 10 months ago
- ☆13Mar 28, 2025Updated 10 months ago
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆21Apr 16, 2025Updated 9 months ago
- Visual Spatial Tuning☆172Feb 1, 2026Updated last week
- ☆23Jul 11, 2025Updated 7 months ago
- [CVPR 2025] Official code of "DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting"☆46Sep 5, 2025Updated 5 months ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Nov 15, 2025Updated 2 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆81Jul 4, 2025Updated 7 months ago
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆64Jul 22, 2025Updated 6 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆31Apr 20, 2025Updated 9 months ago
- ☆37Jun 20, 2025Updated 7 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆42Apr 27, 2025Updated 9 months ago
- The official implementation of our work Hawkeye: Discovering and Grounding Implicit Anomalous Sentiment in Recon-videos via Scene-enhanc…☆12Oct 14, 2024Updated last year
- The code for paper entitled "Data-Driven Modulation Optimization with LMMSE Equalization for Reliability Enhancement in Underwater Acoust…☆19Oct 4, 2025Updated 4 months ago
- ☆33Sep 27, 2024Updated last year
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated last year
- [AAAI2025 selected as oral] - Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints☆44Jul 2, 2025Updated 7 months ago
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆18Jul 10, 2025Updated 7 months ago
- Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning…☆35Feb 10, 2023Updated 3 years ago
- [IROS 2024] Official implementation of paper: DriVLMe: "Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experience…☆53Nov 16, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Make Large Multimodal Models excel in object detection, ICCV 2025☆63Aug 1, 2025Updated 6 months ago
- ☆101Dec 27, 2024Updated last year
- [ICML 2025] Official Github Repo for WOMD-Reasoning Dataset☆41Nov 27, 2025Updated 2 months ago
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- ☆11Jan 18, 2025Updated last year
- 在线学习网站 教师端+学生端 (课件资源上传下载删除、教学团队、班级管理、学生管理、考勤、作业提交批改评分、讨论区、找回密码)☆11Feb 16, 2022Updated 3 years ago
- [CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation☆13Jun 17, 2024Updated last year
- Official repository of the paper "High-Quality Mask Tuning Matters for Open-Vocabulary Segmentation"☆45Mar 25, 2025Updated 10 months ago
- ☆24Nov 27, 2025Updated 2 months ago
- DigiKam media files search by contained objects☆13Feb 23, 2022Updated 3 years ago
- ☆10Apr 13, 2019Updated 6 years ago
- The official repository of UVOSAM☆13Jun 5, 2024Updated last year
- ☆13Jan 21, 2025Updated last year
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆20Nov 1, 2025Updated 3 months ago