LEO: A powerful Hybrid Multimodal LLM
☆20Jan 18, 2025Updated last year
Alternatives and similar repositories for LEO
Users that are interested in LEO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cadeira AeC☆14Jan 11, 2022Updated 4 years ago
- ☆10Apr 7, 2025Updated 11 months ago
- ☆13Mar 28, 2025Updated 11 months ago
- Visual Spatial Tuning☆187Mar 17, 2026Updated last week
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 6 months ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆23Apr 16, 2025Updated 11 months ago
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆19Jul 20, 2024Updated last year
- ☆25Jan 15, 2025Updated last year
- [CVPR 2025] Official code of "DiET-GS: Diffusion Prior and Event Stream-Assisted Motion Deblurring 3D Gaussian Splatting"☆50Sep 5, 2025Updated 6 months ago
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated last year
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆31Feb 22, 2026Updated last month
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆35Jun 30, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- List of papers on Hallucination in LMM☆10Nov 29, 2023Updated 2 years ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".☆15Apr 3, 2020Updated 5 years ago
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆83Jul 4, 2025Updated 8 months ago
- phyOS arch linux (pacman) repository☆11Feb 27, 2026Updated 3 weeks ago
- ☆15Jan 24, 2018Updated 8 years ago
- 在线学习网站 教师端+学生端 (课件资源上传下载删除、教学团队、班级管理、学生管理、考勤、作业提交批改评分、讨论区、找回密码)☆11Feb 16, 2022Updated 4 years ago
- ☆11Oct 31, 2024Updated last year
- The paper list of multilingual pre-trained models (Continual Updated).☆24Jun 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆18Oct 13, 2025Updated 5 months ago
- Offical implementation of "Re-Aligning Language to Visual Objects with an Agentic Workflow"☆32Apr 20, 2025Updated 11 months ago
- Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning…☆35Feb 10, 2023Updated 3 years ago
- "Todos sabemos que esta vida está agreste mas o português é mestre na arte de desenrascar"☆19Mar 8, 2026Updated 2 weeks ago
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries☆35Nov 19, 2025Updated 4 months ago
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models☆78Jul 14, 2025Updated 8 months ago
- ☆33Sep 27, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [ICML 2025] Official Github Repo for WOMD-Reasoning Dataset☆43Nov 27, 2025Updated 3 months ago
- 学生作业上传、预览、打分系统☆11Jul 18, 2016Updated 9 years ago
- 给科研小白的一些资源与工具推荐☆17Jul 6, 2020Updated 5 years ago
- (AAAI 24) Step Vulnerability Guided Mean Fluctuation Adversarial Attack against Conditional Diffusion Models☆11Oct 12, 2024Updated last year
- ☆18Nov 14, 2025Updated 4 months ago
- The official implementation of "Cross-modal Causal Relation Alignment for Video Question Grounding. (CVPR 2025 Highlight)"☆45Apr 27, 2025Updated 11 months ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year