LEO: A powerful Hybrid Multimodal LLM
☆20Jan 18, 2025Updated last year
Alternatives and similar repositories for LEO
Users that are interested in LEO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cadeira AeC☆14Jan 11, 2022Updated 4 years ago
- ☆10Apr 7, 2025Updated last year
- ☆13Mar 28, 2025Updated last year
- Visual Spatial Tuning☆197Mar 25, 2026Updated 2 months ago
- ☆23Jul 11, 2025Updated 11 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆17Jun 1, 2026Updated 2 weeks ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 3 years ago
- The code for paper entitled "Data-Driven Modulation Optimization with LMMSE Equalization for Reliability Enhancement in Underwater Acoust…☆19Apr 9, 2026Updated 2 months ago
- Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models☆25Apr 16, 2025Updated last year
- [ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model☆22Jul 20, 2024Updated last year
- ☆25Jan 15, 2025Updated last year
- LLMBind: A Unified Modality-Task Integration Framework☆19Jun 16, 2024Updated 2 years ago
- [NeurIPS25] Official Implementation (Pytorch) of "DeepVideo-R1"☆35Feb 22, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Task Preference Optimization: Improving Multimodal Large Language Models with Vision Task Alignment☆65Jul 22, 2025Updated 10 months ago
- Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation☆35Jun 30, 2025Updated 11 months ago
- List of papers on Hallucination in LMM☆10Nov 29, 2023Updated 2 years ago
- [CVPR'24] Code for Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models☆18Jul 22, 2024Updated last year
- (AAAI'20) The source code for the paper "Joint Parsing and Generation for Abstractive Summarization".☆15Apr 3, 2020Updated 6 years ago
- [CVPR 2025] LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding☆83Jul 4, 2025Updated 11 months ago
- the pytorch implementation of SubCenterArcface and sphereface2. And i add the prove of easy_margin part of Arcface in the codes.☆12Dec 1, 2021Updated 4 years ago
- phyOS arch linux (pacman) repository☆11Feb 27, 2026Updated 3 months ago
- 在线学习网站 教师端+学生端 (课件资源上传下载删除、教学团队、班级管理、学生管理、考勤、作业提交批改评分、讨论区、找回密码)☆11Feb 16, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆16Jan 24, 2018Updated 8 years ago
- ☆15Feb 24, 2023Updated 3 years ago
- Provides a selection of 12 logic gates that you can interconnect with patch cables to make a variety of different logic circuits.☆11Feb 28, 2026Updated 3 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆24Jun 18, 2024Updated last year
- Official Implementation of CVPR 2022 paper: "Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning…☆35Feb 10, 2023Updated 3 years ago
- [NeurIPS'25] Backdoor Cleaning without External Guidance in MLLM Fine-tuning☆20Oct 13, 2025Updated 8 months ago
- Learning to Rewrite for Non-Autoregressive Neural Machine Translation☆21Dec 23, 2021Updated 4 years ago
- Image caption and manage tool for AI training☆11Jan 24, 2025Updated last year
- Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"☆23May 26, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Open-vocabulary Semantic Segmentation☆33Feb 16, 2024Updated 2 years ago
- ☆32Updated this week
- Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models☆79Jul 14, 2025Updated 11 months ago
- From Words to Wheels: Automated Style-Customized Policy Generation for Autonomous Driving☆11Mar 16, 2025Updated last year
- [ICML 2025] Official Github Repo for WOMD-Reasoning Dataset☆45Nov 27, 2025Updated 6 months ago
- 🐧 Unify-Agent: An end-to-end unified multimodal agent for faithful, knowledge-grounded image generation.☆82May 2, 2026Updated last month
- 学生作业上传、预览、打分系统☆11Jul 18, 2016Updated 9 years ago