☆401Dec 12, 2024Updated last year
Alternatives and similar repositories for llava-phi
Users that are interested in llava-phi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a family of highly capabale yet efficient large multimodal models☆193Aug 23, 2024Updated last year
- A Framework of Small-scale Large Multimodal Models☆965Updated this week
- A family of lightweight multimodal models.☆1,054Nov 18, 2024Updated last year
- Strong and Open Vision Language Assistant for Mobile Devices☆1,345Apr 15, 2024Updated last year
- 🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)☆848Aug 5, 2025Updated 7 months ago
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization☆584Jun 7, 2024Updated last year
- TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones☆1,310Feb 5, 2026Updated last month
- 【TMM 2025🔥】 Mixture-of-Experts for Large Vision-Language Models☆2,307Jul 15, 2025Updated 8 months ago
- An open-source implementation for training LLaVA-NeXT.☆436Oct 23, 2024Updated last year
- TxBKG - Knowledge Graph Generation for Any PDFs☆188Nov 22, 2024Updated last year
- Official Repo For OMG-LLaVA and OMG-Seg codebase [CVPR-24 and NeurIPS-24]☆1,344Oct 15, 2025Updated 5 months ago
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- ☆247Nov 24, 2024Updated last year
- [CVPR2024 Highlight]GLEE: General Object Foundation Model for Images and Videos at Scale☆1,171Oct 21, 2024Updated last year
- ☆4,607Sep 14, 2025Updated 6 months ago
- An Workspace for HMI tools☆164Jul 11, 2024Updated last year
- A flexible and efficient codebase for training visually-conditioned language models (VLMs)☆956Jul 4, 2024Updated last year
- Book Recommendation System☆235May 2, 2024Updated last year
- ☆287Jul 6, 2024Updated last year
- Codebear: A fast and memory efficient code completion system based on CodeLlama☆78Jun 3, 2024Updated last year
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Jul 11, 2024Updated last year
- Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model☆281Jun 25, 2024Updated last year
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks☆3,916Mar 15, 2026Updated last week
- ☆143May 25, 2024Updated last year
- AI solution for Patent Classification☆143Jun 29, 2020Updated 5 years ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆1,030Aug 4, 2025Updated 7 months ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated 2 months ago
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆143Mar 23, 2023Updated 3 years ago
- ☆125Jul 29, 2024Updated last year
- ☆87Dec 20, 2024Updated last year
- Collection of typescript utility types that extends the official utility types.☆73Jul 5, 2024Updated last year
- Large-Scale Selfie Video Dataset (L-SVD): A Benchmark for Emotion Recognition☆306Aug 18, 2024Updated last year
- The first open-source, cloud-native TCP long-connection gateway for edge environments, enabling direct service-to-edge/client communicati…☆287Feb 24, 2026Updated last month
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆147Jun 2, 2023Updated 2 years ago
- 🔗 Serverless blockchain analytics pipeline on AWS - Extract, process and visualize Ethereum data using Kinesis, Lambda, Redshift Serverl…☆103Oct 5, 2023Updated 2 years ago
- Imagine building a whole operating system around just your notes.☆80Feb 5, 2025Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://arxiv.org/abs/2311.098…☆318Jul 31, 2025Updated 7 months ago
- Build a simple yet effective CNN to work as a sketch recognizer. Just like Google Quick-Draw Project.☆143Mar 23, 2023Updated 3 years ago