paper: https://arxiv.org/abs/2307.02469 page: https://lynx-llm.github.io/
☆272 · Aug 9, 2023 · Updated 2 years ago
Alternatives and similar repositories for lynx-llm
Users who are interested in lynx-llm are comparing it to the libraries listed below.
- 🦩 Official repository of paper "Visual Instruction Tuning with Polite Flamingo" (AAAI-24 Oral) ☆65 · Dec 9, 2023 · Updated 2 years ago
- (CVPR2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions. ☆364 · Jan 14, 2025 · Updated last year
- The official GitHub page for "What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins… ☆19 · Nov 10, 2023 · Updated 2 years ago
- Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding" ☆268 · Jun 12, 2024 · Updated 2 years ago
- MMICL, a state-of-the-art VLM with in-context learning ability, from ICL, PKU ☆361 · Dec 18, 2023 · Updated 2 years ago
- mPLUG-Owl: The Powerful Multi-modal Large Language Model Family ☆2,539 · Apr 2, 2025 · Updated last year
- Touchstone: Evaluating Vision-Language Models by Language Models ☆83 · Jan 18, 2024 · Updated 2 years ago
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?". ☆60 · Jun 27, 2023 · Updated 3 years ago
- Emu Series: Generative Multimodal Models from BAAI ☆1,775 · Jan 12, 2026 · Updated 5 months ago
- Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna. ☆269 · Oct 13, 2023 · Updated 2 years ago
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning ☆297 · Mar 13, 2024 · Updated 2 years ago
- ☆813 · Jul 8, 2024 · Updated last year
- ☆134 · Dec 22, 2023 · Updated 2 years ago
- 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing imp… ☆3,416 · Mar 5, 2024 · Updated 2 years ago
- MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024) ☆327 · Jan 20, 2025 · Updated last year
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content ☆603 · Oct 6, 2024 · Updated last year
- SVIT: Scaling up Visual Instruction Tuning ☆168 · Jun 20, 2024 · Updated 2 years ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs ☆57 · Mar 9, 2025 · Updated last year
- The official GitHub page for "Evaluating Object Hallucination in Large Vision-Language Models" ☆263 · Aug 21, 2025 · Updated 10 months ago
- [ICLR 2024 & ECCV 2024] The All-Seeing Projects: Towards Panoptic Visual Recognition&Understanding and General Relation Comprehension of … ☆509 · Aug 9, 2024 · Updated last year
- ☆363 · Jan 27, 2024 · Updated 2 years ago
- Lion: Kindling Vision Intelligence within Large Language Models ☆51 · Jan 25, 2024 · Updated 2 years ago
- (ECCVW 2025) GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest ☆556 · Jun 3, 2025 · Updated last year
- LAVIS - A One-stop Library for Language-Vision Intelligence ☆11,243 · Jun 2, 2026 · Updated 3 weeks ago
- [NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models" ☆523 · Jan 27, 2024 · Updated 2 years ago
- ☆354 · May 25, 2024 · Updated 2 years ago
- An open-source framework for training large multimodal models. ☆4,110 · Aug 31, 2024 · Updated last year
- Latest Advances on Multimodal Large Language Models ☆17,900 · Jun 18, 2026 · Updated last week
- ☆91 · Nov 25, 2023 · Updated 2 years ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM* ☆110 · May 27, 2025 · Updated last year
- [TLLM'23] PandaGPT: One Model To Instruction-Follow Them All ☆858 · Jun 1, 2023 · Updated 3 years ago
- Aligning LMMs with Factually Augmented RLHF ☆396 · Nov 1, 2023 · Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters ☆5,920 · Mar 14, 2024 · Updated 2 years ago
- [CVPR2023] The code for "Position-guided Text Prompt for Vision-Language Pre-training" ☆150 · Jun 7, 2023 · Updated 3 years ago
- [NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning ☆95 · Jan 7, 2025 · Updated last year
- Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing imag… ☆566 · Apr 21, 2024 · Updated 2 years ago
- ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models ☆650 · Dec 23, 2024 · Updated last year
- [ICLR'24 spotlight] Chinese and English Multimodal Large Model Series (Chat and Paint) | A Chinese-English bilingual multimodal large model series based on the CPM foundation model ☆1,067 · Jun 13, 2024 · Updated 2 years ago
- Cambrian-1 is a family of multimodal LLMs with a vision-centric design. ☆2,005 · Nov 7, 2025 · Updated 7 months ago