hongwang600 / FLAIR
☆61Updated last year
Alternatives and similar repositories for FLAIR
Users who are interested in FLAIR are comparing it to the libraries listed below.
- ☆80Updated last year
- This is a repository for "PMET: Precise Model Editing in a Transformer"☆55Updated 2 years ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆37Updated 2 years ago
- ☆140Updated last month
- research work on multimodal cognitive ai☆68Updated last month
- ☆87Updated 2 years ago
- How Good is Google Bard's Visual Understanding? An Empirical Study on Open Challenges☆30Updated 2 years ago
- Official implementation of InstructZero, the first framework to optimize bad prompts of ChatGPT (API LLMs) and finally obtain good prompts…☆197Updated last year
- [ACL2025] Unsolvable Problem Detection: Robust Understanding Evaluation for Large Multimodal Models☆79Updated 8 months ago
- ☆17Updated 2 years ago
- Code for T-MARS data filtering☆35Updated 2 years ago
- ☆139Updated 2 years ago
- PyTorch code for the paper "An Empirical Study of Multimodal Model Merging"☆37Updated 2 years ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆125Updated last year
- LLMScore: Unveiling the Power of Large Language Models in Text-to-Image Synthesis Evaluation☆134Updated 2 years ago
- ☆83Updated 2 years ago
- ☆150Updated 2 years ago
- M4 experiment logbook☆58Updated 2 years ago
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆119Updated 2 years ago
- Code for the paper: "No Zero-Shot Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance" [NeurI…☆94Updated last year
- Multimodal-Procedural-Planning☆93Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated 2 years ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆78Updated 2 years ago
- ☆29Updated 2 years ago
- An official codebase for the paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos" (ICCV 23)☆52Updated 2 years ago
- Public code repo for EMNLP 2024 Findings paper "MACAROON: Training Vision-Language Models To Be Your Engaged Partners"☆14Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Updated 11 months ago
- ☆41Updated last year
- Code for the paper: Harnessing Webpage UIs for Text-Rich Visual Understanding☆53Updated last year
- [NAACL 2025 Oral] Multimodal Needle in a Haystack (MMNeedle): Benchmarking Long-Context Capability of Multimodal Large Language Models☆54Updated 8 months ago