YifanXu74 / Libra
Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)
☆143Updated last month
Related projects
Alternatives and complementary repositories for Libra
- This is the official repository of GIM.☆215Updated 5 months ago
- [CVPR 2024] Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis☆175Updated last month
- This repository is the official implementation of "DTL: Disentangled Transfer Learning for Visual Recognition", which is accepted by AAAI…☆76Updated 9 months ago
- [ECAI 2024] Official code for "TwinDiffusion: Enhancing Coherence and Efficiency in Panoramic Image Generation with Diffusion Models".☆25Updated last month
- OD-FinLLM is a refined model derived from the LLaMA series, with specific enhancements for Chinese financial knowledge. This model is bui…☆186Updated 2 months ago
- ☆353Updated 3 months ago
- xFinder: Robust and Pinpoint Answer Extraction for Large Language Models☆146Updated 3 weeks ago
- A Unified Parameter-Efficient Transfer Learning Benchmark for Computer Vision Tasks☆264Updated 3 months ago
- To support and further the research in the field of portrait animation, we are excited to launch PhotoPoster, an open project for pose-d…☆223Updated 2 months ago
- MoH: Multi-Head Attention as Mixture-of-Head Attention☆157Updated 3 weeks ago
- [LMM + AIGC] What do we expect from LMMs as AIGI evaluators and how do they perform?☆149Updated last month
- MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts☆140Updated last month
- A message queue based on the AMQP model, implemented in C++☆116Updated 3 weeks ago
- 🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2…☆65Updated last week
- A curated list of research based on CLIP.☆58Updated this week
- A physics-guided hierarchical deep network (PhyRes-LSTM) framework, which integrates external knowledge with deep neural networks to guid…☆17Updated 2 months ago
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator☆108Updated last month
- This is the PyTorch implementation of our paper: Unlocking the Potential of Multimodal Unified Discrete Representation through Training-…☆87Updated 6 months ago
- The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"☆152Updated 3 weeks ago
- [CVPR 2024] "MACE: Mass Concept Erasure in Diffusion Models" (Official Implementation)☆384Updated this week
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆85Updated 7 months ago
- "Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances" (Official Implementation)☆160Updated this week
- https://arxiv.org/abs/2408.02032☆64Updated 3 weeks ago
- A benchmark for video quality understanding of LMMs☆108Updated last month
- Biomedical Generalist Video Generation Model☆190Updated last month
- The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)☆194Updated last month
- ☆181Updated 10 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆161Updated this week
- A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement, ASE 2024☆69Updated last week