ShilinSun / mxai_review
☆10Updated 4 months ago
Alternatives and similar repositories for mxai_review
Users that are interested in mxai_review are comparing it to the libraries listed below
Sorting:
- The official GitHub page for paper "NegativePrompt: Leveraging Psychology for Large Language Models Enhancement via Negative Emotional St…☆22Updated last year
- [NeurIPS 2024 Spotlight] Code for the paper "Flex-MoE: Modeling Arbitrary Modality Combination via the Flexible Mixture-of-Experts"☆51Updated 6 months ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆15Updated 11 months ago
- Implementation of the "the first large-scale multimodal mixture of experts models." from the paper: "Multimodal Contrastive Learning with…☆29Updated last month
- ☆21Updated 6 months ago
- Code for paper "Unraveling Cross-Modality Knowledge Conflicts in Large Vision-Language Models."☆42Updated 7 months ago
- Code for CVPR2025 "MMRL: Multi-Modal Representation Learning for Vision-Language Models".☆33Updated last month
- ☆19Updated last year
- MuCR is a benchmark designed to evaluate Multimodal Large Language Models' (MLLMs) ability to discern causal links across modalities☆15Updated 3 months ago
- ☆17Updated 9 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆16Updated 6 months ago
- The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".☆27Updated last month
- [CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆23Updated last month
- CLIP-MoE: Mixture of Experts for CLIP☆34Updated 7 months ago
- ☆19Updated last month
- ☆12Updated 4 months ago
- Official Code for ACL 2023 Outstanding Paper: World-to-Words: Grounded Open Vocabulary Acquisition through Fast Mapping in Vision-Languag…☆32Updated last year
- Awesome Learn From Model Beyond Fine-Tuning: A Survey☆63Updated 5 months ago
- Do Vision and Language Models Share Concepts? A Vector Space Alignment Study☆14Updated 5 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 3 months ago
- A Survey of Personalization: From RAG to Agent☆32Updated 3 weeks ago
- Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning☆21Updated 8 months ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆15Updated 10 months ago
- The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"☆24Updated last week
- ☆18Updated 3 weeks ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆59Updated 6 months ago
- ☆34Updated last week
- The efficient tuning method for VLMs☆81Updated last year
- code for Learning the Unlearned: Mitigating Feature Suppression in Contrastive Learning☆16Updated 10 months ago
- ☆46Updated 4 months ago