MingyuJ666 / SEAttnGAN
[ICONIP'24] Mingyu.Jin's final year project
☆28 Updated 5 months ago
Alternatives and similar repositories for SEAttnGAN:
Users who are interested in SEAttnGAN are comparing it to the libraries listed below:
- VAEGAN, I Love u ☆16 Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024) ☆15 Updated 2 months ago
- [FCS'24] LVLM Safety paper ☆17 Updated 3 weeks ago
- A hot-pluggable tool for visualizing LLaVA's attention. ☆13 Updated last year
- ☆75 Updated 5 months ago
- A Self-Training Framework for Vision-Language Reasoning ☆62 Updated last week
- [NeurIPS'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought … ☆212 Updated last month
- Collection of papers and repos for multimodal chain-of-thought ☆30 Updated 2 months ago
- This is the code repository for StarCraft II Agent. ☆9 Updated 8 months ago
- All about Robotics and AI Agents you need are here ☆27 Updated 9 months ago
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning ☆18 Updated 2 months ago
- [NeurIPS 2024] Repo for the paper "ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models" ☆137 Updated last week
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models ☆63 Updated 7 months ago
- ☆14 Updated last month
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models ☆23 Updated 3 months ago
- LoTa-Bench: Benchmarking Language-oriented Task Planners for Embodied Agents (ICLR 2024) ☆65 Updated 5 months ago
- ☆9 Updated 10 months ago
- [CVPR 2024] This is the official implementation of MP5 ☆93 Updated 7 months ago
- MMICL, a state-of-the-art VLM with the in-context learning ability from ICL, PKU ☆44 Updated last year
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective. ☆19 Updated 4 months ago
- ☆25 Updated this week
- Latest Advances on Embodied Multimodal LLMs (or Vision-Language-Action Models). ☆95 Updated 6 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models ☆81 Updated 4 months ago
- [ACL 2024] Logical Closed Loop: Uncovering Object Hallucinations in Large Vision-Language Models. Detect and mitigate object hallucinatio… ☆20 Updated this week
- Visualizing the attention of vision-language models ☆102 Updated 3 months ago
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation ☆33 Updated last month
- The official repository for the Scientific Paper Idea Proposer (SciPIP) ☆55 Updated last month
- [NeurIPS 2024] TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration ☆22 Updated 3 months ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models ☆141 Updated 9 months ago
- This repository is used for advertising PhD recruitment opportunities. Contributions are welcome! ☆165 Updated 3 weeks ago