MingyuJ666 / SEAttnGANLinks
[ICONIP'24]Mingyu.Jin's final year project
☆29Updated last year
Alternatives and similar repositories for SEAttnGAN
Users that are interested in SEAttnGAN are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of Rethinking Guidance Information to Utilize Unlabeled Samples: A Label-Encoding Perspective.☆19Updated last year
- We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench show…☆48Updated last week
- [ICLR 2025] Code for Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models☆24Updated 8 months ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Updated last month
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Updated 11 months ago
- [ICLR 2025] Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention Causality☆60Updated 6 months ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆29Updated 3 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆17Updated 6 months ago
- ☆38Updated last month
- Official implementation of "Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation" (CVPR 202…☆40Updated 7 months ago
- [FCS'24] LVLM Safety paper☆19Updated last year
- RFTT: Reasoning with Reinforced Functional Token Tuning☆29Updated 7 months ago
- [ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concen…☆84Updated 6 months ago
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- ☆16Updated last year
- [NeurIPS 2025] Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO☆73Updated 2 months ago
- [AAAI'26 Oral] Official Implementation of STAR-1: Safer Alignment of Reasoning LLMs with 1K Data☆33Updated 9 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆83Updated last month
- [NeurIPS25 Spotlight] EMPO, A Fully Unsupervised RLVR Method☆90Updated last month
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆30Updated 6 months ago
- Code for "TrustRAG: Enhancing Robustness and Trustworthiness in RAG" AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)☆53Updated 9 months ago
- [NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"☆27Updated 3 months ago
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆52Updated 3 months ago
- [NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models☆73Updated 7 months ago
- Reinforcement Learning of Vision Language Models with Self Visual Perception Reward☆157Updated 3 months ago
- ☆77Updated last year
- Official implementation for the paper"Towards Understanding How Knowledge Evolves in Large Vision-Language Models"☆25Updated 9 months ago
- [NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness☆53Updated 2 weeks ago
- Agentic MLLMs☆133Updated 2 months ago
- [NeurIPS 2025] Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆87Updated 5 months ago