AAAI 2024: Visual Instruction Generation and Correction
☆97Feb 4, 2024Updated 2 years ago
Alternatives and similar repositories for VIGC
Users that are interested in VIGC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Data Set Description Language Specification (新一代人工智能数据集描述语言DSDL)☆46May 29, 2024Updated last year
- SDK of OpenDataLab - https://opendatalab.org.cn☆60Jul 31, 2025Updated 9 months ago
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆48May 24, 2024Updated 2 years ago
- datasets resource☆142May 15, 2026Updated last week
- ☆25Nov 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆121Jan 15, 2026Updated 4 months ago
- ECCV2024_Parrot Captions Teach CLIP to Spot Text☆66Sep 6, 2024Updated last year
- 万卷1.0多模态语料☆574Oct 20, 2023Updated 2 years ago
- WanJuan-CC是以CommonCrawl为基础,经过数据抽取,规则清洗,去重,安全过滤,质量清洗等步骤得到的高质量数据。☆13Apr 18, 2024Updated 2 years ago
- Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization☆103Jan 30, 2024Updated 2 years ago
- Open-source multimodal data annotation platform with AI auto-annotation support.☆1,575Apr 22, 2026Updated last month
- The Open-Source Data Annotation Platform☆1,226Feb 19, 2025Updated last year
- ☆121Jun 11, 2024Updated last year
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆52Jul 16, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the repo for the paper Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining.☆48Aug 22, 2025Updated 9 months ago
- [CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allo…☆409Aug 24, 2024Updated last year
- UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition☆476Sep 28, 2025Updated 7 months ago
- WanJuan3.0( “万卷·丝路”)一个作为综合性的纯文本语料库,采集了多个国家地区的网络公开信息、文献、专利等资料,数据总规模超1.2TB,Token总数超过300B,处于国际领先水平,首期开源的语料库主要由泰语、俄语、阿拉伯语、韩语和越南语5个子集构成,每个子集的数据…☆46Feb 13, 2025Updated last year
- [ICCV25 Highlight] The official implementation of the paper "LEGION: Learning to Ground and Explain for Synthetic Image Detection"☆79Oct 22, 2025Updated 7 months ago
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆156Apr 30, 2024Updated 2 years ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆260Aug 21, 2025Updated 9 months ago
- Enhancing Large Vision Language Models with Self-Training on Image Comprehension.☆68May 31, 2024Updated last year
- [ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"☆10Jul 24, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning☆297Mar 13, 2024Updated 2 years ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 6 months ago
- Normal Learning in Videos with Attention Prototype Network☆18Jan 19, 2023Updated 3 years ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆101Dec 3, 2025Updated 5 months ago
- [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…☆337Oct 14, 2025Updated 7 months ago
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆45Jun 14, 2024Updated last year
- List of papers on Hallucination in LMM☆10Nov 29, 2023Updated 2 years ago
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆76Oct 16, 2024Updated last year
- [ACL 2026 Oral] Official implementation of LaMI: Augmenting Large Language Models via Late Multi-Image Fusion☆19Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"☆211Sep 26, 2024Updated last year
- ☆75Mar 7, 2024Updated 2 years ago
- [ACL 2024 Main Conference] Chinese commonsense benchmark for LLMs☆45Jul 27, 2024Updated last year
- ✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models☆649Dec 23, 2024Updated last year
- [CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale☆214Feb 27, 2024Updated 2 years ago
- [ACL 2025 Findings] Official pytorch implementation of "Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vis…☆25Jul 21, 2024Updated last year
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year