X-PLUG/mPLUG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/X-PLUG/mPLUG)

X-PLUG / mPLUG

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections. (EMNLP 2022)

☆97

Alternatives and similar repositories for mPLUG

Users that are interested in mPLUG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

X-PLUG / mPLUG-2
View on GitHub
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video (ICML 2023)
☆227Jul 21, 2023Updated 3 years ago
X-PLUG / Youku-mPLUG
View on GitHub
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
☆307Jan 8, 2024Updated 2 years ago
X-PLUG / ChatPLUG
View on GitHub
A Chinese Open-Domain Dialogue System
☆324Aug 16, 2023Updated 2 years ago
X-PLUG / mPLUG-HalOwl
View on GitHub
mPLUG-HalOwl: Multimodal Hallucination Evaluation and Mitigating
☆100Jan 29, 2024Updated 2 years ago
X-PLUG / mPLUG-Owl
View on GitHub
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
☆2,535Apr 2, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
X-PLUG / Multi-LLM-Agent
View on GitHub
☆242Apr 23, 2024Updated 2 years ago
prdwb / okvqa-release
View on GitHub
☆15May 10, 2021Updated 5 years ago
HAWLYQ / Qc-TextCap
View on GitHub
☆16Dec 25, 2021Updated 4 years ago
CarolineGao / LoRA-Dataset
View on GitHub
[NeurIPS2023] LoRA: A Logical Reasoning Augmented Dataset for Visual Question Answering
☆12Jan 5, 2024Updated 2 years ago
guoyang9 / UnifER
View on GitHub
Official implementation for the MM'22 paper.
☆14Jun 30, 2022Updated 4 years ago
val-iisc / RMLVQA
View on GitHub
☆19May 31, 2023Updated 3 years ago
YiyangZhou / LURE
View on GitHub
[ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
☆158Apr 30, 2024Updated 2 years ago
szzexpoi / POEM
View on GitHub
Official Implementation for CVPR 2023 paper "Divide and Conquer: Answering Questions with Object Factorization and Compositional Reasonin…
☆10Jun 16, 2024Updated 2 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
richard-peng-xia / KD-CGEC
View on GitHub
Code for Chinese grammatical error correction based on knowledge distillation
☆11Aug 16, 2022Updated 3 years ago
ovguyo / captions-in-VQA
View on GitHub
Using image captions with LLM for zero-shot VQA
☆19Mar 14, 2024Updated 2 years ago
shenxiang-vqa / LSAT
View on GitHub
Local self-attention in Transformer for visual question answering
☆13Mar 17, 2024Updated 2 years ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
YuanLi95 / KECPM
View on GitHub
Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))
☆12Aug 27, 2024Updated last year
alibaba-mmai-research / DiST
View on GitHub
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
☆41Sep 25, 2023Updated 2 years ago
360CVGroup / Bridge_Diffusion_Model
View on GitHub
Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025
☆13Jun 25, 2024Updated 2 years ago
iLearn-Lab / CVPR22-SHA-GCL-for-SGG
View on GitHub
Code for paper "Stacked Hybrid-Attention and Group Collaborative Learning for Unbiased Scene Graph Generation"
☆39Apr 8, 2026Updated 3 months ago
AntNLP / antnlp-tawp
View on GitHub
A Template for Academic Writing Projects
☆16Nov 22, 2019Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
THUNLP-MT / MUSEG
View on GitHub
Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".
☆40Jun 9, 2025Updated last year
juhayna-zh / Awesome-Music-Generation-Papers
View on GitHub
Curated list of groundbreaking music generation research.
☆21Apr 24, 2026Updated 2 months ago
rulixiang / MtS-WH-Dataset
View on GitHub
Multi-temporal Scene dataset for Scene Change Detection.
☆15Apr 14, 2021Updated 5 years ago
cubenlp / CERRU
View on GitHub
CCL2024 Chinese Essay Rhetoric Recognition and Understanding
☆17Oct 1, 2024Updated last year
HAWLYQ / InfoMetIC
View on GitHub
☆13Sep 5, 2023Updated 2 years ago
yifanzhang-pro / AutoMathText
View on GitHub
[ACL 2025 Findings] Autonomous Data Selection with Zero-shot Generative Classifiers for Mathematical Texts (https://huggingface.co/papers…
☆92Nov 23, 2025Updated 7 months ago
X-PLUG / CValues
View on GitHub
面向中文大模型价值观的评估与对齐研究
☆560Jul 20, 2023Updated 3 years ago
alirezasalemi7 / DEDR-MM-FiD
View on GitHub
the code for paper: A Symmetric Dual Encoding Dense Retrieval Framework for Knowledge-Intensive Visual Question Answering
☆14Aug 22, 2023Updated 2 years ago
ajinkya98 / PyTorchCNN
View on GitHub
Implementing CNN in PyTorch with Custom Dataset and Transfer Learning
☆11Aug 24, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
aneeshan95 / Sketch_LVM
View on GitHub
Project page for the paper 'CLIP for All Things Zero-Shot Sketch-Based Image Retrieval, Fine-Grained or Not'
☆78Aug 6, 2023Updated 2 years ago
YiyangZhou / POVID
View on GitHub
[Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning
☆94Apr 30, 2024Updated 2 years ago
ThalesGroup / ConceptBERT
View on GitHub
Implementation of ConceptBert: Concept-Aware Representation for Visual Question Answering
☆31Apr 30, 2024Updated 2 years ago
360CVGroup / Inner-Adaptor-Architecture
View on GitHub
LMM solved catastrophic forgetting, AAAI2025
☆45Apr 15, 2025Updated last year
LukeForeverYoung / QRNet
View on GitHub
☆41Jun 3, 2022Updated 4 years ago
Picsart-AI-Research / Social-Reward
View on GitHub
[ICLR 2024 Spotlight] Social Reward: Evaluating and Enhancing Generative AI through Million-User Feedback from an Online Creative Communi…
☆12Mar 29, 2024Updated 2 years ago
richard-peng-xia / Chinese-Noisy-Text
View on GitHub
This repository stores the code of the data augmentation method from Chinese word and character levels, which adds noise to words and cha…
☆22Aug 26, 2022Updated 3 years ago