OpenMICG/mcg

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/OpenMICG/mcg)

OpenMICG / mcg

Multigranularity Contrastive cross-modal collaborative Generation (MCG) model for Video QA

☆12

Alternatives and similar repositories for mcg

Users that are interested in mcg are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenMICG / CSLAKE
View on GitHub
A consistent Med-VQA dataset, C-SLAKE , extended by Slake for further consistency assessment .
☆17Jan 12, 2024Updated 2 years ago
OpenMICG / CoCoMeD
View on GitHub
Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering
☆16Jan 12, 2024Updated 2 years ago
OpenMICG / AHP
View on GitHub
Adapter-Enhanced Hierarchical Cross-Modal Pre-training for Lightweight Medical Report Generation
☆15Jan 25, 2025Updated last year
OpenMICG / MossVLN
View on GitHub
Observation Driven Memory Synergistic Planning for Continuous Vision-Language Navigation
☆33Jun 14, 2024Updated 2 years ago
ZYangChen / DC-SatMVS
View on GitHub
[IEEE JSTARS] The official implementation of "Surface Depth Estimation from Multi-view Stereo Satellite Images with Distribution Contrast…
☆11May 16, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenMICG / VisionDreamer
View on GitHub
VisionDreamer: High-Fidelity Text-to-3D Generation via Mesh-Guided 3D Gaussian Splatting
☆18Jul 7, 2025Updated last year
ZYangChen / FDN-MVS
View on GitHub
[The Visual Computer] The official implementation of "Feature Distribution Normalization Network for Multi-View Stereo”.
☆15Mar 5, 2025Updated last year
PKU-EPIC / Uni-NaVid
View on GitHub
☆13Oct 15, 2025Updated 9 months ago
EIT-NLP / HiDrop
View on GitHub
☆17Apr 5, 2026Updated 3 months ago
shuishida / calvin
View on GitHub
☆17Jul 21, 2022Updated 4 years ago
OpenMICG / FAVP
View on GitHub
☆16Sep 17, 2025Updated 10 months ago
HanqingWangAI / CCC-VLN
View on GitHub
Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…
☆28Mar 4, 2022Updated 4 years ago
Peratham / video2text.pytorch
View on GitHub
PyTorch implementation of video captioning
☆13Sep 24, 2017Updated 8 years ago
EIT-NLP / Awesome-MLLM-Compression
View on GitHub
☆23Apr 12, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
EIT-NLP / UTPTrack
View on GitHub
☆29Apr 5, 2026Updated 3 months ago
MGitHubL / TMac
View on GitHub
☆14Feb 26, 2024Updated 2 years ago
pspdada / SENTINEL
View on GitHub
[ICCV 2025] Official repository of "Mitigating Object Hallucinations via Sentence-Level Early Intervention".
☆31Jul 2, 2026Updated 2 weeks ago
bowong / Layered-Memory-Network
View on GitHub
A Layered Memory Network for MovieQA
☆16Apr 27, 2018Updated 8 years ago
2282588541a / HiRAG
View on GitHub
code for paper Hierarchical Retrieval-Augmented Generation Model with Rethink for Multi-hop Question Answering
☆14Aug 13, 2024Updated last year
MAGIC-AI4Med / ChestX-Reasoner
View on GitHub
☆39Mar 19, 2026Updated 4 months ago
Hoar012 / TDC-Video
View on GitHub
Official implementation of TDC.
☆15Jul 22, 2025Updated 11 months ago
yl3800 / EIGV
View on GitHub
☆15Aug 12, 2022Updated 3 years ago
CCIIPLab / DPT
View on GitHub
The code of IJCAI2022 paper, Declaration-based Prompt Tuning for Visual Question Answering
☆20May 10, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
deep-spin / Infinite-Video
View on GitHub
\infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation
☆21Feb 14, 2025Updated last year
noagarcia / ROLL-VideoQA
View on GitHub
PyTorch code for ROLL, a knowledge-based video story question answering model.
☆21Sep 29, 2020Updated 5 years ago
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
salesforce / paprika
View on GitHub
Code for CVPR 2023 paper "Procedure-Aware Pretraining for Instructional Video Understanding"
☆50Jun 2, 2026Updated last month
zhyang2226 / OPA-DPO
View on GitHub
[CVPR 2025 (Oral)] Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
☆111Jan 9, 2026Updated 6 months ago
LiangThree / MCMA
View on GitHub
☆15Jan 12, 2026Updated 6 months ago
minghangz / cpl
View on GitHub
CPL: Weakly Supervised Temporal Sentence Grounding with Gaussian-based Contrastive Proposal Learning
☆65Mar 22, 2026Updated 3 months ago
ffmpbgrnn / tflibs
View on GitHub
☆25Sep 8, 2017Updated 8 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sakura20221 / RT-RAG
View on GitHub
☆17Jan 16, 2026Updated 6 months ago
WHB139426 / GCG
View on GitHub
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]
☆10Jul 22, 2024Updated last year
exoskeletonzj / MARS
View on GitHub
A Multi-Agent Approach Integrating Socratic Guidance for Automated Prompt Optimization
☆18Dec 15, 2025Updated 7 months ago
bebr2 / RACE
View on GitHub
Code for RACE.
☆15Nov 12, 2025Updated 8 months ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
Haiyang0226 / Symphony
View on GitHub
code of cvpr26 paper Symphony
☆17Apr 7, 2026Updated 3 months ago
escorciav / deep-action-proposals
View on GitHub
Action Proposals generated by deep models
☆29Mar 19, 2017Updated 9 years ago