tuyunbin/Review-of-Change-Captioning

This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.

☆17

Alternatives and similar repositories for Review-of-Change-Captioning

Users who are interested in Review-of-Change-Captioning compare it to the repositories listed below.

yili-19 / SSGPA
View on GitHub
☆17Jul 14, 2025Updated last year
HDUyiming / SOCCER
View on GitHub
We are very happy that our work has been accepted by ACM Multimedia 2024! 🥰
☆12Jan 8, 2025Updated last year
kevendai / fandp-ijcai2025-issues
View on GitHub
☆17Oct 13, 2025Updated 9 months ago
ZZDoog / ProDubber
View on GitHub
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23Jun 6, 2025Updated last year
Junxi-Chen / PE-MIL
View on GitHub
[CVPR 2024] Official code for paper: Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection.
☆27Aug 19, 2024Updated last year
shinianzhihou / lazy_cd
View on GitHub
Towards practical change detection, including annotation, algorithms and deployment.
☆12Dec 15, 2022Updated 3 years ago
GalaxyCong / EmoDubber
View on GitHub
[CVPR 2025] Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.
☆38Jun 3, 2025Updated last year
kaistmm / AlignDiT
View on GitHub
[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
☆24Oct 28, 2025Updated 9 months ago
ZZDoog / Avatar
View on GitHub
Avatar: An easy-to-use digital portrait PPT presentation video generation system based on Gradio
☆20Nov 7, 2023Updated 2 years ago
WayneTomas / Balance-Constraint-KMeans
View on GitHub
[Symmetry 2019] This is the Matlab code for our paper "Optimizing MSE for Clustering with Balanced Size Constraints".
☆20Mar 25, 2025Updated last year
WayneTomas / TransCP
View on GitHub
[TPAMI 2024] This is the official Pytorch code for our paper "Context Disentangling and Prototype Inheriting for Robust Visual Grounding"…
☆28May 8, 2025Updated last year
tuyunbin / SCORER
View on GitHub
[ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".
☆20Sep 25, 2025Updated 10 months ago
ZZDoog / Speaker2Dubber
View on GitHub
[ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"
☆34Jul 14, 2026Updated 2 weeks ago
ruc-aimc-lab / TeachCLIP
View on GitHub
[CVPR 2024] TeachCLIP for Text-to-Video Retrieval
☆42May 7, 2025Updated last year
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
tuyunbin / Video-Description-with-Spatial-Temporal-Attention
View on GitHub
[ACM MM 2017 & IEEE TMM 2020] This is the Theano code for the paper "Video Description with Spatial Temporal Attention"
☆61Oct 20, 2020Updated 5 years ago
mira-ai-lab / MUSIC-AVQA-R
View on GitHub
☆13May 21, 2024Updated 2 years ago
Liqq1 / awesome-medical-vision-and-language-pretraining
View on GitHub
A collection of medical VLP papers
☆20Jul 24, 2024Updated 2 years ago
zzma2 / medical-llm-reasoning-survey
View on GitHub
A curated list of medical reasoning research on large language models, organized by modality, technique, application, and benchmark.
☆19Oct 17, 2025Updated 9 months ago
GalaxyCong / StyleDubber
View on GitHub
[ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"
☆98Nov 14, 2024Updated last year
zhuduowang / Change3D
View on GitHub
[CVPR 2025 Highlight] Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective.
☆92Jul 24, 2025Updated last year
AntXinyuan / sph2pob
View on GitHub
(IJCAI 2023) Sph2Pob: Boosting Object Detection on Spherical Images with Planar Oriented Boxes Methods
☆14Aug 23, 2023Updated 2 years ago
Lee-zixu / FineCIR
View on GitHub
☆12Mar 31, 2025Updated last year
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 8 months ago
XiuchuanLi / fmixup
View on GitHub
TIFS2022: Decision-based Adversarial Attack with Frequency Mixup
☆22Aug 8, 2023Updated 2 years ago
wzb-bupt / GaitParsing
View on GitHub
GaitParsing: Human Semantic Parsing for Gait Recognition (IEEE TMM)
☆13May 20, 2024Updated 2 years ago
RanaCM / DSU-AVO
View on GitHub
Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023
☆12May 13, 2024Updated 2 years ago
shinianzhihou / labelme_cd
View on GitHub
A change detection annotation tool built on the widely used labelme, called labelme_cd.
☆40Mar 6, 2023Updated 3 years ago
microsoft / chexprompt
View on GitHub
Expert-level AI radiology report evaluator
☆37Apr 1, 2025Updated last year
titizheng / PAMIL
View on GitHub
Implementation of "Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification", (CVPR 2024 Highlight).
☆25Mar 6, 2025Updated last year
ml-researcher / VAE
View on GitHub
☆11Oct 8, 2022Updated 3 years ago
danfenghong / ISPRS_HD-Net
View on GitHub
Yuxuan Li, Danfeng Hong, Chenyu Li, Jing Yao, Jocelyn Chanussot. HD-Net: High-resolution decoupled network for building footprint extrac…
☆47Nov 30, 2024Updated last year
banjiuyufen / ArXiv-Agent
View on GitHub
🕵️ ArXiv Agent v1.0 - Your Intelligent Research Assistant
☆27Dec 29, 2025Updated 7 months ago
Chiangsonw / CaLa
View on GitHub
The official code of "CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval"
☆15Sep 19, 2024Updated last year
hongwang600 / fashion-iq-metadata
View on GitHub
This repo contains some useful metadata for the Fashion IQ challenge: https://sites.google.com/view/lingir/fashion-iq
☆15Jun 28, 2019Updated 7 years ago
junkunyuan / HAP
View on GitHub
[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
☆44Mar 25, 2024Updated 2 years ago
YuankaiQi / ORIST
View on GitHub
Know What and Know Where: An Object-and-Room Informed Sequential BERT for Indoor Vision-Language Navigation
☆16Feb 7, 2022Updated 4 years ago
jyliu-98 / MoSketch
View on GitHub
[ICCV 2025] This repo is the official implementation of "Multi-Object Sketch Animation by Scene Decomposition and Motion Planning"
☆28Jul 30, 2025Updated 11 months ago
rtst777 / TextGAN
View on GitHub
GAN-Based Text Generation
☆14Apr 19, 2020Updated 6 years ago