tuyunbin / Review-of-Change-Captioning
This repository offers a comprehensive overview of existing datasets and methods in the field of change captioning.
☆17Sep 2, 2025Updated 5 months ago
Alternatives and similar repositories for Review-of-Change-Captioning
Users who are interested in Review-of-Change-Captioning are comparing it to the repositories listed below
- Robust Change Captioning in Remote Sensing: SECOND-CC Dataset and MModalCC Framework☆18Sep 8, 2025Updated 5 months ago
- Code for the paper "RADAR: Enhancing Radiology Report Generation with Supplementary Knowledge Injection" (ACL'25).☆34Jul 23, 2025Updated 6 months ago
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 8 months ago
- [ICCV 2023] This is the Pytorch code for our paper "Self-Supervised Cross-View Representation Reconstruction for Change Captioning".☆20Sep 25, 2025Updated 4 months ago
- [CVPR 2024] Official code for paper: Prompt-Enhanced Multiple Instance Learning for Weakly Supervised Video Anomaly Detection.☆26Aug 19, 2024Updated last year
- Official source codes for the paper: EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing.☆37Jun 3, 2025Updated 8 months ago
- Expert-level AI radiology report evaluator☆36Apr 1, 2025Updated 10 months ago
- [CVPR 2024] TeachCLIP for Text-to-Video Retrieval☆42May 7, 2025Updated 9 months ago
- ☆67Oct 31, 2025Updated 3 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- [JAG 2026] DreamCD: A change-label-free framework for change detection via a weakly conditional semantic diffusion model in optical VHR i…☆20Jan 30, 2026Updated 2 weeks ago
- This repository summarizes the human-centered applications of event data☆13Jan 31, 2025Updated last year
- Official repository of the UPAR dataset for pedestrian attribute recognition and attribute-based person retrieval☆14Jan 22, 2024Updated 2 years ago
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated last month
- Towards practical change detection, including annotation, algorithms and deployment.☆12Dec 15, 2022Updated 3 years ago
- ☆10Mar 31, 2025Updated 10 months ago
- Code to help you read the information in the ptts files found in OLHWDB.☆11Mar 8, 2022Updated 3 years ago
- DeepEarth: AI Foundation Model for Planetary Science & Sustainability☆25Jan 28, 2026Updated 2 weeks ago
- Used in M4C feature extraction script: https://github.com/facebookresearch/mmf/blob/project/m4c/projects/M4C/scripts/extract_ocr_frcn_fea…☆13Jan 30, 2020Updated 6 years ago
- Bidirectional Likelihood Estimation with Multi-Modal Large Language Models for Text-Video Retrieval (ICCV 2025 Highlight)☆20Aug 1, 2025Updated 6 months ago
- DQN for Stock Trading leverages Deep Q-Network (DQN) to develop an intelligent trading agent for stock markets. The project aims to maxim…☆11Jun 27, 2024Updated last year
- Official repository for Gait Recognition with Drones: A Benchmark (TMM 2023)☆10Feb 2, 2024Updated 2 years ago
- ☆17Nov 16, 2025Updated 3 months ago
- This repository will summarize the basic Sentinel-2 image processing workflow using the ESA SNAP software.☆12Apr 2, 2024Updated last year
- This repository contains the official implementation of the paper "LandSegmenter: Towards a Flexible Foundation Model for Land Use and La…☆26Dec 8, 2025Updated 2 months ago
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆44Mar 25, 2024Updated last year
- Fast Style Transfer in Pytorch☆10Mar 1, 2017Updated 8 years ago
- Using Reinforcement learning for object detection & localization CPTS_580 Project☆12May 4, 2017Updated 8 years ago
- Code for ACL2018 paper "Learn How to Actively Learn: An Imitation Learning Approach"☆10Mar 8, 2019Updated 6 years ago
- ☆11Mar 5, 2025Updated 11 months ago
- [CVPR 2025] Official implementation of paper "Multi-Granularity Class Prototype Topology Distillation for Class-Incremental Source-Free …☆17Aug 26, 2025Updated 5 months ago
- code for Cross-Modality Distillation for Multi-modal Tracking☆17Jan 4, 2026Updated last month
- siyuan-note plugin for dashboard.☆11Dec 30, 2024Updated last year
- Source code for AAAI 2024 paper "Finding Visual Saliency in Continuous Spike Stream"☆13Aug 21, 2025Updated 5 months ago
- Official implementation of "cmSalGAN: RGB-D Salient Object Detection with Cross-View Generative Adversarial Networks" (IEEE TMM 2020)☆10Aug 23, 2021Updated 4 years ago
- GaitParsing: Human Semantic Parsing for Gait Recognition (IEEE TMM)☆12May 20, 2024Updated last year
- ☆14Dec 25, 2020Updated 5 years ago
- Web Interface for gaze recording: CVPR 2018☆10Jul 10, 2018Updated 7 years ago
- FINALLY: Fast and universal speech enhancement model delivering studio-quality audio for a wide range of recordings.☆25Dec 11, 2025Updated 2 months ago