yangbang18/CARE

(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information

☆32

Alternatives and similar repositories for CARE

Users who are interested in CARE are comparing it to the repositories listed below.

liupeng0606 / clip4caption
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16 · Jan 2, 2023 · Updated 3 years ago
Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14 · Mar 29, 2023 · Updated 3 years ago
zchoi / VCRN
☆11 · Jul 11, 2023 · Updated 3 years ago
nasib-ullah / video-captioning-models-in-Pytorch
A PyTorch implementation of state-of-the-art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73 · Jul 30, 2023 · Updated 2 years ago
bladewaltz1 / PromptSwitch
☆30 · Aug 14, 2023 · Updated 2 years ago
luo3300612 / Semantics-AssistedVideoCaptioning.pytorch
PyTorch implementation of Semantics-AssistedVideoCaptioning
☆11 · Feb 16, 2023 · Updated 3 years ago
sauradip / MUPPET
[arXiv 2023] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"
☆16 · Aug 30, 2023 · Updated 2 years ago
MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆50 · Sep 30, 2022 · Updated 3 years ago
yangbang18 / MultiCapCLIP
(ACL'2023) MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
☆36 · Aug 8, 2024 · Updated last year
hobincar / SGN
Official PyTorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 · Jul 9, 2021 · Updated 5 years ago
LiJiaBei-7 / rivrl
Source code of our TCSVT'22 paper Reading-strategy Inspired Visual Representation Learning for Text-to-Video Retrieval
☆19 · Feb 13, 2022 · Updated 4 years ago
yiskw713 / VideoCaptioning
Video captioning using 3DCNN and LSTM (PyTorch)
☆11 · Sep 26, 2019 · Updated 6 years ago
mengcaopku / LocVTP
[ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization
☆39 · Jul 29, 2022 · Updated 3 years ago
bladewaltz1 / ModeCap
Controllable image captioning model with unsupervised modes
☆21 · Apr 14, 2023 · Updated 3 years ago
hrtang22 / MUSE
Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"
☆26 · Feb 2, 2025 · Updated last year
zjuchenlong / WSAG
[EMNLP'22] Weakly-Supervised Temporal Article Grounding
☆14 · Nov 25, 2023 · Updated 2 years ago
deep-real / DEAL
The PyTorch implementation for "DEAL: Disentangle and Localize Concept-level Explanations for VLMs" (ECCV 2024 Strong Double Blind)
☆20 · Mar 9, 2026 · Updated 4 months ago
sharad5 / OWL-ViT-Object-Detection
Capstone Project: Training and Finetuning for OWL ViT for Referring Expression Task
☆12 · Jan 13, 2024 · Updated 2 years ago
ShiYaya / emscore
Research code for CVPR 2022 paper: "EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching"
☆26 · Oct 20, 2022 · Updated 3 years ago
yafuly / SyntacticGen
☆16 · Jul 11, 2023 · Updated 3 years ago
yytzsy / SMCG
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12 · Apr 14, 2021 · Updated 5 years ago
yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57 · Oct 22, 2023 · Updated 2 years ago
dingfengshi / ReAct
[ECCV 2022] Code for the paper, ReAct: Temporal Action Detection with Relational Queries
☆39 · Oct 19, 2022 · Updated 3 years ago
eric-xw / kinetics-i3d-pytorch
☆35 · Mar 22, 2019 · Updated 7 years ago
PopeyePxx / GAAF-DEX
Granularity-Aware Affordance Understanding from human-object interaction for Dexterous Robotic Functional Grasping
☆15 · Sep 2, 2025 · Updated 10 months ago
Jiaxuan-Li / EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
☆64 · Apr 8, 2024 · Updated 2 years ago
herbwood / face_liveness_detector
Face verification and facial liveness detector
☆10 · Nov 11, 2021 · Updated 4 years ago
ninatu / in_style
Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023
☆11 · Oct 5, 2023 · Updated 2 years ago
CrossmodalGroup / ESL
☆12 · May 3, 2024 · Updated 2 years ago
Yinghao-Li / GuiGen
☆14 · Oct 6, 2020 · Updated 5 years ago
RongKaiWeskerMA / INSTA
The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning
☆13 · Apr 14, 2024 · Updated 2 years ago
mudabek / encoding-cxr-report-gen
On the Importance of Image Encoding in Automated Chest X-Ray Report Generation, BMVC 2022
☆16 · Dec 22, 2022 · Updated 3 years ago
dmoltisanti / air-cvpr23
This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…
☆13 · May 25, 2023 · Updated 3 years ago
ruslantau / media-annotator
Web-based annotation tool for media data. The easiest way to create your own media dataset.
☆16 · May 12, 2023 · Updated 3 years ago
xiaoneil / LPNet
☆13 · Nov 28, 2021 · Updated 4 years ago
OpenNLPLab / FAVDBench
[CVPR 2023] Official implementation of the paper: Fine-grained Audible Video Description
☆76 · Dec 4, 2023 · Updated 2 years ago
AntXinyuan / SSP
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
☆13 · Jul 7, 2026 · Updated 2 weeks ago
lazyZhou1997 / DreamLand
An entry for the Microsoft Imagine Cup: a small AR puzzle game built with C#, the Unity 3D game engine, and the Vuforia AR engine
☆13 · Mar 13, 2018 · Updated 8 years ago
Vill-Lab / 2023-TIFS-ISTVT
Official implementation of ISTVT: Interpretable Spatial-Temporal Video Transformer for Deepfake Detection.
☆15 · Jan 18, 2024 · Updated 2 years ago