liupeng0606/clip4caption

The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)

☆16

Alternatives and similar repositories for clip4caption

Users that are interested in clip4caption are comparing it to the libraries listed below.

Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14 · Mar 29, 2023 · Updated 3 years ago
MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆50 · Sep 30, 2022 · Updated 3 years ago
zchoi / VCRN
☆11 · Jul 11, 2023 · Updated 3 years ago
ylqi / GL-RG
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆18 · May 10, 2023 · Updated 3 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 · Jul 9, 2021 · Updated 5 years ago
UARK-AICV / VLTinT
[AAAI 2023 Oral] VLTinT: Visual-Linguistic Transformer-in-Transformer for Coherent Video Paragraph Captioning
☆68 · Feb 16, 2024 · Updated 2 years ago
nasib-ullah / video-captioning-models-in-Pytorch
A PyTorch implementation of state of the art video captioning models from 2015-2019 on MSVD and MSRVTT datasets.
☆73 · Jul 30, 2023 · Updated 2 years ago
microsoft / SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆251 · May 26, 2022 · Updated 4 years ago
baopj / Vid-Morp
☆12 · Dec 6, 2024 · Updated last year
bladewaltz1 / PromptSwitch
☆30 · Aug 14, 2023 · Updated 2 years ago
yiskw713 / VideoCaptioning
video captioning using 3DCNN and LSTM (pytorch)
☆11 · Sep 26, 2019 · Updated 6 years ago
casper9429-kth / Siamese-Masked-Autoencoders---Learning-and-Exploration
Course: DD2412 Deep Learning Advanced at KTH Project by Casper, Magnus, and Friso Focus: Self-supervised learning and computer vision wit…
☆12 · Dec 15, 2023 · Updated 2 years ago
dipika-singhania / ICC-Semi-Supervised-TAS
Iterative Contrast-Classify For Semi-supervised Temporal Action Segmentation
☆11 · Jul 24, 2023 · Updated 3 years ago
W-Wu / DEER
☆12 · Aug 25, 2023 · Updated 2 years ago
Lihr747 / CgtGAN
☆20 · May 3, 2025 · Updated last year
Dslab-NLP / Tibetan-PLM
☆18 · Oct 8, 2023 · Updated 2 years ago
Wangdoudou8 / text-summarization-csdn
An open source project on my CSDN blog, whose dataset is the CNN/DM and whose model is T5.
☆12 · Jul 9, 2023 · Updated 3 years ago
jpthu17 / EMCL
[NeurIPS 2022 Spotlight] Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations
☆148 · Apr 9, 2024 · Updated 2 years ago
ezeli / InSentiCap_model
A pytorch implementation of our paper Image Captioning with Inherent Sentiment (ICME 2021 Oral).
☆11 · Jul 18, 2022 · Updated 4 years ago
yangbang18 / Non-Autoregressive-Video-Captioning
The PyTorch code of the AAAI2021 paper "Non-Autoregressive Coarse-to-Fine Video Captioning".
☆57 · Oct 22, 2023 · Updated 2 years ago
jinhyunj / EaTR
Official pytorch repository for "Knowing Where to Focus: Event-aware Transformer for Video Grounding" (ICCV 2023)
☆55 · Sep 7, 2023 · Updated 2 years ago
mysilver / comp9321
COMP9321 Data Services Engineering Lab
☆10 · Apr 23, 2018 · Updated 8 years ago
jacobswan1 / Video2Commonsense
Video captioning baseline models on Video2Commonsense Dataset.
☆56 · Apr 15, 2021 · Updated 5 years ago
yafuly / SyntacticGen
☆16 · Jul 11, 2023 · Updated 3 years ago
yytzsy / SMCG
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12 · Apr 14, 2021 · Updated 5 years ago
Alvin-Zeng / GCM
Graph Convolutional Module for Temporal Action Localization in Videos
☆10 · Jul 4, 2020 · Updated 6 years ago
joeyz0z / ConZIC
Official implementation of "ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing"
☆76 · Sep 20, 2023 · Updated 2 years ago
dhg-wei / DeCap
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
☆144 · Mar 16, 2023 · Updated 3 years ago
amazon-science / lv-mae
☆18 · Sep 30, 2025 · Updated 9 months ago
Kyle1210 / uni-shop-2
☆10 · Jun 30, 2022 · Updated 4 years ago
chenkang455 / TRMD
[TMM 2024] Motion Deblur by Learning Residual from Events
☆27 · Jan 5, 2025 · Updated last year
zjr2000 / Untrimmed-Video-Feature-Extractor
A simple and effective feature extractor for untrimmed videos
☆13 · Sep 1, 2022 · Updated 3 years ago
CrossmodalGroup / ESL
☆12 · May 3, 2024 · Updated 2 years ago
Yinghao-Li / GuiGen
☆14 · Oct 6, 2020 · Updated 5 years ago
bladewaltz1 / ModeCap
Controllable image captioning model with unsupervised modes
☆21 · Apr 14, 2023 · Updated 3 years ago
SydCaption / SAAT
☆62 · May 11, 2021 · Updated 5 years ago
zjh31 / CPL
☆21 · Apr 2, 2024 · Updated 2 years ago
junyangwang0410 / Knight
SotA text-only image/video method (IJCAI 2023)
☆15 · Jan 9, 2024 · Updated 2 years ago
Soldelli / VLG-Net
VLG-Net: Video-Language Graph Matching Networks for Video Grounding
☆31 · May 31, 2022 · Updated 4 years ago