zchoi/VCRN

☆11

Alternatives and similar repositories for VCRN

Users who are interested in VCRN are comparing it to the repositories listed below.

zchoi / SPT
[TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".
☆10 · Aug 14, 2024 · Updated last year
zchoi / PKOL
[TIP 2022] Official code of paper “Video Question Answering with Prior Knowledge and Object-sensitive Learning”
☆46 · Jan 27, 2024 · Updated 2 years ago
RainBowLuoCS / MMEvol
(ACL 2025) 🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct"
☆22 · May 15, 2025 · Updated last year
zchoi / S2-Transformer
[IJCAI 2022] Official Pytorch code for paper “S2 Transformer for Image Captioning”
☆86 · Aug 14, 2024 · Updated last year
liupeng0606 / clip4caption
The first unofficial implementation of CLIP4Caption: CLIP for Video Caption (ACMMM 2021)
☆16 · Jan 2, 2023 · Updated 3 years ago
Sejong-VLI / V2T-Action-Graph-JKSUCIS-2023
The implementation of a paper entitled "Action Knowledge for Video Captioning with Graph Neural Networks" (JKSUCIS 2023).
☆14 · Mar 29, 2023 · Updated 3 years ago
yytzsy / SMCG
Code for the paper "Controllable Video Captioning with an Exemplar Sentence"
☆12 · Apr 14, 2021 · Updated 5 years ago
ylqi / GL-RG
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆18 · May 10, 2023 · Updated 3 years ago
YangYY-Liu / MatrixChatGPTVoiceBot
Talk to ChatGPT and Generate image via any Matrix client!
☆16 · Apr 25, 2023 · Updated 3 years ago
aa200647963 / SGG-DHL
This repository contains code for the paper 'Dual-branch Hybrid Learning Network for Unbiased Scene Graph Generation'.
☆17 · Aug 6, 2022 · Updated 3 years ago
VL-Group / DPQ
☆19 · Dec 16, 2020 · Updated 5 years ago
kaipengfang / ProS
☆19 · Jul 22, 2024 · Updated 2 years ago
zchoi / GLSCL
[TIP25] Code for "Text-Video Retrieval with Global-Local Semantic Consistent Learning"
☆16 · May 12, 2025 · Updated last year
yangbang18 / CARE
(TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information
☆32 · Dec 26, 2024 · Updated last year
yiskw713 / VideoCaptioning
video captioning using 3DCNN and LSTM (pytorch)
☆11 · Sep 26, 2019 · Updated 6 years ago
ylhz / FlexAC
Official implementation for the NeurIPS 2025 paper: "FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Langua…
☆20 · Apr 25, 2026 · Updated 3 months ago
xiaosu-zhu / Aurora-Weather
Aurora Weather
☆24 · Dec 8, 2016 · Updated 9 years ago
NovaMind-Z / PTSN
Repository for an end-to-end image captioning method PTSN (ACM MM22).
☆60 · Dec 11, 2022 · Updated 3 years ago
jianxiong-zhou / TFE-DCN
[WACV 2023] Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization
☆13 · Mar 9, 2024 · Updated 2 years ago
VL-Group / PENET
[CVPR 2023] Official Pytorch code for paper "Prototype-based Embedding Network for Scene Graph Generation"
☆62 · Jun 8, 2023 · Updated 3 years ago
yyyanglz / KAN
Rich Visual Knowledge-based Augmentation Network for Visual Question Answering
☆10 · Dec 6, 2019 · Updated 6 years ago
yl3800 / EIGV
☆15 · Aug 12, 2022 · Updated 3 years ago
suny-sht / clip-red-circle
Official implementation of "What does CLIP know about a red circle? Visual Prompt Engineering for VLMs", ICCV 2023
☆12 · Sep 21, 2023 · Updated 2 years ago
MichiganNLP / In-the-wild-QA
In-the-wild Question Answering
☆15 · May 10, 2023 · Updated 3 years ago
Wangdoudou8 / text-summarization-csdn
An open source project on my CSDN blog, whose dataset is the CNN/DM and whose model is T5.
☆12 · Jul 9, 2023 · Updated 3 years ago
yafuly / SyntacticGen
☆16 · Jul 11, 2023 · Updated 3 years ago
lixiangpengcs / Spatial-Temporal-Adaptive-Attention-for-Video-Captioning
Extension of hLSTMat
☆19 · Apr 15, 2021 · Updated 5 years ago
hobincar / SGN
Official pytorch implementation of the AAAI 2021 paper "Semantic Grouping Network for Video Captioning"
☆54 · Jul 9, 2021 · Updated 5 years ago
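JoannaRay1 / project-xiechengHotel-crawl-analysis
Crawls hotel reviews from the Ctrip website, then performs data preprocessing and sentiment-classification-based data analysis, using processing techniques such as jieba word segmentation of reviews; analysis and prediction techniques such as sentiment lexicons, feature extraction, and machine learning models; and visualization techniques such as word clouds and heat maps.
☆13 · Jul 15, 2022 · Updated 4 years ago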
ZhuGeKongKong / SGG-G2S
☆21 · Mar 1, 2022 · Updated 4 years ago
tgc1997 / RMN
IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning
☆79 · Nov 23, 2020 · Updated 5 years ago
avcourt / spamfilter-py
A naïve Bayesian spam filter in Python
☆10 · Dec 18, 2019 · Updated 6 years ago
Kyle1210 / uni-shop-2
☆10 · Jun 30, 2022 · Updated 4 years ago
Qinying-Liu / OpenWTAL
a unified and simple codebase for weakly-supervised temporal action localization
☆23 · Sep 30, 2023 · Updated 2 years ago
EchoSafe-MLLM / EchoSafe
[CVPR 2026] Code for Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory
☆15 · Mar 18, 2026 · Updated 4 months ago
GAIR-NLP / weak-to-strong-reasoning
☆59 · Sep 2, 2024 · Updated last year
jd-opensource / Citrus-V
Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
☆17 · Sep 25, 2025 · Updated 10 months ago
sangminwoo / Temporal-Span-Proposal-Network-VidVRD
[ESWA 2025] Official pytorch implementation of "What and When to look?: Temporal Span Proposal Network for Video Relation Detection"
☆16 · Aug 9, 2021 · Updated 4 years ago
scchy / XtunerGUI
Xtuner Factory
☆35 · Mar 1, 2024 · Updated 2 years ago