Character Grounding and Re-Identification in Story of Videos and Text Descriptions
☆10Jan 17, 2021Updated 5 years ago
Alternatives and similar repositories for CiSIN
Users who are interested in CiSIN are comparing it to the repositories listed below
- M-VAD Names Dataset. Multimedia Tools and Applications (2019)☆22Jul 9, 2019Updated 6 years ago
- Weakly Supervised Video Moment Retrieval from Text Queries☆43Jul 20, 2020Updated 5 years ago
- Code for ECCV 2022 Workshop paper "See Finer, See More: Implicit Modality Alignment for Text-based Person Retrieval"☆21Nov 16, 2025Updated 3 months ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆52Aug 9, 2020Updated 5 years ago
- Code for Knowledge-Embedded Routing Network for Scene Graph Generation (CVPR 2019)☆22Mar 25, 2019Updated 6 years ago
- ☆26Nov 30, 2019Updated 6 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- ☆32May 3, 2024Updated last year
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- New word discovery/new word mining/degree of freedom/degree of cohesion/python3☆10May 28, 2019Updated 6 years ago
- Learning phrase grounding from captioned images through InfoNCE bound on mutual information☆74Aug 22, 2020Updated 5 years ago
- Implementation of paper "Not All Frames Are Equal: Weakly-Supervised Video Grounding with Contextual Similarity and Visual Clustering Los…☆30Jun 29, 2020Updated 5 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023☆43Jun 27, 2023Updated 2 years ago
- Retrieval Augmented Generation, but no servers involved. Backed by S3☆12Nov 3, 2023Updated 2 years ago
- Implementation of "Look, Listen and Recognise: character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding☆33Aug 29, 2019Updated 6 years ago
- Creating crowdsourcing based experiments made easy☆10May 25, 2020Updated 5 years ago
- an online variant of AVrateNG☆14Mar 20, 2025Updated 11 months ago
- Douban movie review visualization☆10May 19, 2016Updated 9 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- ☆35Aug 26, 2024Updated last year
- Code and Models for paper "Reinforced Video Captioning with Entailment Rewards (EMNLP 2017)"☆44Nov 19, 2019Updated 6 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- Generic classification model☆10Apr 2, 2025Updated 11 months ago
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- Deep learning for named entity recognition on CoNLL-2003☆10Dec 23, 2016Updated 9 years ago
- AI Playing Bulls and Cows Game☆15Sep 4, 2023Updated 2 years ago
- ☆13Nov 28, 2025Updated 3 months ago
- Quiz and assignment solutions for Coursera MOOC - Aerial Robotics☆13Aug 15, 2016Updated 9 years ago
- A trie-based text editor with word-suggestion (autocomplete) support. Built with python and pyqt☆10Sep 7, 2016Updated 9 years ago
- Port of Chromaprint C/C++ library to Ruby to extract fingerprints from audio sources.☆12Nov 7, 2013Updated 12 years ago
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- SSL Video Representation Learning project☆14Jul 8, 2025Updated 8 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Code for Advanced Algorithm Lecture, Konkuk Univ☆10Feb 1, 2023Updated 3 years ago
- A behance web crawler project☆10May 16, 2019Updated 6 years ago