A large-scale dataset for Video Captioning in Italian
☆13May 16, 2023Updated 2 years ago
Alternatives and similar repositories for msr-vtt-it
Users who are interested in msr-vtt-it are comparing it to the repositories listed below
- ☆12Dec 5, 2023Updated 2 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning*☆15Apr 6, 2021Updated 4 years ago
- ☆30Feb 27, 2023Updated 3 years ago
- Language-Agnostic Visual-Semantic Embeddings (ICCV'19)☆22Nov 11, 2019Updated 6 years ago
- Official Implementation of the paper: A Complete Recipe for Diffusion Generative Models☆31Nov 1, 2024Updated last year
- ☆10Dec 10, 2022Updated 3 years ago
- Send messages to Enterprise WeChat☆12Jan 5, 2022Updated 4 years ago
- MTLE method, winner of the Large Scale Movie Description Challenge (LSMDC) 2017 - Video Description Task.☆24Jul 12, 2019Updated 6 years ago
- Ad-hoc Video Search☆28Feb 18, 2021Updated 5 years ago
- An implementation of simple diffusion in PyTorch (and JAX)☆34Jan 28, 2023Updated 3 years ago
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 2 months ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- New word discovery / new word mining / degree of freedom / degree of cohesion / python3☆10May 28, 2019Updated 6 years ago
- Implementation of Collage Diffusion (https://arxiv.org/abs/2303.00262)☆38May 29, 2023Updated 2 years ago
- Visualization of Douban movie reviews☆10May 19, 2016Updated 9 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- This repo provides a list of helpful phprad resources to make learning easier.☆10Oct 18, 2023Updated 2 years ago
- Open Set Semantic Segmentation☆10Dec 23, 2020Updated 5 years ago
- Fundraiser Tracker implemented as AWS Lambda with ability to manage through Slack and autosync with Monobank and Privatbank☆10Apr 24, 2025Updated 10 months ago
- Photorealism model using RealVisXL v4.0☆12Feb 20, 2024Updated 2 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f…☆16Apr 22, 2021Updated 4 years ago
- Bitcoin Address Checker☆11Jul 23, 2024Updated last year
- Create a simple menu for your leaflet map☆10Nov 27, 2016Updated 9 years ago
- Code and performance tests to demonstrate the COUNTLESS algorithm. https://medium.com/@willsilversmith/countless-high-performance-2x-down…☆10Oct 23, 2019Updated 6 years ago
- Software for the QO-100 groundstation of EA4GPZ☆10Aug 8, 2022Updated 3 years ago
- Implementation of "Make One-Shot Video Object Segmentation Efficient Again" and the semi-supervised fine-tuning "e-OSVOS" approach (NeurI…☆36Mar 24, 2021Updated 4 years ago
- PyTorch implementation of various token mixers (attention mechanisms, MLP, etc.) for understanding computer vision papers and other tas…☆16Oct 7, 2024Updated last year
- Official implementation of "Describing Sets of Images with Textual-PCA".☆16Feb 13, 2023Updated 3 years ago
- Port of Chromaprint C/C++ library to Ruby to extract fingerprints from audio sources.☆12Nov 7, 2013Updated 12 years ago
- A multi-interface (REST and MCP) server for automatic license plate recognition 🚗☆21Dec 2, 2025Updated 3 months ago
- ☆13Feb 23, 2018Updated 8 years ago
- A trie-based text editor with word-suggestion (autocomplete) support. Built with Python and PyQt.☆10Sep 7, 2016Updated 9 years ago
- A Feishu bot that pushes the latest arXiv articles daily.☆10Nov 28, 2021Updated 4 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 3 years ago
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- “My name is Gregory Guy. I have just purchased a video store, and I need an up to date, GUI driven system to keep track of all the stock …☆10May 7, 2015Updated 10 years ago
- ☆11Mar 19, 2024Updated last year