Ad-hoc Video Search
☆28Feb 18, 2021Updated 5 years ago
Alternatives and similar repositories for avs
Users that are interested in avs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- W2VV++: A fully deep learning solution for ad-hoc video search☆29Jul 25, 2024Updated last year
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Video embeddings for retrieval with natural language queries☆343Feb 15, 2023Updated 3 years ago
- Code accompanying the paper "Fine-grained Video-Text Retrieval with Hierarchical Graph Reasoning".☆211Jun 12, 2020Updated 5 years ago
- Source code of our TPAMI'21 paper Dual Encoding for Video Retrieval by Text and CVPR'19 paper Dual Encoding for Zero-Example Video Retrie…☆88Jan 10, 2023Updated 3 years ago
- COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning☆291Sep 6, 2022Updated 3 years ago
- MDMMT: Multidomain Multimodal Transformer for Video Retrieval☆26Jun 28, 2021Updated 4 years ago
- PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"☆521Dec 8, 2021Updated 4 years ago
- Extract MFCCs from videos and make bag-of-audio-words (BOAW) representations.☆11Dec 20, 2018Updated 7 years ago
- ☆35Mar 22, 2019Updated 7 years ago
- Codebase for CVPR 2020 paper "Spatio-Temporal Graph for Video Captioning with Knowledge Distillation"☆23Mar 4, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆14Oct 10, 2022Updated 3 years ago
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago
- ☆62May 11, 2021Updated 4 years ago
- An Evaluation Framework for Temporal Information Extraction Systems☆20Feb 19, 2026Updated last month
- ☆93Oct 20, 2017Updated 8 years ago
- MomentDiff: Generative Video Moment Retrieval from Random to Real--NeurIPS 2023☆80Nov 2, 2023Updated 2 years ago
- Research Notes☆11Sep 13, 2020Updated 5 years ago
- Multimodal Adversarial Network for Cross-modal Retrieval (PyTorch Code)☆29Apr 7, 2020Updated 5 years ago
- Ladder Loss for Coherent Visual-Semantic Embedding, AAAI, 2020☆13Aug 14, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository contains the scripts used during my participation on CIKM Cup 2016 (see http://cikmcup.org/ and https://competitions.coda…☆11Nov 4, 2016Updated 9 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago
- PyTorch Implementation of Consensus-based Sequence Training for Video Captioning☆60May 15, 2018Updated 7 years ago
- IJCAI2020: Learning to Discretely Compose Reasoning Module Networks for Video Captioning☆79Nov 23, 2020Updated 5 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- ☆10Oct 16, 2025Updated 5 months ago
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- ☆16Dec 25, 2025Updated 3 months ago
- The code repository for "Cross-Modal and Hierarchical Modeling of Video and Text" in PyTorch☆20Apr 26, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- with reinforcement learning☆32May 19, 2020Updated 5 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆35Jul 3, 2025Updated 8 months ago
- A large-scale place image dataset with multi-faceted annotations. Multi-level place recognition.☆10Jul 15, 2020Updated 5 years ago
- The implementation of FINER-MLLM, which is accepted by MM2024.☆18Oct 8, 2024Updated last year
- ☐ ☐ A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`☆10May 28, 2021Updated 4 years ago
- The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"☆68Mar 26, 2018Updated 8 years ago
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago