☆14Jan 5, 2022Updated 4 years ago
Alternatives and similar repositories for VideoMatch
Users that are interested in VideoMatch are comparing it to the libraries listed below
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023☆11Oct 5, 2023Updated 2 years ago
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- This repository contains the code for our AAAI 2017 paper, "Learning Latent Sub-events in Activity Videos Using Temporal Attention Filter…☆23Oct 4, 2018Updated 7 years ago
- ☆27Aug 16, 2022Updated 3 years ago
- This repo contains source code for Glance and Focus: Memory Prompting for Multi-Event Video Question Answering (Accepted in NeurIPS 2023)☆31Jun 28, 2024Updated last year
- ☆30Aug 14, 2023Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- [CVPR'24] OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition☆38Apr 27, 2024Updated last year
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning☆13Apr 14, 2024Updated last year
- Promptopia is an open-source AI prompting tool for the modern world to discover, create, and share creative prompts☆12May 27, 2023Updated 2 years ago
- This suite of nodes unlocks high-performance parallel processing in ComfyUI by utilizing **Model Replication**. Unlike standard offloadin…☆41Feb 24, 2026Updated last week
- 🏆 The 1st Place Submission to AICity Challenge 2021 Natural Language-Based Vehicle Retrieval Track (Alibaba-UTS submission)☆94Apr 28, 2021Updated 4 years ago
- Adaptive Offline Quintuplet Loss for Image-Text Matching (AOQ)☆34Jul 2, 2020Updated 5 years ago
- ☆24Oct 9, 2025Updated 4 months ago
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition☆16Sep 29, 2025Updated 5 months ago
- ☆11May 9, 2023Updated 2 years ago
- ☆11Aug 7, 2025Updated 6 months ago
- In this project, a facial recognition algorithm is implemented in Python using PCA and SVD dimensionality reduction tools.☆10Sep 2, 2019Updated 6 years ago
- UVA-Human-Skeleton-Preprocessing☆10May 4, 2023Updated 2 years ago
- CenterMask2 on detectron2 (open images)☆10May 28, 2020Updated 5 years ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 2 years ago
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Mar 23, 2021Updated 4 years ago
- Official Implementation for "EmojiLM: Modeling the New Emoji Language"☆11Feb 23, 2024Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- ☆13Nov 28, 2021Updated 4 years ago
- A Simple Framework for CV Pre-training Model (SOCO, VirTex, BEiT)☆15Oct 18, 2021Updated 4 years ago
- Official implementation of the OO-dMVMT paper☆11Jul 20, 2023Updated 2 years ago
- The PyTorch implementation of "FeatWalk: Enhancing Few-Shot Classification through Local View Leveraging", AAAI 2024.☆11Mar 4, 2024Updated last year
- ☆13Dec 1, 2025Updated 3 months ago
- ☆56Apr 28, 2025Updated 10 months ago
- The code of 'The devil is in the labels: Semantic segmentation from sentences'.☆13Nov 13, 2022Updated 3 years ago
- [ICCV 2025] Boosting Multi-View Indoor 3D Object Detection via Adaptive 3D Volume Construction☆23Oct 1, 2025Updated 5 months ago
- Open, royalty free, lyrics2song / song generation data collection / cleaning pipeline.☆17May 9, 2025Updated 9 months ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning".☆10Aug 14, 2024Updated last year
- Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡☆11Jan 23, 2025Updated last year
- [ECCV 22] LocVTP: Video-Text Pre-training for Temporal Localization☆39Jul 29, 2022Updated 3 years ago
- ☆46Mar 29, 2021Updated 4 years ago
- An AutoDL environment for native fine-tuning of Stable Diffusion.☆11Dec 7, 2022Updated 3 years ago