li-xirong / video-retrievalView external linksLinks
Deep Learning for Video Retrieval by Natural Language
☆11Oct 20, 2019Updated 6 years ago
Alternatives and similar repositories for video-retrieval
Users that are interested in video-retrieval are comparing it to the libraries listed below
Sorting:
- Elastic Workplace Search Official Python Client☆10Aug 8, 2024Updated last year
- Ad-hoc Video Search☆28Feb 18, 2021Updated 4 years ago
- Repository for ACL2020 paper "Refer360° A Referring Expression Recognition Dataset in 360°Images"☆13Jun 26, 2021Updated 4 years ago
- “Open terminals”, “load CSVs”, “start hacking”☆16May 2, 2017Updated 8 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Jul 18, 2024Updated last year
- Awesome Chinese Corpus Datasets and Models.☆18Oct 28, 2019Updated 6 years ago
- PyTorch implementation of Gaussian word embeddings☆19Apr 7, 2018Updated 7 years ago
- [CVPR2019] Dual Encoding for Zero-Example Video Retrieval☆153Jan 10, 2023Updated 3 years ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆18Jan 2, 2021Updated 5 years ago
- 语雀 Yuque python SDK & Command line interface☆17Sep 11, 2019Updated 6 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- Transformer model for the Amazon Topical-Chat Corpus. Baselines for DSTC9 Track 3.☆19Jul 9, 2020Updated 5 years ago
- Code for ICCV2021: Discovering Human Interactions with Large-Vocabulary Objects via Query and Multi-Scale Detection☆28Oct 12, 2021Updated 4 years ago
- baseline mode for the ObjectNet competition☆18Jan 13, 2021Updated 5 years ago
- Finetune CPM-1☆24Jun 20, 2021Updated 4 years ago
- Notes on Deep Reinforcement Learning for Natural Language Processing papers☆30Jul 17, 2017Updated 8 years ago
- W2VV++: A fully deep learning solution for ad-hoc video search☆29Jul 25, 2024Updated last year
- An implementation of DIP-VAE from the paper "Variational Inference of Disentangled Latent Concepts from Unlabelled Observations" by Kumar…☆26Apr 20, 2018Updated 7 years ago
- Code and Data for the paper Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References SIGdial 201…☆28Mar 6, 2020Updated 5 years ago
- Using business-level retrieval system (BM25) with Python in just a few lines.☆31Feb 3, 2023Updated 3 years ago
- LSH index for approximate set containment search☆61Jun 27, 2022Updated 3 years ago
- Code for SIGDial 2019 Best Paper: Structured Fusion Networks for Dialog https://arxiv.org/abs/1907.10016☆30Aug 19, 2019Updated 6 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Jul 22, 2023Updated 2 years ago
- ☆32Mar 7, 2022Updated 3 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Apr 9, 2022Updated 3 years ago
- We are creating a challenging new benchmark MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models. Retrieval quest…☆31Jul 9, 2020Updated 5 years ago
- ☆35Oct 21, 2023Updated 2 years ago
- ☆31Jun 19, 2020Updated 5 years ago
- A collection of strong multimodal models for building multimodal AGI agents☆44Jul 9, 2024Updated last year
- Panorama_498全景图像数据集☆14Apr 8, 2022Updated 3 years ago
- 哔哩哔哩-API收集整理【不断更新中....】☆10Apr 25, 2025Updated 9 months ago
- A part of the course Mobile Application Development☆13Nov 30, 2021Updated 4 years ago
- GestureX is an OpenCV-based hand motion sensing system for intuitive, efficient user control.This project aims to investigate the potenti…☆16Jun 29, 2024Updated last year
- lasertagger-chinese;lasertagger中文学习案例,案例数据,注释,shell运行☆76Mar 25, 2023Updated 2 years ago
- Official implementation of ICCV19 oral paper Zero-Shot grounding of Objects from Natural Language Queries (https://arxiv.org/abs/1908.071…☆71Apr 22, 2020Updated 5 years ago
- Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval☆68Apr 10, 2020Updated 5 years ago
- Sparse R-CNN 38.9 mAP, 640px(max side), 30.95fps(RTX 2080TI)☆30Dec 3, 2020Updated 5 years ago
- ☆42Sep 25, 2019Updated 6 years ago