Arun-George-Zachariah / awesome-video-retrieval-papersView external linksLinks
List of resources for video retrieval.
☆20Mar 17, 2022Updated 3 years ago
Alternatives and similar repositories for awesome-video-retrieval-papers
Users that are interested in awesome-video-retrieval-papers are comparing it to the libraries listed below
Sorting:
- (WACV 2021) Temporal Context Aggregation for Video Retrieval with Contrastive Learning☆29Aug 4, 2021Updated 4 years ago
- TransVCL: Attention-enhanced Video Copy Localization Network with Flexible Supervision [AAAI2023 Oral]]☆57Feb 25, 2023Updated 2 years ago
- Authors official PyTorch implementation of the "ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning" [ICCV 2019]☆223Mar 6, 2024Updated last year
- Towards Local Visual Modeling for Image Captioning☆29Mar 31, 2023Updated 2 years ago
- Authors official PyTorch implementation of the "DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval" [IJCV 20…☆70Apr 12, 2023Updated 2 years ago
- Implementation of various handwritten text line segmentation☆10Jan 6, 2020Updated 6 years ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- [TPAMI-2018] A C++ framework for training/testing Support Vector Machine with Gaussian Sample Uncertainty (SVM-GSU).☆13Feb 20, 2018Updated 7 years ago
- Official Repository of RefChartQA: Grounding Visual Answer on Chart Images through Instruction Tuning☆14Jul 9, 2025Updated 7 months ago
- [CVPR 2025] Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding☆15Jun 16, 2025Updated 8 months ago
- Data Programming for Text Detection in Documents using SPEAR☆12Mar 26, 2025Updated 10 months ago
- NLP Workshops☆11Apr 24, 2025Updated 9 months ago
- Quantization of LLMs and benchmarking.☆10Apr 3, 2024Updated last year
- A curated list of resources on Document Layout Analysis☆11Aug 7, 2025Updated 6 months ago
- ☆12Dec 15, 2022Updated 3 years ago
- ☆15Sep 11, 2025Updated 5 months ago
- STRExp is a framework that provides Explainability (XAI) to Scene Text Recognition (STR) models.☆11Nov 27, 2023Updated 2 years ago
- Official implementation for AAAI 2025 paper: SSAN: A Symbol Spatial-Aware Network for Handwritten Mathematical Expression Recognition☆15Jan 21, 2025Updated last year
- Implementation of the DocLLM paper for Llama models.☆13Apr 6, 2025Updated 10 months ago
- [CVPR 2024] Official repository of ST_GT☆10Sep 15, 2024Updated last year
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 2 months ago
- Agentic Keyframe Search for Video Question Answering☆16Apr 7, 2025Updated 10 months ago
- Official implementation of "ConViS-Bench: Estimating Video Similarity Through Semantic Concepts", NeurIPS 2025☆25Nov 28, 2025Updated 2 months ago
- MathNet: A Data-Centric Approach, Dataset and Benchmark Model to Advance Mathematical Expression Recognition☆10Mar 19, 2025Updated 10 months ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 2 years ago
- [WACV2025] source code of StrDA: https://arxiv.org/abs/2410.09913☆12Apr 15, 2025Updated 10 months ago
- Official code for the paper: "Perception and Semantic Aware Regularization for Sequential Confidence Calibration (CVPR2023)"☆10May 15, 2024Updated last year
- [ICPR-2024] S-MultiMAE - A Multi-Ground Truth approach for RGB-D Saliency Detection☆12Dec 13, 2024Updated last year
- SAN: Structure-Aware Network for Complex and Long-tailed Chinese Text Recognition☆10Apr 8, 2024Updated last year
- [ECCV 2022] Dual-Evidential Learning for Weakly-supervised Temporal Action Localization☆49Apr 19, 2024Updated last year
- The official implementation of 'Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation' (CVPR 2…☆46Nov 7, 2022Updated 3 years ago
- A collection of AI-generated images papers and corresponding source code/demo program, including text-to-image, image translation (e.g., …☆13Nov 21, 2023Updated 2 years ago
- Big Data Resources and References☆13Sep 4, 2024Updated last year
- [CVPR 2025] Official Repository of the paper "On the Consistency of Video Large Language Models in Temporal Comprehension"☆16Oct 13, 2025Updated 4 months ago
- Repository for the CVPR23 paper Re^2TAL☆13Nov 21, 2025Updated 2 months ago
- All coursework for the Learn Python Programming Masterclass by Tim Buchalka and Jean-Paul Roberts.☆12May 5, 2022Updated 3 years ago
- For the Kaggle Competition on object detection with same name. 1) models used are DETR, EfficientDet, YOLOv5, RetinaNet, FasterRCNN. 2) E…☆12Jul 20, 2022Updated 3 years ago
- pytorch implementation of Semantics-AssistedVideoCaptioning☆11Feb 16, 2023Updated 3 years ago
- The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.☆17Jun 5, 2024Updated last year