BiC-Net: Learning Efficient Spatio-Temporal Relation for Text-Video Retrieval
☆26Jul 22, 2022Updated 3 years ago
Alternatives and similar repositories for BiC-Net
Users that are interested in BiC-Net are comparing it to the libraries listed below
Sorting:
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Jan 27, 2023Updated 3 years ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆24Jul 5, 2022Updated 3 years ago
- ☆26Oct 20, 2021Updated 4 years ago
- The public source code of "FreCaS: Efficient Higher-Resolution Image Generation via Frequency-aware Cascaded Sampling"☆29Jul 7, 2025Updated 7 months ago
- Code and benchmarks for the Semantic Video Retrieval Task☆53Oct 18, 2022Updated 3 years ago
- Using Color Histogram, SVD and Dynamic Clustering Method obtained Key-Frames from a video. This analysis can be used to identify frames w…☆24Dec 14, 2020Updated 5 years ago
- Source code of the paper "An efficient implementation for solving the all pairs minimax path problem in an undirected dense graph."☆16Dec 3, 2025Updated 3 months ago
- 新词发现/新词挖掘/自由度/凝固度/python3☆10May 28, 2019Updated 6 years ago
- Style Transfer by Rigid Alignment in Neural Net Feature Space☆11Jan 23, 2021Updated 5 years ago
- Finetuning & extending DiffusionDet to video & pedestrian multi-object-tracking☆13Apr 12, 2023Updated 2 years ago
- Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)☆38Mar 11, 2024Updated last year
- ☆32Jun 22, 2022Updated 3 years ago
- ☆12Aug 30, 2022Updated 3 years ago
- 豆瓣电影评论可视化☆10May 19, 2016Updated 9 years ago
- Detects scene change or cuts in a video file☆11Oct 23, 2017Updated 8 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- Near Duplicate Video Retrieval☆44Sep 30, 2020Updated 5 years ago
- Towards Photorealistic 4D Scene Generation via Video Diffusion Models☆20Jun 12, 2024Updated last year
- [CVPR 2023] VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval☆38Feb 28, 2023Updated 3 years ago
- [NeurIPS 2022] PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points☆46Nov 24, 2023Updated 2 years ago
- WoBERT_pytorch☆40Apr 18, 2021Updated 4 years ago
- Generic classification model☆10Apr 2, 2025Updated 11 months ago
- RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering☆10Nov 27, 2022Updated 3 years ago
- [ICML 2025] Repository for M3-JEPA: Multimodal Alignment via Multi-gate MoE based on the Joint-Predictive Embedding Architecture☆20Nov 4, 2025Updated 4 months ago
- [CoRL 2024] Software and hardware instructions for SoniceSense.☆15Mar 1, 2025Updated last year
- Official repo for FunkNN: Neural Interpolation for Functional Generation☆11May 12, 2023Updated 2 years ago
- ☆11Feb 9, 2026Updated 3 weeks ago
- Autoencoder for multi-label classification using Google's Tensorflow framework and MDMR for feature selection.☆10Aug 31, 2017Updated 8 years ago
- ☆10Nov 18, 2024Updated last year
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- Vectorize Image Data to SVG using POTRACE. Based on multilabel-potrace by Hugo Raguet, which is based on potrace by Peter Selinger.☆15Jul 26, 2025Updated 7 months ago
- Video Summarization Transformer: Implementation in PyTorch of the Transformer model for video summarisation☆10Oct 27, 2020Updated 5 years ago
- ☆11Jul 17, 2024Updated last year
- ☆10Jul 20, 2020Updated 5 years ago
- 一个基于trie树的具有联想功能的文本编辑器。采用python和pyqt☆10Sep 7, 2016Updated 9 years ago
- ☆12Aug 5, 2022Updated 3 years ago
- A feishu bot daily push arxiv latest articles.☆10Nov 28, 2021Updated 4 years ago
- ☆12Oct 4, 2021Updated 4 years ago