GuyARoss / CLIP-video-searchLinks
demo natural language video db using CLIP
☆26Updated 11 months ago
Alternatives and similar repositories for CLIP-video-search
Users that are interested in CLIP-video-search are comparing it to the libraries listed below
Sorting:
- Chinese CLIP models with SOTA performance.☆55Updated last year
- ☆69Updated 2 years ago
- ☆28Updated 3 years ago
- ☆29Updated 3 years ago
- A multimodal image search engine built on the GME model, capable of handling diverse input types. Whether you're querying with text, imag…☆42Updated last month
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆26Updated last year
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p…☆14Updated last year
- Our 2nd-gen LMM☆33Updated last year
- Research Code for Multimodal-Cognition Team in Ant Group☆154Updated last week
- Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts☆41Updated 10 months ago
- ☆28Updated 3 years ago
- Code for the Video Similarity Challenge.☆81Updated last year
- ☆57Updated last year
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Updated last year
- Tensorflow implementation for Dash☆32Updated 2 years ago
- Codebase for the Recognize Anything Model (RAM)☆81Updated last year
- 本项目是关于Yi的多模态系列模型,如Yi-VL-6B/34B等的实验与应用。☆13Updated last year
- Repository for 23'MM accepted paper "Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Groundi…☆49Updated last year
- Multimodal chatbot with computer vision capabilities integrated, our 1st-gen LMM☆101Updated last year
- CLIP中文encoder☆22Updated 3 years ago
- Facebook Image Similarity Challenge 2021☆19Updated 3 years ago
- Code and model for the AI City Challenge (CVPR 2022) Track 3 Action Detection (Naturalistic Driving Action Recognition)☆28Updated last year
- [CVPR 2023 Workshop] The code reproduce the results of our solutions on both tracks for Meta AI Video Similarity Challenge (CVPR 2023 Wor…☆53Updated 2 years ago
- ☆22Updated 3 years ago
- [CVPR Challenge Rank 2nd] The codes and related files to reproduce the results for Video Similarity Challenge Descriptor Track.☆19Updated 3 months ago
- Vision-oriented multimodal AI☆49Updated last year
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆37Updated 10 months ago
- 2019 CCF 大数据与计算智能大赛 视频版权检测算法 复赛第8名 方案 | 8th place solution of Video Copyright Detection Algorithm Track, 2019 CCF Big Data & Computing Int…☆30Updated 5 years ago
- Large Multimodal Model☆15Updated last year
- official code for paper: Exploring Domain Incremental Video Highlights Detection with the LiveFood Benchmark☆37Updated last year