An AI-powered interactive video retrieval system
☆53Apr 14, 2026Updated this week
Alternatives and similar repositories for visione
Users that are interested in visione are comparing it to the libraries listed below.
- [EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…☆14Mar 4, 2025Updated last year
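- Video retrieval based on the SG2300X [search video content using natural language and locate the specific time segments that match the description]☆13Feb 29, 2024Updated 2 years ago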
- [NeurIPS 2019] Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries☆12Apr 15, 2022Updated 4 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- A Web app demonstrating multimodal image search using Visualized-BGE model☆15Dec 1, 2024Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- Ask Questions in natural language and get Answers backed by private sources. Connects to tools like Slack, GitHub, Confluence, etc.☆32Updated this week
- Mixture-of-Embeddings-Experts☆122Jul 21, 2020Updated 5 years ago
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 6 months ago
- ☆13Sep 20, 2023Updated 2 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆35Jul 3, 2025Updated 9 months ago
- A lightweight Text-to-Image Retrieval model [Web App]☆29Dec 6, 2024Updated last year
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆64Dec 17, 2025Updated 3 months ago
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 7 years ago
- Maaagic UI is an open-source UI framework designed to empower developers with seamless integration and advanced features of AI applicatio…☆14Jun 18, 2024Updated last year
- LinVT: Empower Your Image-level Large Language Model to Understand Videos☆84Dec 30, 2024Updated last year
- This repository implements the training, testing and evaluation code for the "VQ-NeRV: A Vector Quantised Neural Representation for Video…☆10Feb 19, 2024Updated 2 years ago
- Realtime Face detection demo using YOLO v2 and OpenCV DNN module☆17Mar 10, 2018Updated 8 years ago
- ☆12Jan 9, 2024Updated 2 years ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 3 months ago
- Various video readers for PyTorch models training and a benchmark☆12Apr 8, 2026Updated last week
- Foundation Models for Video Understanding: A Survey☆141Jul 9, 2025Updated 9 months ago
- Implementation of camera calibration following this CVPR 2020 paper : https://openaccess.thecvf.com/content_CVPR_2020/papers/Sha_End-to-E…☆12Jun 4, 2021Updated 4 years ago
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆28Mar 10, 2026Updated last month
- GenFilesMCP: Minimal MCP Server for Open Web UI. Generates PPTX, XLSX, DOCX or MD files using user requests and full chat context. *Pul…☆75Apr 3, 2026Updated last week
- Better implementation of Kolmogorov Arnold Network☆26Jun 9, 2024Updated last year
- AutoQuant is an out-of-the-box quantitative investment platform.☆20Aug 1, 2023Updated 2 years ago
- Render pose sequences as photorealistic videos.☆19Jan 31, 2025Updated last year
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- ☆13Aug 21, 2022Updated 3 years ago
- ☆15Apr 6, 2023Updated 3 years ago
- Starter code for working with the YouTube-8M dataset.☆16Jun 9, 2017Updated 8 years ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Oct 9, 2024Updated last year
- Learning by Aligning Videos in Time (CVPR 2021)☆14Sep 10, 2023Updated 2 years ago
- C++ implementation for 《"GrabCut" — Interactive Foreground Extraction using Iterated Graph Cuts》☆12Jul 25, 2023Updated 2 years ago
- A CUDA implementation of Arithmetic Coding☆18Jan 21, 2025Updated last year
- Kolmogorov-Arnold networks (KAN) as implicit functions (like NeRF but simpler)☆15May 16, 2024Updated last year
- ☆59Feb 27, 2025Updated last year