An AI-powered interactive video retrieval system
☆56May 4, 2026Updated this week
Alternatives and similar repositories for visione
Users that are interested in visione are comparing it to the libraries listed below.
- [ICCV 2023] Simple Baselines for Interactive Video Retrieval with Questions and Answers☆19Apr 16, 2024Updated 2 years ago
- Towards Efficient and Effective Text-to-Video Retrieval with Coarse-to-Fine Visual Representation Learning☆21Feb 19, 2025Updated last year
- gradio web ui for musepose☆15Jun 29, 2024Updated last year
- [ECCV-W] Official repo for the paper "ComiCap: A VLMs pipeline for dense captioning of Comic Panels"☆14Nov 20, 2024Updated last year
- ☆20Oct 18, 2024Updated last year
- Mixture-of-Embeddings-Experts☆122Jul 21, 2020Updated 5 years ago
- Infrastructure useful to create natural language processing systems based on transformer networks☆12Sep 26, 2019Updated 6 years ago
- A video highlights creator☆12Jun 1, 2024Updated last year
- Agentic Keyframe Search for Video Question Answering☆18Apr 7, 2025Updated last year
- More reliable Video Understanding Evaluation☆15Sep 23, 2025Updated 7 months ago
- Related work on Temporal Sentence Grounding in Videos / Natural Language Video Localization / Video Moment Retrieval☆30Mar 4, 2022Updated 4 years ago
- Learning to Count without Annotations☆23May 24, 2024Updated last year
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆36Jul 3, 2025Updated 10 months ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆69Dec 17, 2025Updated 4 months ago
- ☆12Sep 15, 2024Updated last year
- build vgg16 with pytorch 0.4.0 for classification of CIFAR datasets☆10Mar 31, 2019Updated 7 years ago
- ☆26Oct 22, 2024Updated last year
- A simple, out-of-the-box, cross-platform bbox annotation tool written in Python. Try it with `pip install easybox`☆10May 28, 2021Updated 4 years ago
- Personalized and Interactive Music Recommendation with Bandit approach☆11Sep 15, 2019Updated 6 years ago
- LinVT: Empower Your Image-level Large Language Model to Understand Videos☆84Dec 30, 2024Updated last year
- AIME API Server - Scalable AI Model Inference API Server☆15Sep 19, 2025Updated 7 months ago
- This repository implements the training, testing and evaluation code for the "VQ-NeRV: A Vector Quantised Neural Representation for Video…☆10Feb 19, 2024Updated 2 years ago
- BED-AIO team code for AIChallenge2023☆43Jul 30, 2024Updated last year
- ☆12Jan 9, 2024Updated 2 years ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Apr 18, 2026Updated 2 weeks ago
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- ☆16Sep 27, 2023Updated 2 years ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- A small package for handy conversion of german numerals (also ordinal / signed) written as words to numbers.☆12Jan 22, 2026Updated 3 months ago
- Foundation Models for Video Understanding: A Survey☆142Jul 9, 2025Updated 9 months ago
- Implementation of camera calibration following this CVPR 2020 paper : https://openaccess.thecvf.com/content_CVPR_2020/papers/Sha_End-to-E…☆13Jun 4, 2021Updated 4 years ago
- Multi Task GPT Model for Sign Language☆14Feb 16, 2025Updated last year
- Python library for building and sharing dataframe-agnostic, sklearn-style transformers and ml models for data science competitions.☆27Mar 10, 2026Updated last month
- williamFalcon / Predicting-floor-level-for-911-Calls-with-Neural-Networks-and-Smartphone-Sensor-Data: Code + data for predicting floor location from smartphone sensor data☆11Mar 16, 2018Updated 8 years ago
- FlappyBird Reinforcement Learning based on Pygame, OpenCV, Tensorflow☆14Mar 16, 2020Updated 6 years ago
- Starter code for working with the YouTube-8M dataset.☆16Jun 9, 2017Updated 8 years ago
- Exercises && Drills from Programming Principles and Practice using C++ by Bjarne Stroustrup (2nd Edition)☆10May 15, 2024Updated last year
- Code of StyleCrafter on SDXL☆20Jun 25, 2024Updated last year