☆101Sep 19, 2024Updated last year
Alternatives and similar repositories for fineVideo
Users that are interested in fineVideo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A huge dataset for Document Visual Question Answering☆24Jul 29, 2024Updated last year
- Video-LlaVA fine-tune for CinePile evaluation☆51Aug 8, 2024Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆96May 28, 2026Updated last month
- Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals; ACL 2024☆13May 24, 2024Updated 2 years ago
- ☆25Dec 13, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- YOLOv10: Real-Time End-to-End Object Detection☆12May 24, 2024Updated 2 years ago
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated last year
- Learning to cut end-to-end pretrained modules☆38Apr 17, 2025Updated last year
- ☆16Jul 8, 2024Updated last year
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 9 months ago
- Hugging Face Jobs☆20Jul 11, 2025Updated 11 months ago
- ☆32Jul 29, 2024Updated last year
- SmolVLM2 Demo☆189Mar 20, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]☆878Dec 14, 2025Updated 6 months ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Apr 7, 2021Updated 5 years ago
- [ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning☆160Aug 8, 2025Updated 10 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Official implement for LaserHuman.☆35Mar 29, 2025Updated last year
- ☆94Apr 28, 2026Updated 2 months ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Mar 20, 2025Updated last year
- 👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)☆74Jan 20, 2025Updated last year
- ☆22Jun 30, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆222Updated this week
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆215Aug 28, 2024Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆288Jul 11, 2024Updated last year
- ☆21Nov 18, 2024Updated last year
- [ICASSP 2022] Official PyTorch Implementation for "Attention Probe: Vision Transformer Distillation in the Wild" (ICASSP 2022)☆11Jan 23, 2022Updated 4 years ago
- ☆16Aug 1, 2024Updated last year
- [ICML 2025] Official PyTorch implementation of LongVU☆427May 8, 2025Updated last year
- [ICCV 2025] LVBench: An Extreme Long Video Understanding Benchmark☆144Jul 9, 2025Updated 11 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆201Sep 26, 2025Updated 9 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆145Jun 4, 2025Updated last year
- [PR 2024] A large Cross-Modal Video Retrieval Dataset with Reading Comprehension☆31Dec 28, 2023Updated 2 years ago
- The repository for IEEE CVPR 2023 (A Light Weight Model for Active Speaker Detection)☆178Mar 23, 2025Updated last year
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"☆99Jun 6, 2025Updated last year
- ANE accelerated embedding models!☆20Dec 11, 2024Updated last year
- Awesome papers & datasets specifically focused on long-term videos.☆380Oct 9, 2025Updated 8 months ago