☆97Sep 19, 2024Updated last year
Alternatives and similar repositories for fineVideo
Users that are interested in fineVideo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Video-LlaVA fine-tune for CinePile evaluation☆51Aug 8, 2024Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆93Apr 15, 2026Updated last month
- ☆23Apr 17, 2026Updated last month
- Repository for opt-out requests.☆10Mar 25, 2024Updated 2 years ago
- Unofficial Implementation of Selective Attention Transformer☆20Oct 31, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆26Updated this week
- ☆26Dec 13, 2024Updated last year
- Github action to connect to tailscale☆20Apr 21, 2026Updated last month
- Reinforcement Learning Tuning for VideoLLMs: Reward Design and Data Efficiency☆62Jun 6, 2025Updated 11 months ago
- ☆16Jul 8, 2024Updated last year
- Using short models to classify long texts☆21Mar 8, 2023Updated 3 years ago
- Official PyTorch Implementation of Opt-CWM: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals.☆23Mar 27, 2025Updated last year
- ☆32Jul 29, 2024Updated last year
- Chunk Dedupe Estimation☆20Nov 5, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆21Apr 7, 2021Updated 5 years ago
- [ICCVW 25] LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning☆159Aug 8, 2025Updated 9 months ago
- helper functions for processing and integrating visual language information with Qwen-VL Series Model☆17Aug 30, 2024Updated last year
- Official implement for LaserHuman.☆35Mar 29, 2025Updated last year
- ☆92Apr 28, 2026Updated 3 weeks ago
- Large Language Model Text Generation Inference on Habana Gaudi☆34Mar 20, 2025Updated last year
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.☆77Jul 14, 2025Updated 10 months ago
- 👾 E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding (NeurIPS 2024)☆74Jan 20, 2025Updated last year
- ☆209Apr 22, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Quick exploration into fine tuning florence 2☆340Sep 19, 2024Updated last year
- ☆27Mar 3, 2025Updated last year
- [CVPR 2024] MovieChat: From Dense Token to Sparse Memory for Long Video Understanding☆698Jan 29, 2025Updated last year
- Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M d…☆214Aug 28, 2024Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆23Jul 30, 2024Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆287Jul 11, 2024Updated last year
- The official repo for "OpenMoE 2: Sparse Diffusion Language Models".☆56Dec 28, 2025Updated 4 months ago
- ☆21Nov 18, 2024Updated last year
- ☆16Aug 1, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2025] Official PyTorch implementation of LongVU☆425May 8, 2025Updated last year
- Official implementation of EgoHOD at ICLR 2025; 14 EgoVis Challenge Winners in CVPR 2024☆33Nov 25, 2025Updated 6 months ago
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆198Sep 26, 2025Updated 7 months ago
- [ICLR 2025] AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark☆142Jun 4, 2025Updated 11 months ago
- Deep Speech Distances PyTorch☆29Feb 21, 2022Updated 4 years ago
- A Swift wrapper for the Supertone text-to-speech model☆34Dec 11, 2025Updated 5 months ago
- Official PyTorch implementation of the paper "Chapter-Llama: Efficient Chaptering in Hour-Long Videos with LLMs"☆98Jun 6, 2025Updated 11 months ago