apple/ml-slowfast-llava

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/apple/ml-slowfast-llava)

apple / ml-slowfast-llava

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

☆291

Alternatives and similar repositories for ml-slowfast-llava

Users that are interested in ml-slowfast-llava are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

contrastive / FreeVideoLLM
View on GitHub
☆83Oct 31, 2024Updated last year
imagegridworth / IG-VLM
View on GitHub
☆138Sep 29, 2024Updated last year
tingyu215 / TS-LLaVA
View on GitHub
TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models
☆17Jan 2, 2025Updated last year
huofushuo / TIDNet
View on GitHub
Codes for Three-stream Interaction Decoder Network for RGB-Thermal Salient Object Detection
☆30May 12, 2022Updated 4 years ago
huofushuo / C2KD
View on GitHub
☆49Mar 21, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
IVGSZ / Flash-VStream
View on GitHub
This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"
☆287Oct 15, 2025Updated 9 months ago
huofushuo / HMMF
View on GitHub
accepted by ieee sensors journal
☆36Aug 30, 2020Updated 5 years ago
huofushuo / REQA
View on GitHub
☆30Oct 13, 2022Updated 3 years ago
huofushuo / PRWNet
View on GitHub
The source codes and results of Efficient Wavelet Boost Learning-Based Multi-stage Progressive Refinement Network for Underwater Image En…
☆41May 24, 2022Updated 4 years ago
huofushuo / PROCC
View on GitHub
The codes for 'Progressive cross-primitive consistency for open-world compositional zero-shot learning'
☆34Mar 21, 2024Updated 2 years ago
gls0425 / LinVT
View on GitHub
LinVT: Empower Your Image-level Large Language Model to Understand Videos
☆84Dec 30, 2024Updated last year
huofushuo / CSRNet
View on GitHub
☆35Jun 25, 2022Updated 4 years ago
ziplab / LongVLM
View on GitHub
☆108Jul 30, 2024Updated last year
Visual-AI / PruneVid
View on GitHub
[ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
☆72May 15, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
magic-research / PLLaVA
View on GitHub
Official repository for the paper PLLaVA
☆669Jul 28, 2024Updated 2 years ago
farewellthree / PPLLaVA
View on GitHub
Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"
☆133Nov 19, 2024Updated last year
LLaVA-VL / LLaVA-NeXT
View on GitHub
☆4,713Jun 15, 2026Updated last month
TencentARC / ST-LLM
View on GitHub
[ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"
☆153Sep 10, 2024Updated last year
mlvlab / ST-VLM
View on GitHub
☆13Mar 28, 2025Updated last year
huofushuo / NO-CL
View on GitHub
The codes for 'Non-Exemplar Online Class-incremental Continual Learning via Dual-prototype Self-augment and Refinement'
☆33Mar 21, 2024Updated 2 years ago
Ziyang412 / Video-RTS
View on GitHub
Code for EMNLP25 paper "Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning"
☆24Feb 18, 2026Updated 5 months ago
jiyt17 / IDA-VLM
View on GitHub
[ICLR 2025] IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
☆37Nov 27, 2024Updated last year
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
IVUL-KAUST / VideoAuto-R1
View on GitHub
[CVPR2026] VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice
☆88Feb 27, 2026Updated 5 months ago
KangsanKim07 / VideoICL
View on GitHub
[CVPR2025] VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
☆23Mar 24, 2025Updated last year
tianyi-lab / HallusionBench
View on GitHub
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(…
☆342Oct 14, 2025Updated 9 months ago
locuslab / llava-token-compression
View on GitHub
☆47Nov 8, 2024Updated last year
buyi-Yang / getQzonehistory
View on GitHub
☆12Nov 13, 2024Updated last year
tiremoscode / dw-grupo58
View on GitHub
☆20Nov 28, 2024Updated last year
abduvalimurodullayev1 / boilerplate_Drf
View on GitHub
This is the boilerplate for django project. There are so many settings configurations
☆10Nov 7, 2025Updated 8 months ago
yunlong10 / Awesome-LLMs-for-Video-Understanding
View on GitHub
🔥🔥🔥 [IEEE TCSVT] Latest Papers, Codes and Datasets on Vid-LLMs.
☆3,252Jun 13, 2026Updated last month
sudo-Boris / mr-Blip
View on GitHub
Official Implementation of "Chrono: A Simple Blueprint for Representing Time in MLLMs"
☆95Mar 9, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
longvideobench / LongVideoBench
View on GitHub
[Neurips 24' D&B] Official Dataloader and Evaluation Scripts for LongVideoBench.
☆134Jul 27, 2024Updated 2 years ago
mlvlab / vid-TLDR
View on GitHub
Official implementation of CVPR 2024 paper "vid-TLDR: Training Free Token merging for Light-weight Video Transformer".
☆55Oct 21, 2025Updated 9 months ago
showlab / VideoLISA
View on GitHub
[NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
☆149Dec 26, 2024Updated last year
wxh1996 / VideoAgent
View on GitHub
☆150Apr 16, 2025Updated last year
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆211Jul 17, 2025Updated last year
CircleRadon / TokenPacker
View on GitHub
The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM", IJCV2025
☆280May 26, 2025Updated last year
huofushuo / SID
View on GitHub
https://arxiv.org/abs/2408.02032
☆138Jan 16, 2025Updated last year