yuzhms/Streaming-Video-Model

[CVPR2023] Code for "Streaming Video Model"

☆78

Alternatives and similar repositories for Streaming-Video-Model

Users who are interested in Streaming-Video-Model are comparing it to the repositories listed below.

yliu-cs / PiTe
[ECCV'24 Oral] PiTe: Pixel-Temporal Alignment for Large Video-Language Model
☆17 · Feb 13, 2025 · Updated last year
lxtGH / Tube-Link
[ICCV 2023] Universal Video Segmentation for VSS, VPS and VIS
☆109 · Mar 18, 2024 · Updated 2 years ago
Lzq5 / Video-Text-Alignment
☆28 · Jul 18, 2025 · Updated last year
Owen-Tian / MAAM-NET
☆15 · Apr 9, 2023 · Updated 3 years ago
Mingzhen-Huang / DETracker
Tracking Multiple Deformable Objects in Egocentric Videos (CVPR 2023)
☆13 · Apr 10, 2023 · Updated 3 years ago
KimHanjung / VISAGE
[ECCV 2024] VISAGE: Video Instance Segmentation with Appearance-Guided Enhancement
☆38 · Jul 29, 2024 · Updated last year
kaiw7 / STG-CMA
Towards Efficient Audio-Visual Learners via Empowering Pre-trained Vision Transformers with Cross-Modal Adaptation
☆15 · Apr 13, 2024 · Updated 2 years ago
UCSC-VLAA / EVP
[TMLR'24] This repository includes the official implementation of our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
☆42 · Apr 30, 2024 · Updated 2 years ago
Huntersxsx / RIS-Learning-List
Related papers about Referring Image Segmentation (RIS)
☆16 · Dec 26, 2023 · Updated 2 years ago
Andy-Cheng / TEMPURA
TEMPURA enables video-language models to reason about causal event relationships and generate fine-grained, timestamped descriptions of u…
☆27 · Jun 4, 2025 · Updated last year
zhengrongz / AoTD
[CVPR 2025] Official PyTorch code of "Enhancing Video-LLM Reasoning via Agent-of-Thoughts Distillation".
☆58 · May 25, 2025 · Updated last year
jylins / hourllava
[NeurIPS 2025 Spotlight] Unleashing Hour-Scale Video Training for Long Video-Language Understanding
☆19 · Jun 24, 2025 · Updated last year
renwang435 / video-ttt-release
Test-Time Training on Video Streams
☆70 · Jul 24, 2023 · Updated 2 years ago
MCG-NJU / MGMAE
[ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding
☆26 · Oct 16, 2023 · Updated 2 years ago
OpenGVLab / unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
☆348 · May 27, 2024 · Updated 2 years ago
shiyi-zh0408 / LOGO
[CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment
☆48 · Apr 9, 2024 · Updated 2 years ago
chenwei746 / EEVG
☆23 · Aug 20, 2024 · Updated last year
Tencent-QQMM / Video-CCAM
A lightweight, flexible Video-MLLM developed by TencentQQ Multimedia Research Team.
☆73 · Oct 14, 2024 · Updated last year
TalalWasim / Vita-CLIP
Official repository for "Vita-CLIP: Video and text adaptive CLIP via Multimodal Prompting" [CVPR 2023]
☆126 · Jul 1, 2023 · Updated 3 years ago
AnasEmad11 / C2FPL
A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection
☆21 · May 18, 2024 · Updated 2 years ago
Lipurple / ARIS
A Simple Plugin for Transforming Images to Arbitrary Scales
☆19 · Feb 9, 2023 · Updated 3 years ago
whwu95 / BIKE
【CVPR'2023】 Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models
☆156 · Sep 9, 2024 · Updated last year
wengzejia1 / Open-VCLIP
☆119 · Feb 19, 2024 · Updated 2 years ago
GitHubOfHyl97 / SkeAttnCLR
The Official PyTorch implementation of "Part Aware Contrastive Learning for Self-Supervised Action Recognition" in IJCAI 2023
☆13 · Nov 9, 2023 · Updated 2 years ago
tomoyukun / convolutional-pose-machines-chainer
☆12 · May 21, 2017 · Updated 9 years ago
sukjunhwang / set_classifier
Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…
☆14 · Aug 29, 2022 · Updated 3 years ago
zugexiaodui / campus_vad_code
☆26 · Nov 7, 2023 · Updated 2 years ago
HumanMLLM / ViSpeak
(ICCV2025) Official repository of paper "ViSpeak: Visual Instruction Feedback in Streaming Videos"
☆52 · Jul 1, 2025 · Updated last year
RobMulla / helmet-assignment
Helper code for the 2021 Kaggle NFL Helmet Assignment Task
☆13 · Sep 22, 2021 · Updated 4 years ago
xujinglin / FineDiving
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
☆153 · Aug 26, 2024 · Updated last year
microsoft / AudioEntailment
Audio Entailment: Deductive Reasoning for Audio Understanding
☆17 · Dec 10, 2024 · Updated last year
jinxiang-liu / UFE-AVS
Official code for the CVPR 2024 paper "Audio-Visual Segmentation via Unlabeled Frame Exploitation"
☆19 · Jul 7, 2024 · Updated 2 years ago
TACJu / Axial-VS
This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
☆27 · Mar 20, 2025 · Updated last year
txyugood / PaddlePoseC3D
☆21 · Jan 29, 2023 · Updated 3 years ago
kyegomez / MGQA
The open source implementation of the multi grouped query attention from the paper "GQA: Training Generalized Multi-Query Transformer Model…
☆16 · Dec 11, 2023 · Updated 2 years ago
daniel-code / TubeViT
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95 · Updated this week
haoyanbin918 / Attention-in-Attention
☆12 · Aug 5, 2022 · Updated 3 years ago
dominickrei / PoseAwareVT
Code for the paper "Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers"
☆21 · Aug 2, 2024 · Updated last year
visinf / veto
Vision Relation Transformer for Unbiased Scene Graph Generation (ICCV 2023)
☆22 · Mar 23, 2026 · Updated 3 months ago