aurateam2026/AURA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aurateam2026/AURA)

aurateam2026 / AURA

☆114

Alternatives and similar repositories for AURA

Users that are interested in AURA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

PeiwenSun2000 / X-Stream
View on GitHub
Official Repo of "$X$-Stream: Exploring MLLMs as Multiplexers for Multi-Stream Understanding"
☆33Jun 18, 2026Updated last month
maifoundations / Streamo
View on GitHub
Streaming Video Instruction Tuning
☆79Feb 25, 2026Updated 4 months ago
yellow-binary-tree / MMDuet2
View on GitHub
[ICLR 2026] MMDuet2: Enhancing Proactive Interaction of Video MLLMs with Multi-Turn Reinforcement Learning
☆40Jan 14, 2026Updated 6 months ago
air-embodied-brain / Em-Garde
View on GitHub
Implementation of Em_Garde: a proposal-retrieval framework for streaming video understanding
☆26Jun 24, 2026Updated 3 weeks ago
lrslab / nanoSundial
View on GitHub
A de novo modification detection tool that targets current features based on 004kit for prokaryotes
☆22May 16, 2026Updated 2 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OmniMMI / OmniMMI
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆23Updated this week
CASIA-IVA-Lab / ThinkStream
View on GitHub
☆40Jun 18, 2026Updated last month
sotayang / Awesome-Streaming-Video-Understanding
View on GitHub
🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖
☆410Jul 2, 2026Updated 2 weeks ago
OmniMMI / M4
View on GitHub
[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts
☆18Apr 2, 2025Updated last year
jinpeng0528 / BalConpas
View on GitHub
Code release for "Strike a Balance in Continual Panoptic Segmentation" (ECCV 2024)
☆14Mar 14, 2025Updated last year
EIT-NLP / Speak-While-Watching
View on GitHub
☆17Mar 1, 2026Updated 4 months ago
JoeLeelyf / OVO-Bench
View on GitHub
[CVPR 2025] OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?
☆153Jul 24, 2025Updated 11 months ago
wanglu-cs / Think_While_Watching
View on GitHub
☆19Jun 26, 2026Updated 3 weeks ago
apple / ml-streambridge
View on GitHub
☆40Nov 5, 2025Updated 8 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
yellow-binary-tree / ProactiveVideoQA
View on GitHub
ProactiveBench: A Comprehensive Benchmark for VideoLLM Proactive Interaction Evaluation
☆18Jan 8, 2026Updated 6 months ago
EIT-NLP / StreamingLLM
View on GitHub
Repository of Streaming LLMs
☆90Jun 20, 2026Updated last month
VisionXLab / FIRM-Reward
View on GitHub
Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation
☆40Mar 13, 2026Updated 4 months ago
haowei-freesky / HERMES
View on GitHub
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
☆92May 8, 2026Updated 2 months ago
ydyhello / Awesome-VLM-Streaming-Video
View on GitHub
📚 A curated collection of papers and open-source code repositories dedicated to the application of Vision-Language Models (VLMs) for str…
☆188Jun 10, 2026Updated last month
MCG-NJU / StreamForest
View on GitHub
[NeurIPS 2025 Spotlight] StreamForest: Efficient Online Video Understanding with Persistent Event Memory
☆131Nov 4, 2025Updated 8 months ago
EvolvingLMMs-Lab / SimpleStream
View on GitHub
A simple video streaming baseline that outperforms SOTAs.
☆148May 1, 2026Updated 2 months ago
SooLab / EyeWO
View on GitHub
[NeurIPS2025] The official PyTorch implementation of the "Eyes Wide Open: Ego Proactive Video-LLM for Streaming Video".
☆34Dec 25, 2025Updated 6 months ago
yaolinli / TimeChat-Online
View on GitHub
[ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos
☆132Jun 29, 2026Updated 3 weeks ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
VisionXLab / Moment-Video
View on GitHub
☆18Jun 2, 2026Updated last month
yanghaha0908 / WavCube
View on GitHub
Official code for "WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling"
☆62Jun 27, 2026Updated 3 weeks ago
EIT-NLP / Awesome-Streaming-LLMs
View on GitHub
🔥This is a repository of paper list for streaming LLMs/MLLMs.
☆24Apr 19, 2026Updated 3 months ago
jinpeng0528 / STAR
View on GitHub
Code release for "Saving 100x Storage: Prototype Replay for Reconstructing Training Sample Distribution in Class-Incremental Semantic Seg…
☆20Mar 19, 2025Updated last year
mit-han-lab / streaming-vlm
View on GitHub
StreamingVLM: Real-Time Understanding for Infinite Video Streams
☆1,046Oct 15, 2025Updated 9 months ago
xinding-bot / StreamMind
View on GitHub
[ICCV 2025] StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition
☆72Jun 25, 2025Updated last year
VisionXLab / GRADE
View on GitHub
[ECCV'26] GRADE: Grounded Reasoning Assessment for Discipline-informed Editing
☆28Apr 23, 2026Updated 2 months ago
minghangz / OnVTG
View on GitHub
Online video temporal grounding
☆16Oct 20, 2025Updated 9 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
VisionXLab / Rise-Video
View on GitHub
RISE-Video: Can Video Generators Decode Implicit World Rules?
☆28Mar 26, 2026Updated 3 months ago
VisionXLab / EvoTok
View on GitHub
[ECCV'26] Code repo for "EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation"
☆22Jun 18, 2026Updated last month
mathllm / VoiceAssistant-Eval
View on GitHub
A rigorous framework for evaluating and guiding the development of next-generation AI assistants.
☆19Jan 26, 2026Updated 5 months ago
yellow-binary-tree / MMDuet
View on GitHub
Official implementation of paper VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interact…
☆44Feb 5, 2025Updated last year
hyzhang24 / DuplexSLA
View on GitHub
DuplexSLA: A Full-Duplex Spoken Language Model with Synchronized Speech, Language, and Action
☆99May 20, 2026Updated 2 months ago
1ranGuan / VST
View on GitHub
[ECCV 26] Video Streaming Thinking
☆114Jun 18, 2026Updated last month
SWivid / AUV
View on GitHub
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28Oct 11, 2025Updated 9 months ago