A simple video streaming baseline that outperforms SOTAs.
☆144May 1, 2026Updated last month
Alternatives and similar repositories for SimpleStream
Users that are interested in SimpleStream are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆42May 8, 2026Updated last month
- Streaming Video Instruction Tuning☆76Feb 25, 2026Updated 4 months ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆25Jan 10, 2025Updated last year
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- [ICRA 2026] UltraDexGrasp: Learning Universal Dexterous Grasping for Bimanual Robots with Synthetic Data☆79Mar 6, 2026Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🔥🔥🔥 [Awesome] Latest Papers, Codes & Datasets on Streaming / Online Video Understanding — Building Always-on, Real-time Video AI 🤖☆319Updated this week
- ☆15Sep 17, 2023Updated 2 years ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated 2 years ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 4 months ago
- ☆26Jun 5, 2025Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 3 years ago
- ACL'2023: Few-shot Event Detection: An Empirical Study and a Unified View☆11Mar 13, 2024Updated 2 years ago
- [SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation☆158Jan 18, 2026Updated 5 months ago
- Code for reproducing the results in "How Well do Sparse Imagenet Models Transfer?", presented at CVPR 2022☆10Jun 3, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [Paper] SoMoFormer: Multi-Person Pose Forecasting with Transformers☆26Mar 1, 2023Updated 3 years ago
- SSAP: Single-Shot Instance Segmentation With Affinity Pyramid☆11Aug 6, 2019Updated 6 years ago
- Reasoning in Space via Grounding in the World (ICLR 2025)☆55Nov 3, 2025Updated 7 months ago
- ☆10May 17, 2024Updated 2 years ago
- 🔥An open-source survey of the latest video reasoning tasks, paradigms, and benchmarks.☆184Jun 14, 2026Updated 2 weeks ago
- ☆14Nov 7, 2022Updated 3 years ago
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- https://github.com/bernakabadayi/ganavatar☆12Oct 8, 2024Updated last year
- Official implementation for RoMaP :Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampl…☆22Aug 5, 2025Updated 10 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Constraint Satisfaction Visual Grounding☆15Aug 10, 2025Updated 10 months ago
- ☆11Jun 21, 2025Updated last year
- Code for one-stage adaptive set-based HOI detector AS-Net.☆52May 8, 2021Updated 5 years ago
- [NeurIPS 2025] 𝓡𝓣𝓥-𝓑𝓮𝓷𝓬𝓱: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video.☆32Jan 15, 2026Updated 5 months ago
- ☆17Jun 10, 2024Updated 2 years ago
- ☆21Aug 22, 2025Updated 10 months ago
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆29Nov 28, 2025Updated 7 months ago
- Image Compositing for Segmentation of Surgical Tools without Manual Annotations☆10May 20, 2021Updated 5 years ago
- ☆17May 24, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- The official implementation of the W&B Models and Weave MCP server.☆63Jun 18, 2026Updated last week
- [CVPR 2026] Towards Holistic Modeling for Video Frame Interpolation with Auto-regressive Diffusion Transformers☆42May 3, 2026Updated last month
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Jun 22, 2022Updated 4 years ago
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding☆72Apr 29, 2026Updated 2 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models☆44Jan 5, 2026Updated 5 months ago
- Mancs: A multi-task attentional network with curriculum sampling for person re-identification☆13Aug 5, 2019Updated 6 years ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆19Dec 27, 2024Updated last year