[ICLR2026] Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools
☆207Apr 17, 2026Updated 2 months ago
Alternatives and similar repositories for Video-STAR
Users that are interested in Video-STAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2026 Findings] Eevee: Towards Close-up High-resolution Video-based Virtual Try-on☆76Feb 27, 2026Updated 4 months ago
- ☆56Jun 3, 2026Updated 3 weeks ago
- DreamX-World: A General-Purpose Interactive World Model☆559Jun 22, 2026Updated last week
- ☆61Feb 9, 2026Updated 4 months ago
- A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.☆107Mar 24, 2026Updated 3 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model.☆86Jun 30, 2025Updated 11 months ago
- ☆17Mar 25, 2025Updated last year
- [ICLR2026] AutoDrive-R2: Incentivizing Reasoning and Self-Reflection Capacity for VLA Model in Autonomous Driving☆209May 20, 2026Updated last month
- [ICLR2026] Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models☆131Jan 30, 2026Updated 4 months ago
- ☆10Nov 12, 2024Updated last year
- official implementation of the paper "Delving into Latent Spectral Biasing of Video VAEs for Superior Diffusability".☆69Dec 25, 2025Updated 6 months ago
- [2026 CVPR]Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation☆111Apr 15, 2026Updated 2 months ago
- [ACL 2026 Findings] Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization☆176Mar 9, 2026Updated 3 months ago
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆97Oct 11, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official PyTorch implementation of HCCNet: Efficient Semantic Matching with Hypercolumn Correlation (WACV '24 Oral, Best paper finalist (…☆11Apr 29, 2024Updated 2 years ago
- Demo for testing dynamically load the libos module.☆10Nov 8, 2023Updated 2 years ago
- [ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models☆113Jul 28, 2025Updated 11 months ago
- ☆191Jun 10, 2026Updated 2 weeks ago
- ☆16Jan 7, 2025Updated last year
- 基于 Next.js App Router 的博客与后台一体化项目:公开前台、管理员 CMS、AI 写作助手,以及基于向量检索(RAG)的站内智能问答。☆83Jun 21, 2026Updated last week
- 企业协同运营管理系统。项目采用前后端分离架构,前端基于 Vue 3 + Vite,后端基于 Spring Boot + MyBatis-Plus,数据库使用 MySQL。系统围绕实际仓储业务场景,完成了用户与权限管理、基础资料维护、进销退存流程、统计分析与可视化展示,具备完…☆114Updated this week
- Simple Face++ Python demo☆12Nov 18, 2019Updated 6 years ago
- ☆51May 8, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Askme — Voice AI assistant☆80May 20, 2026Updated last month
- Leveraging AI, this solution boosts 360° video quality through 4x upscaling with Real-ESRGAN. It integrates GFPGAN for smart face enhance…☆25Jun 27, 2025Updated last year
- Drift: DLM Reinforcement Learning Training Framework☆239May 31, 2026Updated 3 weeks ago
- No.5 solution to non-targeted attack in IJCAI-2019 Alibaba Adversarial AI Challenge (AAAC 2019))☆12Oct 27, 2020Updated 5 years ago
- ☆19Dec 15, 2025Updated 6 months ago
- Deep Photo-to-Sketch Synthesis Framework☆10Nov 23, 2018Updated 7 years ago
- Deep Learning for NLP☆12Dec 7, 2022Updated 3 years ago
- A fork of COIN's VRPH☆11Dec 12, 2017Updated 8 years ago
- ☆42Oct 20, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation☆75Oct 10, 2025Updated 8 months ago
- This is the official repository for "BokehDiff: Neural Lens Blur with One-Step Diffusion" (ICCV'25).☆52Apr 27, 2026Updated 2 months ago
- 记录我的编程学习与成长之旅。涵盖技术笔记、项目实践、心得体会与日常思考☆86May 8, 2026Updated last month
- The vite plugin for multi-page application☆86Jun 15, 2025Updated last year
- Machine learning and decision intelligence models designed to improve healthcare safety through clinical risk prediction and medical inte…☆178Mar 19, 2026Updated 3 months ago
- ☆69Apr 8, 2025Updated last year
- [ICLR2026] There is No VAE: End-To-End Pixel-Space Generative Modeling Via Self-Supervised Pre-Training☆150Mar 27, 2026Updated 3 months ago