☆36Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for T3-Video
Users that are interested in T3-Video are comparing it to the libraries listed below
Sorting:
- Official implementation of the paper "M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding"☆21Jan 14, 2026Updated last month
- [CVPR 2026] Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation☆55Dec 16, 2025Updated 2 months ago
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 4 months ago
- ☆41Oct 29, 2025Updated 4 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆21Dec 17, 2025Updated 2 months ago
- ☆20Dec 3, 2025Updated 3 months ago
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 3 months ago
- EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models☆74Dec 17, 2025Updated 2 months ago
- ASID-Caption: Attribute-Structured and Quality-Verified Audiovisual Instruction Dataset and Training Pipeline for Fine-Grained Video Unde…☆35Updated this week
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generation☆19Dec 17, 2025Updated 2 months ago
- [ICLR 26] Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow☆36Oct 3, 2025Updated 5 months ago
- ☆53Dec 10, 2025Updated 2 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆119Dec 17, 2025Updated 2 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderb…☆71Feb 18, 2026Updated 2 weeks ago
- ☆121Feb 28, 2026Updated last week
- StereoVLA is powered by stereo vision and supports flexible deployment with high tolerance to camera pose variations.☆53Jan 12, 2026Updated last month
- ☆68Aug 16, 2024Updated last year
- EO: Open-source Unified Embodied Foundation Model Series☆51Jan 15, 2026Updated last month
- Multi-step AI agents powered by Gemini 2.0 and the LangGraph framework. These agents orchestrate complex workflows and enhance their reas…☆10Dec 19, 2024Updated last year
- ☆21Dec 14, 2025Updated 2 months ago
- Official Repository of Native Parallel Reasoner☆102Feb 5, 2026Updated last month
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence☆87Feb 25, 2026Updated last week
- InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models☆91Feb 2, 2026Updated last month
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆69Feb 26, 2026Updated last week
- AI-native knowledge kernel for human/agent collaboration. Use it as a Knowledge Base, Wiki, Annotator, Research Tool, or Agentic Memory.☆29Updated this week
- Martingale posterior neural networks for fast sequential decision making @ Neurips 2025☆23Nov 13, 2025Updated 3 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆20Feb 5, 2026Updated last month
- Software to enable data-rich collaboration from high-resolution display walls to your laptop☆16Updated this week
- ☆93Dec 30, 2025Updated 2 months ago
- Implementation of an X86 mini OS from scratch. Reference: https://github.com/yyu/osfs00☆11Jan 9, 2023Updated 3 years ago
- ☆29Jan 15, 2026Updated last month
- Auction Theory Toolbox – Computer Verified Auctions☆14Jul 12, 2016Updated 9 years ago
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 3 months ago
- Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)☆11Sep 17, 2023Updated 2 years ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- A lightweight OAuth 2.0 Authorization Server supporting Device Authorization Grant (RFC 8628) and Authorization Code Flow with PKCE (RFC …☆32Updated this week