Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations
☆180Dec 28, 2025Updated 2 months ago
Alternatives and similar repositories for SCAIL-Pose
Users who are interested in SCAIL-Pose are comparing it to the repositories listed below
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- The official UniVerse-1 code.☆122Oct 13, 2025Updated 4 months ago
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆116Dec 23, 2025Updated 2 months ago
- [ICCV 2025] Enhancing spatial understanding in text-to-Image diffusion models☆90Sep 11, 2025Updated 5 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation'☆38Dec 30, 2025Updated 2 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆117Dec 17, 2025Updated 2 months ago
- A node for ComfyUI that adjusts a latent image before the VAE decoding step in order to improve your image quality.☆35Dec 30, 2025Updated 2 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- ☆22Jan 26, 2026Updated last month
- ComfyUI wrapper for segment anything 3☆440Updated this week
- [CVPR 2026] VideoCoF: Unified Video Editing with Temporal Reasoner☆153Feb 22, 2026Updated last week
- Code and Models for the paper NeuralRemaster with Phase-Preserving Diffusion☆65Feb 6, 2026Updated 3 weeks ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆20Jul 3, 2025Updated 8 months ago
- ComfyUI custom node for generating prompts from images. Supports Qwen2.5 and Qwen3 (Instruct/Thinking) models, as well as the OpenAI API.☆24Jan 10, 2026Updated last month
- This is a VideoAsPrompt ComfyUI plugin☆20Oct 30, 2025Updated 4 months ago
- Professional video processing, scene detection, and utility nodes for ComfyUI.☆28Updated this week
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆35Jan 27, 2026Updated last month
- ☆34Jan 25, 2026Updated last month
- Official PyTorch implementation of the paper "FlowDirector: Training-Free Flow Steering for Precise Text-to-Video Editing"☆78Dec 12, 2025Updated 2 months ago
- ☆50Updated this week
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆27Feb 14, 2026Updated 2 weeks ago
- ComfyUI nodes for Step Audio EditX - State-of-the-art zero-shot voice cloning and audio editing with emotion, style, speed control, and m…☆56Dec 4, 2025Updated 2 months ago
- Pusa: Thousands Timesteps Video Diffusion Model☆671Feb 13, 2026Updated 2 weeks ago
- Krea Realtime 14B. An open-source realtime AI video model.☆497Nov 13, 2025Updated 3 months ago
- ☆197Feb 3, 2026Updated last month
- ☆130Dec 24, 2025Updated 2 months ago
- The official implementation of RealisDance☆610Jun 20, 2025Updated 8 months ago
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- Overworld's local world client interface to run Waypoint world models☆46Updated this week
- [ICLR'25] Official repository of paper: Ranking-aware adapter for text-driven image ordering with CLIP☆16Apr 17, 2025Updated 10 months ago
- ComfyUI nodes for SCAIL-Pose preprocessing☆286Dec 24, 2025Updated 2 months ago
- ☆296Feb 9, 2026Updated 3 weeks ago
- Towards Pixel-Level VLM Perception via Simple Points Prediction☆92Feb 9, 2026Updated 3 weeks ago
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing"☆76Updated this week
- Code of RealisHuman: A Two-Stage Approach for Refining Malformed Human Parts in Generated Images☆93Nov 9, 2024Updated last year
- Official repo for paper "IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning"☆39Jan 29, 2026Updated last month
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated last week