showlab / livecc
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
β194Updated this week
Alternatives and similar repositories for livecc
Users that are interested in livecc are comparing it to the libraries listed below
Sorting:
- π₯ICLR 2025 (Spotlight) One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Promptβ260Updated 3 weeks ago
- MovieAgent: Automated Movie Generation via Multi-Agent CoT Planningβ194Updated last month
- π‘ VideoMind: A Chain-of-LoRA Agent for Long Video Reasoningβ191Updated 3 weeks ago
- β321Updated last month
- [CVPR 2025] This is an official inference code of the paper "BizGen: Advancing Article-level Visual Text Rendering for Infographics Generβ¦β253Updated last month
- [AAAI 2025] StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customizationβ211Updated last month
- All-round Creator and Editorβ217Updated 4 months ago
- [ICLR2025] DisPose: Disentangling Pose Guidance for Controllable Human Image Animationβ367Updated 3 months ago
- [CVPR 2025 Highlight] X-Dyna: Expressive Dynamic Human Image Animationβ240Updated 3 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformersβ501Updated 2 weeks ago
- The official repo for "Vidi: Large Multimodal Models for Video Understanding and Editing"β97Updated 3 weeks ago
- Official implementation of the paper "MusicInfuser: Making Video Diffusion Listen and Dance"β70Updated last month
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolutionβ180Updated last week
- Official GPU implementation of the paper "PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance"β130Updated 5 months ago
- [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"β364Updated last month
- Implementation for the paper "ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems".β164Updated 2 months ago
- AnimeGamer: Infinite Anime Life Simulation with Next Game State Predictionβ312Updated last month
- Official Implementation of Video-T1: Test-Time Scaling for Video Generationβ258Updated last month
- β68Updated 2 weeks ago
- FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generationβ427Updated 2 months ago
- ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., auβ¦β262Updated 3 weeks ago
- MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusionβ217Updated 3 weeks ago
- Light-A-Video: Training-free Video Relighting via Progressive Light Fusionβ416Updated 3 weeks ago
- β491Updated 5 months ago
- β237Updated 2 months ago
- Pusa: Thousands Timesteps Video Diffusion Modelβ166Updated 3 weeks ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequencesβ300Updated 9 months ago
- The official implementation of "MagicColor: Multi-Instance Sketch Colorization"β96Updated last month
- [ICLR 2025] Animate-X - PyTorch Implementationβ303Updated 3 months ago
- DICE-Talk is a diffusion-based emotional talking head generation method that can generate vivid and diverse emotions for speaking portraiβ¦β89Updated this week