bilibili / Index-anisora
☆2,362Updated last month
Alternatives and similar repositories for Index-anisora
Users that are interested in Index-anisora are comparing it to the libraries listed below
- HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation☆1,204Updated 3 months ago
- [NeurIPS 2025] Image editing is worth a single LoRA! 0.1% training data for fantastic image editing! Surpasses GPT-4o in ID persistence~ …☆2,074Updated last month
- SkyReels V1: The first and most advanced open-source human-centric video foundation model☆2,643Updated 10 months ago
- CogView4, CogView3-Plus and CogView3(ECCV 2024)☆1,104Updated 10 months ago
- Official implementations for paper: Zero-shot Image Editing with Reference Imitation☆1,305Updated last year
- ☆1,046Updated 8 months ago
- TurboDiffusion: 100–200× Acceleration for Video Diffusion Models☆3,298Updated last week
- Light Image Video Generation Inference Framework☆1,897Updated this week
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo☆1,776Updated 8 months ago
- [NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation☆2,791Updated last month
- HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation☆2,762Updated last week
- ☆2,019Updated last month
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,132Updated last month
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,348Updated 4 months ago
- GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.☆222Updated 7 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,638Updated 7 months ago
- Official PyTorch implementation of One-Minute Video Generation with Test-Time Training☆2,363Updated 8 months ago
- Understand Human Behavior to Align True Needs☆4,058Updated 5 months ago
- [NeurIPS 2025] OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication☆417Updated 4 months ago
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment☆1,477Updated 4 months ago
- Sonic is the method from 'Shifting Focus to Global Audio Perception in Portrait Animation'; you can use it in ComfyUI☆1,123Updated 4 months ago
- An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai☆908Updated last month
- Implementation of [CVPR 2025] "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆892Updated last year
- ☆3,167Updated 10 months ago
- ☆566Updated 4 months ago
- [ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis☆1,617Updated last week
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"(ICCV2025)☆1,713Updated 6 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,604Updated 3 months ago
- ☆1,620Updated 3 weeks ago
- A script for automatic batch emotion labeling of audio, based on Emotion2Vec☆493Updated 10 months ago