☆35Aug 31, 2025Updated 6 months ago
Alternatives and similar repositories for MuFun
Users that are interested in MuFun are comparing it to the libraries listed below
Sorting:
- Both audio-only and audio-visual speaker diarization datasets are listed here.☆14Feb 22, 2023Updated 3 years ago
- ☆97Oct 16, 2025Updated 4 months ago
- Controlnet module for Wan2.2☆42Oct 30, 2025Updated 4 months ago
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- Official implementation of Progressive Detail Injection for Training-Free Semantic Binding in Text-to-Image Generation☆32Aug 3, 2025Updated 6 months ago
- ☆37May 28, 2025Updated 9 months ago
- Extend the Conditioning of Stable Diffusion to take Audio Embeddings Instead of Text Embeddings using Wav2Vec2-BERT model☆13Sep 25, 2024Updated last year
- AnyEnhance-based Baseline for the CCF-AATC 2025 Challenge Track 1☆44Dec 27, 2025Updated 2 months ago
- CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"☆75Feb 22, 2026Updated last week
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Jan 19, 2024Updated 2 years ago
- Fork of ACE-Step v1.0 for LoRA training with < 10 GB VRAM☆65Feb 3, 2026Updated 3 weeks ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 2 months ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆22Jan 19, 2026Updated last month
- ☆40Apr 2, 2025Updated 10 months ago
- ☆11Oct 31, 2024Updated last year
- AdvSV stands as the first dataset developed specifically for evaluating Speaker Verification (SV) systems against adversarial attacks. I…☆11Nov 21, 2023Updated 2 years ago
- Ace-Step Dataset Generator☆23Sep 27, 2025Updated 5 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆23Nov 29, 2025Updated 3 months ago
- ☆15Mar 11, 2025Updated 11 months ago
- ☆43Dec 1, 2025Updated 3 months ago
- ComfyUI workflows to create smooth transitions between video clips using Wan VACE. Works with video from any model or other source-LTX-2,…☆31Feb 10, 2026Updated 2 weeks ago
- The demo page for ALMTokenizer☆59Apr 14, 2025Updated 10 months ago
- Kakao Mobility MCP Server for directions and transit information☆10Sep 14, 2025Updated 5 months ago
- Noise supression using deep filtering☆13May 31, 2022Updated 3 years ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆21Feb 11, 2026Updated 2 weeks ago
- ☆39Oct 29, 2025Updated 4 months ago