laitselec/MuFun

README badge preview

If you own this repo, copy the snippet below and add it to your README.md:

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/laitselec/MuFun)

laitselec / MuFun

☆37

Alternatives and similar repositories for MuFun

Users that are interested in MuFun are comparing it to the libraries listed below.

Exgc / OmniSep
Sound Separation, Omni modal
☆29 · Sep 15, 2025 · Updated 10 months ago
kyutai-labs / tts_longeval
☆30 · Apr 29, 2026 · Updated 2 months ago
liutaocode / AwesomeDiarizationDataset
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16 · Feb 22, 2023 · Updated 3 years ago
wsntxxn / UniFlow-Audio
☆72 · Updated this week
facebookresearch / dacvae
DACVAE
☆226 · Dec 22, 2025 · Updated 6 months ago
NieeiM / Dasheng-Audiogen
Generate a complete audio clip with music, intelligible speech, and sound effects from text in one pass.
☆44 · May 27, 2026 · Updated last month
yangdongchao / ALMTokenizer
The demo page for ALMTokenizer
☆59 · Apr 14, 2025 · Updated last year
streichgeorg / autosing
☆18 · Jan 20, 2025 · Updated last year
Ruiqi-Yan / Awesome-Audio-Editing
A curated list of models, benchmarks, tools and guides for audio editing
☆34 · Jul 7, 2026 · Updated 2 weeks ago
ASLP-lab / SongEval
A song aesthetic evaluation toolkit trained on SongEval.
☆314 · Apr 8, 2026 · Updated 3 months ago
sarulab-speech / DuplexChat
☆46 · Jul 5, 2026 · Updated 2 weeks ago
XiaomiMiMo / MiMo-Audio-Training
☆109 · Oct 16, 2025 · Updated 9 months ago
arielshaulov / TokenTrim
Official implementation of the paper "TOKENTRIM: INFERENCE-TIME TOKEN PRUNING FOR AUTOREGRESSIVE LONG VIDEO GENERATION"
☆15 · Feb 8, 2026 · Updated 5 months ago
smulelabs / windowed-roformer
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45 · Oct 30, 2025 · Updated 8 months ago
sony / mmaudiosep
☆16 · Apr 30, 2026 · Updated 2 months ago
yuhui1038 / Muse
ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
☆119 · Apr 11, 2026 · Updated 3 months ago
alibaba / vstyle
☆34 · Sep 15, 2025 · Updated 10 months ago
sii-research / OpenMOSS
OpenMOSS presents a collection of our research on LLMs, supported by SII, Fudan and Mosi.
☆30 · Updated this week
malradhi / PACodec
[ICASSP 2026] Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"
☆27 · Jan 22, 2026 · Updated 5 months ago
IDEA-Emdoor-Lab / DistilCodec
A Neural Audio Codec (NAC) for Universal Audio
☆46 · May 30, 2025 · Updated last year
FunAudioLLM / CV3-Eval
☆187 · Aug 25, 2025 · Updated 10 months ago
b04901014 / vae-gslm
Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models
☆24 · Jun 18, 2025 · Updated last year
JethroWangSir / SincQDR-VAD
☆26 · Aug 29, 2025 · Updated 10 months ago
kandinskylab / kvae-audio
KVAE-Audio: a continuous full-band audio waveform autoencoder
☆99 · Jun 30, 2026 · Updated 3 weeks ago
KhanhNguyen4999 / Speech-Enhancement-CLSKD
Cross-Layer Similarity Knowledge Distillation for Speech Enhancement
☆11 · Jun 22, 2023 · Updated 3 years ago
opendilab / HH-Codec
[ICML 2025 Tokenization Workshop] HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling
☆106 · Sep 28, 2025 · Updated 9 months ago
lmxue / NVV-SuperBench
NVV-SuperBench: Beyond Words, Beyond Quality—Benchmarking Nonverbal Vocalizations in Speech Generation (Interspeech 2026 long paper)
☆18 · Jun 21, 2026 · Updated last month
wenet-e2e / west
We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction
☆206 · Updated this week
dieKarotte / ASAudio
☆59 · Oct 19, 2025 · Updated 9 months ago
VisionChengzhuo / CoF-T2I
Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.
☆39 · Jan 16, 2026 · Updated 6 months ago
smartyfh / CMF-CTF
Outlier-Resilient Web Service QoS Prediction
☆10 · Feb 7, 2021 · Updated 5 years ago
griko / vanpy
☆19 · Jul 23, 2025 · Updated 11 months ago
FreedomIntelligence / EchoX
EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs
☆47 · Sep 19, 2025 · Updated 10 months ago
TianyuFan0504 / awesome-spatio-temporal-graph
This repository contains a list of papers on spatio-temporal graph, especially about GNNs on S-T graph.
☆18 · Sep 8, 2023 · Updated 2 years ago
SprocketLab / sparse_matrix_fine_tuning
Official repository for ICML 2024 paper "MoRe Fine-Tuning with 10x Fewer Parameters"
☆22 · Oct 14, 2025 · Updated 9 months ago
tencent-ailab / MuQ
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆357 · Aug 4, 2025 · Updated 11 months ago
yakovmon / Real-Time-Audio-Visual-Speech-Enhancement
☆13 · May 27, 2019 · Updated 7 years ago
kaistmm / VoiceDiT
[ICASSP2025] Official code for VoiceDiT: Dual-Condition Diffusion Transformer for Environment-Aware Speech Synthesis
☆52 · Apr 9, 2025 · Updated last year
Meirtz / BabyBLUE-llm
[COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…
☆12 · Jul 26, 2024 · Updated last year