AudioFans/audidata

☆21

Alternatives and similar repositories for audidata

Users who are interested in audidata are comparing it to the libraries listed below.

qiuqiangkong / mini_music_tagging
☆13 · Jul 14, 2024 · Updated 2 years ago
qiuqiangkong / mini_llm
☆29 · Jul 4, 2025 · Updated last year
qiuqiangkong / audioflow
☆128 · Updated this week
qiuqiangkong / music_source_separation
☆60 · Jun 15, 2026 · Updated last month
qiuqiangkong / music_llm
☆56 · Jul 13, 2025 · Updated last year
IsaacYQH / WildFX
Official implementation of WildFX Dataset Generating pipeline.
☆20 · Oct 21, 2025 · Updated 9 months ago
qiuqiangkong / materials_for_students
☆16 · Aug 10, 2025 · Updated 11 months ago
yongyizang / music-source-restoration
Official Repository for "Music Source Restoration"
☆31 · Jun 1, 2025 · Updated last year
Xia-aaa / L3former
☆14 · Jun 26, 2025 · Updated last year
iver56 / loudness
The world's fastest Python package for calculating integrated loudness (LUFS) from audio data as NumPy arrays
☆31 · Dec 26, 2025 · Updated 6 months ago
Tayjsl97 / RL-Chord
This is the official implementation of RL-Chord (TNNLS).
☆13 · Jan 2, 2024 · Updated 2 years ago
inclusionAI / AudioMCQ
[ICLR 2026] AudioMCQ: A 571k audio multiple-choice question dataset for post-training Large Audio Language Models with dual CoT annotatio…
☆51 · Apr 21, 2026 · Updated 3 months ago
Eps-Acoustic-Revolution-Lab / EAR_HEAR
☆15 · Jan 9, 2026 · Updated 6 months ago
JusperLee / Look2hear
A toolkit for researchers in the multimodal sound separation.
☆16 · Oct 20, 2023 · Updated 2 years ago
Naozumi520 / g2pW-Cantonese
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆15 · Dec 10, 2024 · Updated last year
zeyuxie29 / SemanticVocoder
☆28 · Apr 6, 2026 · Updated 3 months ago
zhenye234 / Talker-T2AV
ACM MM 2026 Talker-T2AV Joint Talking Audio-Video Generation with Autoregressive Diffusion Modeling
☆77 · May 24, 2026 · Updated last month
RayYuki / CodecBench
☆24 · Nov 16, 2025 · Updated 8 months ago
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year
tencent-ailab / MuCodec
☆169 · Nov 22, 2024 · Updated last year
MorenoLaQuatra / ARCH
ARCH: Audio Representations benCHmark
☆57 · Aug 26, 2024 · Updated last year
facebookresearch / learning-audio-visual-dereverberation
Code for paper Learning Audio-Visual Dereverberation
☆32 · Aug 10, 2022 · Updated 3 years ago
iphysresearch / CQT_toolbox_python
Constant-Q Transform Toolbox for Python/MATLAB
☆39 · Dec 21, 2020 · Updated 5 years ago
yukara-ikemiya / friendly-stable-audio-tools
Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…
☆220 · Jul 25, 2024 · Updated last year
NIRALUser / DTIPlayground
An integrated framework for DWI Image QC and processing
☆13 · Mar 9, 2026 · Updated 4 months ago
qiuqiangkong / audio_understanding
☆131 · Feb 6, 2025 · Updated last year
Sreyan88 / ReCLAP
☆33 · Dec 23, 2025 · Updated 7 months ago
wuzhiyue111 / Codec-Evaluation
☆50 · Apr 5, 2026 · Updated 3 months ago
facebookresearch / rlr-audio-propagation
Audio propagation engine - Meta Reality Labs Research.
☆24 · Nov 1, 2022 · Updated 3 years ago
zengchang233 / xiaoicesing2
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49 · Jan 15, 2024 · Updated 2 years ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20 · May 12, 2023 · Updated 3 years ago
hmartelb / NSynth-MIDI-Renderer
Sample based concatenative synthesizer for the NSynth dataset. Render any MIDI (.mid) sequence with the notes of NSynth.
☆12 · Oct 4, 2023 · Updated 2 years ago
Grace9994 / CoMoSVC
CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone
☆148 · Mar 23, 2024 · Updated 2 years ago
zhenye234 / CoMoSpeech
ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model
☆214 · Apr 26, 2024 · Updated 2 years ago
yzGuu830 / efficient-speech-codec
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126 · Mar 20, 2025 · Updated last year
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆22 · Aug 6, 2023 · Updated 2 years ago
arxrean / LipRead-seq2seq
An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.
☆10 · May 13, 2020 · Updated 6 years ago
spkgyk / TDFNet
Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023
☆14 · Mar 17, 2024 · Updated 2 years ago