xjuspeech/YOLOPitch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xjuspeech/YOLOPitch)

xjuspeech / YOLOPitch

☆10

Alternatives and similar repositories for YOLOPitch

Users that are interested in YOLOPitch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
malradhi / PACodec
View on GitHub
[ICASSP 2026]Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"
☆27Jan 22, 2026Updated 5 months ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
hs-oh-prml / ComVo
View on GitHub
[ICLR 2026] Official implementation of Toward Complex-Valued Neural Networks for Waveform Generation
☆20Apr 10, 2026Updated 3 months ago
Woo-jin-Chung / MF-PAM_mfpam_pitch_estimation_pytorch
View on GitHub
☆16Sep 17, 2025Updated 10 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
WX-Wei / HarmoF0
View on GitHub
☆108Aug 23, 2024Updated last year
alobashev / mkl-vc
View on GitHub
[Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"
☆45Sep 24, 2025Updated 9 months ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
apple / ml-acn-embed
View on GitHub
Acoustic Neighbor Embeddings
☆33Jul 13, 2025Updated last year
weAreMusicAI / dmx-diffusion
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
ETH-DISCO / discoder
View on GitHub
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42Feb 24, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lonzi / mrflow_dpo
View on GitHub
☆22Jan 3, 2026Updated 6 months ago
ETH-DISCO / audio-atlas
View on GitHub
☆15Feb 6, 2026Updated 5 months ago
seongho608 / RingFormer
View on GitHub
☆52Jun 24, 2025Updated last year
vtuber-plan / hifi-gan
View on GitHub
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32Apr 10, 2023Updated 3 years ago
liuhuang31 / HiFTNet-sr
View on GitHub
HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz
☆24Jan 2, 2024Updated 2 years ago
ASLP-lab / FlashTTS
View on GitHub
Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation
☆63Jun 16, 2026Updated last month
felixperfler / Stable-Hybrid-Auditory-Filterbanks
View on GitHub
[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
☆43Jul 25, 2025Updated 11 months ago
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
Louis0324 / DDSP-Articulatory-Vocoder
View on GitHub
☆29Sep 5, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
crlandsc / torch-l1-snr
View on GitHub
Variations of L1 SNR Loss function for training audio source separation machine learning models
☆45May 1, 2026Updated 2 months ago
woongzip1 / UniverSR
View on GitHub
Official implemtation of UniverSR (ICASSP 2026)
☆59Apr 9, 2026Updated 3 months ago
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
yqzhishen / onnxcrepe
View on GitHub
ONNX deployment of the CREPE pitch tracker
☆27Oct 27, 2022Updated 3 years ago
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
chomeyama / HN-UnifiedSourceFilterGAN
View on GitHub
☆88Nov 1, 2022Updated 3 years ago
merlresearch / reverberation-as-supervision
View on GitHub
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15Aug 1, 2024Updated last year
Auroraaa86 / LCS-CTC
View on GitHub
For IEEE ASRU(2025)
☆15Jun 21, 2025Updated last year
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 8 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
shreybatra / Speech-To-Code
View on GitHub
Speech to Code service, built for Microsoft AI Hackathon.
☆13Dec 8, 2022Updated 3 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
apple / ml-omni-router-moe-asr
View on GitHub
☆18Oct 24, 2025Updated 8 months ago
thelinhbkhn2014 / Text2PhonemeSequence
View on GitHub
☆53Aug 28, 2024Updated last year
tencent-ailab / MuCodec
View on GitHub
☆168Nov 22, 2024Updated last year
ZhangXinWhut / SimWhisper-Codec
View on GitHub
Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"
☆37Jan 28, 2026Updated 5 months ago
SonyCSLParis / codicodec
View on GitHub
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121Nov 25, 2025Updated 7 months ago