lourson1091/audiobertscore

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lourson1091/audiobertscore)

lourson1091 / audiobertscore

☆15

Alternatives and similar repositories for audiobertscore

Users that are interested in audiobertscore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ozspeech / OZSpeech
View on GitHub
[ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching
☆45Feb 9, 2025Updated last year
BUTSpeechFIT / TS_SUPERB
View on GitHub
☆16Apr 2, 2025Updated last year
hlt-mt / Speech-MASSIVE
View on GitHub
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25Oct 8, 2025Updated 9 months ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Takaaki-Saeki / DiscreteSpeechMetrics
View on GitHub
Reference-aware automatic speech evaluation toolkit
☆185Dec 5, 2024Updated last year
uthree / auris_experimental_vits_dsp
View on GitHub
AI based singing voice synthesis
☆37Jun 10, 2024Updated 2 years ago
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆26Mar 19, 2025Updated last year
omine-me / LaughterSegmentation
View on GitHub
2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…
☆66Sep 1, 2024Updated last year
YuanGongND / llm_speech_emotion_challenge
View on GitHub
☆23Jun 24, 2024Updated 2 years ago
NUS-HPC-AI-Lab / MoST
View on GitHub
MoST: Mixing Speech and Text with Modality-Aware Mixture of Experts
☆33Jan 15, 2026Updated 6 months ago
tonnetonne814 / PL-Bert-VITS2
View on GitHub
VITS2 using Phoneme-Level Japanese BERT
☆14Dec 17, 2023Updated 2 years ago
alexisdmacintyre / SpeechBreathingToolbox
View on GitHub
Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.
☆11Feb 17, 2024Updated 2 years ago
tonnetonne814 / WhisperLive-PEFT
View on GitHub
Whisper系列のPEFTと、PEFT済のモデルを使ったストリーミング書き起こしを実装するためのリポジトリです。
☆15Oct 16, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
adelacvg / detail_tts
View on GitHub
All generative model in one for better TTS model
☆74Sep 8, 2024Updated last year
sarulab-speech / whisper-asr-finetune
View on GitHub
☆32Dec 4, 2022Updated 3 years ago
haoweilou / ParaStyleTTS
View on GitHub
This is the official code for ACM CIKM 2025 Paper: ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive …
☆59Dec 21, 2025Updated 7 months ago
mbrotos / SoundSeg
View on GitHub
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
☆13Feb 18, 2026Updated 5 months ago
sarulab-speech / UTMOS22
View on GitHub
UT-Sarulab MOS prediction system using SSL models
☆308Apr 11, 2024Updated 2 years ago
leto19 / WhiSQA
View on GitHub
Whisper Speech Quality Assessment (WhiSQA)
☆16Apr 14, 2026Updated 3 months ago
YangXusheng-yxs / CodecFormer_5Hz
View on GitHub
☆35Oct 23, 2025Updated 9 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lavendery / UUG
View on GitHub
☆21Sep 14, 2025Updated 10 months ago
VOICEVOX / kanalizer
View on GitHub
英単語から読みを推測するライブラリ。
☆29May 4, 2026Updated 2 months ago
FreedomIntelligence / ExpressiveSpeech
View on GitHub
☆24Updated this week
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 3 years ago
vara-tts / VARA-TTS
View on GitHub
Demo audio of VARA-TTS model
☆20Jun 11, 2021Updated 5 years ago
levtelyatnikov / radiomixer
View on GitHub
radiomixer
☆14Feb 16, 2022Updated 4 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
X-LANCE / LSCodec-Inference
View on GitHub
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36Oct 23, 2025Updated 9 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Deep-unlearning / Finetune-Parakeet
View on GitHub
☆25Oct 22, 2025Updated 9 months ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
ayutaz / uni-llm-voice-chat
View on GitHub
This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library
☆22Mar 5, 2024Updated 2 years ago
ICDM-UESTC / COSE
View on GitHub
The implementation of Paper: Compose Yourself: Average-Velocity Flow Matching for One-Step Speech Enhancement.
☆16Sep 23, 2025Updated 10 months ago
hi-paris / CosyVoice2-EU
View on GitHub
Europeanized CosyVoice2 for French & German
☆17Mar 30, 2026Updated 3 months ago
HappyColor / DrawSpeech_PyTorch
View on GitHub
☆25Nov 25, 2025Updated 8 months ago
unilight / jatts
View on GitHub
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Mar 13, 2026Updated 4 months ago