☆15Nov 10, 2025Updated 5 months ago
Alternatives and similar repositories for audiobertscore
Users that are interested in audiobertscore are comparing it to the libraries listed below.
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆15Apr 2, 2025Updated last year
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 6 months ago
- ☆30Apr 29, 2026Updated last week
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆95Apr 3, 2026Updated last month
- An attempt to replicate the architecture of MiniMaxTTS described in its technical report☆47Sep 2, 2025Updated 8 months ago
- Reference-aware automatic speech evaluation toolkit☆182Dec 5, 2024Updated last year
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Apr 20, 2026Updated 2 weeks ago
- AI based singing voice synthesis☆37Jun 10, 2024Updated last year
- ☆23Jun 24, 2024Updated last year
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆64Sep 1, 2024Updated last year
- A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith…☆30Nov 15, 2025Updated 5 months ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 6 months ago
- Text-to-Speech Benchmark☆24Apr 2, 2026Updated last month
- All generative models in one for a better TTS model☆74Sep 8, 2024Updated last year
- Distillation of Self-Supervised Representation-Based Speech Quality Assessment☆44May 15, 2025Updated 11 months ago
- GhostSuite (Official Codebase for "Data Shapley in One Training Run", ICLR'25)☆35Jan 16, 2026Updated 3 months ago
- ☆32Dec 4, 2022Updated 3 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆93Nov 24, 2025Updated 5 months ago
- UT-Sarulab MOS prediction system using SSL models☆298Apr 11, 2024Updated 2 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated 2 months ago
- Official PyTorch implementation of 'Rec-RIR: Monaural Blind Room Impulse Response Identification via DNN-based Reverberant Speech Reconst…☆33Dec 25, 2025Updated 4 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆65Sep 22, 2025Updated 7 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆43Mar 13, 2026Updated last month
- This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library☆22Mar 5, 2024Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆12Nov 7, 2024Updated last year
- A tool for aligning phoneme labels to Japanese speech audio☆39Aug 19, 2025Updated 8 months ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- ☆40Jul 15, 2025Updated 9 months ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 11 months ago
- ☆47Apr 16, 2023Updated 3 years ago