lourson1091 / audiobertscore
☆15Nov 10, 2025Updated 3 months ago
Alternatives and similar repositories for audiobertscore
Users who are interested in audiobertscore are comparing it to the libraries listed below.
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- ☆15Apr 2, 2025Updated 10 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated 10 months ago
- AI based singing voice synthesis☆37Jun 10, 2024Updated last year
- Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" …☆10Nov 6, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- ☆22Jun 24, 2024Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- Kanade is a single-layer disentangled speech tokenizer that extracts compact tokens suitable for both generative and discriminative model…☆74Feb 3, 2026Updated last week
- ☆11Nov 7, 2024Updated last year
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- text to speech☆10Mar 19, 2024Updated last year
- ☆10Sep 2, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Non Parallel Voice Conversion based on VITS☆24Mar 31, 2023Updated 2 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆91Nov 24, 2025Updated 2 months ago
- ☆14Aug 1, 2025Updated 6 months ago
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- ☆23Dec 6, 2025Updated 2 months ago
- A toolkit for benchmarking on a wide variety of audio deepfake datasets.☆29Oct 9, 2025Updated 4 months ago
- ☆15Nov 11, 2024Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 5 months ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated last year
- ☆27Aug 10, 2024Updated last year
- ☆15Jun 22, 2025Updated 7 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆14Jun 28, 2024Updated last year
- ☆11Oct 14, 2023Updated 2 years ago
- 2024 Latest laughter detection & segmentation model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆62Sep 1, 2024Updated last year
- A tool for aligning phoneme labels to Japanese speech☆36Aug 19, 2025Updated 5 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion, adapted to support a 44100 Hz Japanese HuBERT☆16May 21, 2023Updated 2 years ago
- ☆15Mar 31, 2025Updated 10 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 8 months ago
- ☆30Jan 22, 2026Updated 3 weeks ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Dec 12, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 3 years ago