moonshine-ai / moonshine-v2View external linksLinks
β26Feb 10, 2026Updated last week
Alternatives and similar repositories for moonshine-v2
Users that are interested in moonshine-v2 are comparing it to the libraries listed below
Sorting:
- πΉ pyannote + π notebook = pyannotebookβ26Jun 12, 2023Updated 2 years ago
- eSNN - Learning similarity measure from dataβ12Nov 28, 2019Updated 6 years ago
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. β¦β13Dec 4, 2024Updated last year
- Testing sets for semanticVADβ20Feb 18, 2025Updated 11 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequβ¦β28Sep 20, 2025Updated 4 months ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"β33Jan 28, 2026Updated 2 weeks ago
- β17Apr 28, 2021Updated 4 years ago
- β14Jun 12, 2015Updated 10 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phonemeβ¦β23Aug 14, 2025Updated 6 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementβ16Sep 13, 2024Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Mar 6, 2023Updated 2 years ago
- This repository contains the training code from paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Withoutβ¦β47Feb 4, 2026Updated last week
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorchβ17Mar 11, 2022Updated 3 years ago
- β20Mar 7, 2025Updated 11 months ago
- β17Apr 14, 2023Updated 2 years ago
- This repository includes training, inference, evaluation, and utility scripts developed for fine-tuning the Whisper medium.en model on Aiβ¦β25Oct 9, 2024Updated last year
- speaker-disentangled speech linguistic content quantizerβ24Mar 19, 2025Updated 10 months ago
- Toolbox for Evaluation of AEC/AES Systemsβ32Jun 9, 2025Updated 8 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.β26Jun 1, 2023Updated 2 years ago
- Temporary anonymous versionβ22Mar 20, 2024Updated last year
- Finally, some decent sample sentencesβ23Dec 3, 2023Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- β33Nov 27, 2021Updated 4 years ago
- β24Sep 20, 2024Updated last year
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]β26Jul 16, 2021Updated 4 years ago
- β25Sep 19, 2025Updated 4 months ago
- β37Sep 21, 2025Updated 4 months ago
- 24-hour Automatic Speech Recognitionβ27Jun 4, 2021Updated 4 years ago
- TTS Text Analyzerβ32Jul 20, 2023Updated 2 years ago
- SpeechGateway - A reverse proxy server that enhances speech synthesis with essential, extensible features. π¦π¬β31Feb 8, 2026Updated last week
- Da - ECHO - RetrievAl - daTasEtβ34Jul 7, 2024Updated last year
- Zero-Shot Foreign Accent Conversion without a Native Referenceβ36May 1, 2024Updated last year
- Compute useful transcriptions metrics (CER, WER, SER, ...)β27Nov 20, 2014Updated 11 years ago
- β32Jul 27, 2022Updated 3 years ago
- A Python package of the dynamic compressive gammachirp filterbank (dcGC-FB)β31May 14, 2024Updated last year
- Token-Level Supervised Contrastive Learning for Punctuation Restorationβ29Sep 8, 2021Updated 4 years ago
- My vocoder experimentsβ31Jul 26, 2025Updated 6 months ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one β¦β40Mar 13, 2024Updated last year
- Soprano-Factory: Train your own 2000x realtime text-to-speech modelβ206Jan 13, 2026Updated last month