☆18Mar 17, 2025Updated 11 months ago
Alternatives and similar repositories for silentcipher
Users that are interested in silentcipher are comparing it to the libraries listed below
Sorting:
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆53Updated this week
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 3 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- Official code for paper:"Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding"☆33Jan 28, 2026Updated last month
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- ☆15Apr 2, 2025Updated 11 months ago
- Implementation of algorithms for refinement of direction of arrival estimators by optimization☆16Jun 2, 2021Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆32Oct 23, 2025Updated 4 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- ☆19Jan 8, 2025Updated last year
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Event Relation in Text-to-Audio (TTA) Generation☆20Feb 26, 2025Updated last year
- ☆16Dec 31, 2021Updated 4 years ago
- ☆15May 8, 2021Updated 4 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆73Mar 17, 2025Updated 11 months ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- Calculates the Word Error Rate between two text files☆20Nov 10, 2022Updated 3 years ago
- A peer to peer machine intelligence benchmark☆27Mar 24, 2023Updated 2 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆22Jan 18, 2023Updated 3 years ago
- Demo audio of VARA-TTS model☆20Jun 11, 2021Updated 4 years ago
- Open Source Speech/Text Data on AI☆19Sep 13, 2022Updated 3 years ago
- PyTorch implementation of NVIDIA WaveGlow with constant memory cost.☆36Jan 28, 2023Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42May 29, 2019Updated 6 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- ☆19Mar 22, 2024Updated last year
- ☆17Aug 27, 2025Updated 6 months ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming soon...☆56Feb 13, 2026Updated 2 weeks ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆26Feb 22, 2024Updated 2 years ago
- ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models☆34Nov 18, 2025Updated 3 months ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- A Neural Audio Codec (NAC) for Universal Audio☆44May 30, 2025Updated 9 months ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- Repository for the ISMIR 2024 Paper "STONE: Self-supervised Tonality Estimator".☆28Oct 24, 2025Updated 4 months ago