synesthesiam / voice-recorder
Simple tkinter application for recording voice samples with text prompts
☆18Updated last year
Alternatives and similar repositories for voice-recorder
Users who are interested in voice-recorder are comparing it to the repositories listed below
- Python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆34Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆160Updated 3 years ago
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.☆126Updated 3 years ago
- Uses ctypes and libespeak-ng to transform text into IPA phonemes☆20Updated last year
- A collection of voice conversion research☆93Updated this week
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆91Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- ☆80Updated 11 months ago
- ☆110Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆160Updated last year
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆62Updated last month
- Pitch Controllable DDSP Vocoders☆73Updated 6 months ago
- General Speech Restoration☆277Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks☆214Updated 4 years ago
- Monotonic Alignment Search☆91Updated 2 years ago
- Python forced alignment☆89Updated last year
- Barkify: an unofficial training implementation of Bark TTS by suno-ai☆129Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆83Updated last year
- Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…☆78Updated 2 years ago
- ☆171Updated 2 years ago
- Python wrapper for the rnnoise library☆48Updated 2 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- ☆130Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆162Updated this week
- A speaker embedding network in PyTorch that is very quick to set up and use for any purpose.☆88Updated last month
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆123Updated 2 years ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆138Updated 6 months ago