synesthesiam / voice-recorder

Simple tkinter application for recorded voice samples with text prompts

☆18

Alternatives and similar repositories for voice-recorder

Users that are interested in voice-recorder are comparing it to the libraries listed below

Sorting:

jhasegaw / phonecodes
python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.
☆34Updated last year
jumon / whisper-punctuator
Zero-shot multimodal punctuation insertion and truecasing using Whisper
☆114Updated 2 years ago
keonlee9420 / STYLER
Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…
☆160Updated 3 years ago
hollygrimm / voice-dataset-creation
Tools to create your own voice dataset for TTS training
☆66Updated 4 years ago
zkx06111 / WSRGlow
The official implementation of the Interspeech 2021 paper WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution.
☆126Updated 3 years ago
rhasspy / espeak-phonemizer
Uses ctypes and libespeak-ng to transform test into IPA phonemes
☆20Updated last year
tarepan / VoiceConversionLab
Collect Voice Conversion researches
☆93Updated this week
rhasspy / gruut-ipa
Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)
☆91Updated last year
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆149Updated last year
spring-media / DeepForcedAligner
☆80Updated 11 months ago
CODEJIN / HiFiSinger
☆110Updated 3 years ago
xinjli / transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆160Updated last year
backspacetg / simul_whisper
Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection
☆62Updated last month
yxlllc / pc-ddsp
Pitch Controllable DDSP Vocoders
☆73Updated 6 months ago
haoheliu / voicefixer_main
General Speech Restoration
☆277Updated last year
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆137Updated last year
rishikksh20 / hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
☆214Updated 4 years ago
resemble-ai / monotonic_align
Monotonic Alignment Search
☆91Updated 2 years ago
maxrmorrison / pyfoal
Python forced alignment
☆89Updated last year
anyvoiceai / Barkify
Barkify: an unoffical training implementation of Bark TTS by suno-ai
☆129Updated last year
rishikksh20 / HiFi-GAN
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆83Updated last year
b04901014 / UUVC
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆78Updated 2 years ago
dhchoi99 / NANSY
☆171Updated 2 years ago
Shb742 / rnnoise_python
python wrapper for rnnoise library
☆48Updated 2 years ago
rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15Updated last year
ebadawy / voice_conversion
☆130Updated 2 years ago
roedoejet / g2p
Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!
☆162Updated this week
RF5 / simple-speaker-embedding
A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
☆88Updated last month
yl4579 / PitchExtractor
Deep Neural Pitch Extractor for Voice Conversion and TTS Training
☆123Updated 2 years ago
0913ktg / SC_VALL-E
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
☆138Updated 6 months ago