TRoboto / Gender-Recognition-by-Voice
Predict the speaker's gender from an audio file (Flask API included)
☆20Updated 2 years ago
Alternatives and similar repositories for Gender-Recognition-by-Voice
Users that are interested in Gender-Recognition-by-Voice are comparing it to the libraries listed below
Sorting:
- How to use our public wav2vec2 age and gender model☆40Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆158Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆51Updated 10 months ago
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- Phoneme alignment representation compatible with multiple forced aligners☆21Updated last year
- asr2k☆50Updated 11 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆51Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 2 years ago
- A Tiny Project For ASR model training and Deployment☆27Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆41Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- ☆140Updated last year
- ☆66Updated 8 months ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- ☆38Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- ☆56Updated 2 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆59Updated 4 years ago
- Python forced alignment☆89Updated last year
- ☆64Updated last year
- ☆71Updated last year
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆193Updated 2 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆83Updated 2 years ago
- ☆43Updated 11 months ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Updated 3 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆129Updated last year
- ☆57Updated 10 months ago