TRoboto / Gender-Recognition-by-VoiceLinks
Predict the speaker's gender from an audio file (Flask API included)
☆20Updated 2 years ago
Alternatives and similar repositories for Gender-Recognition-by-Voice
Users that are interested in Gender-Recognition-by-Voice are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆64Updated last month
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆130Updated 2 years ago
- ☆55Updated 3 years ago
- Finetuning VITS Efficiently☆33Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆52Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆24Updated 4 years ago
- Your one-stop solution for voice dataset creation☆128Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆56Updated 7 months ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.☆50Updated 2 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆98Updated 7 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations☆184Updated last year
- ☆70Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 4 years ago
- Datasets of A Deep Convolutional Neural Network Based Virtual Elderly Companion Agent.☆37Updated 7 years ago
- ☆45Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- A framework for automatic speech recognition☆51Updated 2 years ago
- Toolbox for easy and qualitative one-shot voice conversion☆46Updated 4 years ago
- ☆25Updated 2 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆203Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.☆58Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆129Updated 2 years ago
- ☆71Updated 2 years ago