TRoboto / Gender-Recognition-by-VoiceLinks
Predict the speaker's gender from an audio file (Flask API included)
☆20Updated 2 years ago
Alternatives and similar repositories for Gender-Recognition-by-Voice
Users that are interested in Gender-Recognition-by-Voice are comparing it to the libraries listed below
Sorting:
- Toolbox for easy and qualitative one-shot voice conversion☆45Updated 3 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆251Updated 11 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆43Updated 3 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆125Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago
- ☆56Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS☆43Updated 7 months ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆80Updated 2 years ago
- SyntaSpeech: Syntax-aware Generative Adversarial Text-to-Speech; IJCAI 2022; Official code☆202Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated last month
- TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion☆146Updated last year
- ☆140Updated last year
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data☆70Updated 3 years ago
- ☆71Updated 2 years ago
- ☆65Updated last year
- How to use our public wav2vec2 age and gender model☆46Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.☆88Updated 2 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆127Updated 2 years ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.☆173Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 3 years ago
- An 16kHz implementation of HiFi-GAN for soft-vc.☆101Updated last year
- ☆66Updated 10 months ago
- Fine-Tune Whisper with Transformers and PEFT☆57Updated last year
- Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.☆39Updated last year
- ☆69Updated 2 years ago