Anvarjon / Age-Gender-Classification
Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Multi-Attention Module through Speech Spectrograms". This implementation is for the Common Voice dataset. But it can be adjusted to any custom dataset.
☆19Updated 10 months ago
Alternatives and similar repositories for Age-Gender-Classification:
Users that are interested in Age-Gender-Classification are comparing it to the libraries listed below
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Updated 4 months ago
- CTC decoder with hotwords for ASR.☆12Updated 3 weeks ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆10Updated last week
- silero-vad pytorch implement☆12Updated last month
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆16Updated last year
- ☆19Updated 3 months ago
- (WIP)long form speech generatoins☆29Updated last month
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆17Updated last month
- real-time speech enhance☆12Updated 11 months ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- noise reduction☆17Updated 6 months ago
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆16Updated 3 weeks ago
- A simple command line tool to calculate WER for ASR.☆14Updated 3 months ago
- text to speech☆10Updated 10 months ago
- Pytorch Models for Speech Enhancement☆16Updated last year
- ☆20Updated 5 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated 9 months ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆24Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆25Updated 6 months ago
- End-to-End SpeechSynthesis system with fastspeech2 & hifigan☆13Updated 2 years ago
- The source code for the paper CrossSinger (asru2023)☆18Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 6 months ago
- ☆19Updated 9 months ago
- Aligner for text-to-speech☆15Updated 6 months ago
- TTS Text Analyzer☆32Updated last year
- ☆10Updated last year