♂️♀️ Detect a person's gender from a voice file (90.7% +/- 1.3% accuracy).
☆91Jun 17, 2024Updated last year
Alternatives and similar repositories for voice_gender_detection
Users that are interested in voice_gender_detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🗣️ A book and repo to get you started programming voice computing applications in Python (10 chapters and 200+ scripts).☆388Dec 8, 2022Updated 3 years ago
- 📊 Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes).☆30Jun 17, 2024Updated last year
- Voice Gender Recognition/ Detection/ Identification using Deep Neural Networks and Machine Learning.☆22Sep 2, 2020Updated 5 years ago
- Singing voice detection☆15Aug 28, 2018Updated 7 years ago
- 👀 An all-purpose eye tracking web application and API for Alzheimer's disease research (3 tasks, <3 mins). 1st place in the 2021 CNT hac…☆13Jun 17, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆97May 30, 2020Updated 5 years ago
- Mainly on text documents. Implemented a Mini Search Engine using different algorithms and then summaried documents using lexrank.☆11Jan 19, 2018Updated 8 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- ☆13Dec 7, 2022Updated 3 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Oct 19, 2025Updated 6 months ago
- ☆15Sep 6, 2021Updated 4 years ago
- 🐍🐳🐘 A python command line interface for DigitalOcean postgres clusters (5+ integrations).☆13Nov 7, 2022Updated 3 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Nov 18, 2021Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…☆11Jul 7, 2022Updated 3 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆36Aug 8, 2023Updated 2 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,155Jun 6, 2024Updated last year
- A implementation of Power Normalized Cepstral Coefficients: PNCC☆54Aug 11, 2019Updated 6 years ago
- Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2☆129Apr 25, 2023Updated 2 years ago
- 🔊 extract runescape classic sounds from cache to wav (and vice versa)☆13Aug 2, 2022Updated 3 years ago
- Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)☆221Jul 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆53Jan 15, 2021Updated 5 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆14May 17, 2020Updated 5 years ago
- Machine learning experiment to perform gender classification from raw audio.☆23Sep 1, 2018Updated 7 years ago
- Model for recasing and repunctuating ASR transcripts☆142Apr 10, 2024Updated 2 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- RASTA-PLP and MFCC tool based rasta-mat☆33Jul 6, 2022Updated 3 years ago
- Beam-guided TasNet☆57Sep 8, 2022Updated 3 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Articulatory features estimation using Listen Attend and Spell architecture.☆33Apr 24, 2020Updated 5 years ago
- Exploring Bark, the Open-Source Text-to-Audio Generative Model☆15Oct 10, 2023Updated 2 years ago
- lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping☆14Mar 14, 2018Updated 8 years ago
- ☆19Jul 12, 2020Updated 5 years ago
- py-webrtcvad wrapper for trimming speech clips☆48Jul 3, 2022Updated 3 years ago
- Universal Adversarial Audio Perturbations☆17May 29, 2020Updated 5 years ago
- MINT, Multiplier-less INTeger Quantization for Energy Efficient Spiking Neural Networks, ASP-DAC 2024, Nominated for Best Paper Award☆16Apr 12, 2024Updated 2 years ago