aldragan0/voice-recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aldragan0/voice-recognition)

aldragan0 / voice-recognition

Voice-based gender, age and language recognition.

☆44

Alternatives and similar repositories for voice-recognition

Users that are interested in voice-recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Anvarjon / Age-Gender-Classification
View on GitHub
Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…
☆28Mar 5, 2024Updated 2 years ago
dave-fernandes / SpeakerClassifier
View on GitHub
A random forest classifier to predict the age-group and gender of a speaker from voice measurements.
☆18Apr 30, 2019Updated 7 years ago
nhut-ngnn / Voice-Based-Age-and-Gender-Recogniton
View on GitHub
[ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of LSTM, RezoNet and Hybrid CNNs-BiLSTM Architecture" by Nhut Mi…
☆10Jan 16, 2025Updated last year
IvanEvan / voiceProfile-for-gender-age-classify
View on GitHub
Two Keras models for child/adult & man/woman classify use speech in Python.
☆14Jan 19, 2019Updated 7 years ago
MathurUtkarsh / Video-Captioning-Using-LSTM-and-Keras
View on GitHub
Generating Video Caption Using LSTM
☆12May 29, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
karthikbhamidipati / multi-task-speech-classification
View on GitHub
Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset
☆28Updated this week
yqli2420 / noisex-92
View on GitHub
☆15Sep 9, 2020Updated 5 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
aigalaxy / voice-emotion-recognition
View on GitHub
detecting the meotions using by analysing the sound of the person unsing python
☆11Oct 7, 2019Updated 6 years ago
WeldonWangwang / py-webrtcns
View on GitHub
Python interface to the WebRTC Noise Suppression
☆18Dec 16, 2021Updated 4 years ago
singhamanraj / Gender-Voice-Recognition_Machine-Learning
View on GitHub
Supervised Machine Learning Classification Algorithms using Python and R (Logistic Regression, Decision Tree, Random Forest, SVM)
☆14May 12, 2018Updated 8 years ago
EternalDusk / LipSyncVideoGenerator
View on GitHub
Automatically generate a lip-synced avatar based off of a transcript and audio
☆15Feb 17, 2023Updated 3 years ago
PanagiotisP / svs-multiband
View on GitHub
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Jun 18, 2022Updated 4 years ago
dachosen1 / Common-Voice
View on GitHub
Audio Classification with machine learning
☆18Jun 8, 2026Updated last month
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
admineral / RAG-X
View on GitHub
Advanced Video Graph RAG using SAM2,CLIP,BLIP,Qwen2-VL,YOLO-World ,Neo4j, WebGPU, local LLM
☆14Nov 25, 2024Updated last year
SuperKogito / Voice-based-gender-recognition
View on GitHub
Voice based gender recognition using Mel-frequency cepstrum coefficients (MFCC) and Gaussian mixture models (GMM)
☆221Jul 6, 2023Updated 3 years ago
Aria-K-Alethia / laughter-synthesis
View on GitHub
Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…
☆77Jul 16, 2023Updated 3 years ago
schufo / plla-tisvs
View on GitHub
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
☆24Nov 8, 2021Updated 4 years ago
ishalyminov / babi_tools
View on GitHub
Augmentation scripts for the bAbI Dialog Tasks dataset
☆13Oct 16, 2018Updated 7 years ago
weihaosky / dro-sfm
View on GitHub
☆16May 14, 2021Updated 5 years ago
MiuLab / Lattice-Transformer-SLU
View on GitHub
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10Jul 8, 2020Updated 6 years ago
Marinto-Richee / YOLOv8-and-GroundingDINO-for-Real-Time-License-Plate-Detection
View on GitHub
A project using YoloV8 to detect License Plates
☆13Sep 29, 2023Updated 2 years ago
blackbird71SR / Brain-Segmentation-and-Tumor-Detection
View on GitHub
Modified VGG16 and UNetCNN based 4D Image Segmentation (Finalist - Smart India Hackathon 2019)
☆12Aug 15, 2020Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
jefrydco / cari-teks-video-api
View on GitHub
API service for searching text in YouTube Closed Captions
☆12Updated this week
x4nth055 / gender-recognition-by-voice
View on GitHub
Building a Deep learning model that predicts the gender of a speaker using TensorFlow 2
☆130Apr 25, 2023Updated 3 years ago
qiskrypt / qiskrypt-tutorials
View on GitHub
A collection of Jupyter Notebooks with explanations, insights, tutorials, exercises and showing how to use the Qiskrypt software suite, b…
☆12Jun 28, 2021Updated 5 years ago
simurgailab / mask-rcnn-implementation-with-custom-dataset
View on GitHub
Implementation of Mask R-CNN architecture, one of the object recognition architectures, on a custom dataset.
☆10Nov 1, 2022Updated 3 years ago
nikhilpatil99 / Smart-Traffic-Management-Using-Deep-Learning
View on GitHub
The traffic handling schemes that are in use today are fixed time allocated traffic signal which do not change on incoming traffic or fai…
☆11May 21, 2021Updated 5 years ago
silver-zepp / zeppos-easy-ble
View on GitHub
ZeppOS BLE Master: Simple interaction with home Peripherals
☆18Nov 9, 2024Updated last year
mmorise / itako_singing
View on GitHub
東北イタコ歌唱データベースの最新ラベルデータ
☆23Jul 1, 2021Updated 5 years ago
goodmike31 / pl-asr-speech-data-survey
View on GitHub
Survey of available speech datasets for Polish ASR development
☆17Jan 1, 2025Updated last year
ravirajsinh45 / implementation_of_RCNN
View on GitHub
We implement RCNN algorithm for object detection from an Images.
☆17Jul 6, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
yohanes / mnist-sirekap
View on GitHub
Test/Demo SIREKAP digits recognition
☆10Feb 23, 2024Updated 2 years ago
robmsmt / SpeechLoop
View on GitHub
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
☆19Oct 5, 2022Updated 3 years ago
kreimanlab / DeepLearning-vs-HighLevelVision
View on GitHub
Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities?
☆12Dec 22, 2020Updated 5 years ago
HuangZiliAndy / SSL_for_multitalker
View on GitHub
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33Mar 16, 2023Updated 3 years ago
ben-tiki / naruto-handsign-recognition
View on GitHub
This project uses Google's Teachable Machine for real-time identification of hand gestures from the popular anime "Naruto".
☆11Oct 13, 2024Updated last year
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year