hernanrazo/human-voice-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hernanrazo/human-voice-detection)

hernanrazo / human-voice-detection

Binary classification problem that aims to classify human voices from audio recordings. Implemented using PyTorch and Librosa.

☆37

Alternatives and similar repositories for human-voice-detection

Users that are interested in human-voice-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tarnowski-git / Audio_Spectrum_Analyzer
View on GitHub
Desktop GUI applications to show audio waveform and spectrogram which is visual representation of sound using the amplitude of the freque…
☆12Jul 21, 2023Updated 3 years ago
ghunkins / Binaural-Source-Localization-CNN
View on GitHub
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10Dec 16, 2017Updated 8 years ago
heeeyk / Transformer-DOA-Prediction
View on GitHub
A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil.
☆16Feb 17, 2025Updated last year
nfitter / BaxterFaces
View on GitHub
☆16Feb 2, 2018Updated 8 years ago
fangzheng81 / 3D-Object-Detection
View on GitHub
Papers and code related to 3D object detection
☆12May 23, 2020Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bingo-todd / WaveLoc
View on GitHub
End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
sashavol / Frozlunky
View on GitHub
Spelunky mod introducing a level editor, netplay, and more
☆12Nov 4, 2022Updated 3 years ago
ibliever / Cross-modal-information-fusion-for-voice-spoofing-detection
View on GitHub
This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"
☆13Jun 5, 2023Updated 3 years ago
AfifHM / Smart-CCTV-Using-Face-and-Human-Detection
View on GitHub
☆13Aug 10, 2020Updated 5 years ago
hasnainnaeem / Gunshot-Detection-in-Audio
View on GitHub
Audio classification deep learning model using TensorFlow 2.0 to detect Gunshots. 97.5% test set accuracy and 99% training set accuracy w…
☆23Feb 16, 2020Updated 6 years ago
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
FYJNEVERFOLLOWS / ResNet-STFT-SSL
View on GitHub
ResNet-STFT Model for Sound Source Localization
☆20Aug 25, 2022Updated 3 years ago
topel / bird_audio_detection_challenge
View on GitHub
DenseNets for the detection of singing birds in audio files
☆19Nov 15, 2017Updated 8 years ago
olivierlar / mirtoolbox
View on GitHub
☆17May 31, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
omarrayyann / KAN-Conv2D
View on GitHub
Drop-in convolutional Kolmogorov-Arnold Network
☆19May 23, 2024Updated 2 years ago
awsaf49 / audio_classification_models
View on GitHub
Tensorflow Audio Classification Models
☆13Jul 21, 2023Updated 3 years ago
kolmogorovArnoldFourierNetwork / KAF
View on GitHub
KAF : Kolmogorov-Arnold Fourier Networks
☆22Feb 19, 2025Updated last year
vishalshar / Audio-Classification-using-CNN-MLP
View on GitHub
Multi class audio classification using Deep Learning (MLP, CNN): The objective of this project is to build a multi class classifier to id…
☆69Jan 8, 2021Updated 5 years ago
sjaishanker / Benford-Analysis-For-Fraud-Detection
View on GitHub
Benford law helps in detecting the irregularity in a set of numbers. It can be used to detect fraud in image forensics(detecting whether …
☆24Nov 11, 2020Updated 5 years ago
Mak-3 / Car-Dirtiness-and-Damage-detection
View on GitHub
☆16Jan 15, 2023Updated 3 years ago
NathanDuran / CAMS-Dialogue-Annotation
View on GitHub
Label dialogue with Dialogue Acts and Adjacency Pairs
☆12Jun 20, 2023Updated 3 years ago
davidefiocco / faiss-on-disk-example
View on GitHub
Example of out-of-RAM k-nearest neighbors search using faiss
☆18Mar 28, 2026Updated 4 months ago
anushuk / Object-Detection-SSD
View on GitHub
Detecting Multiple objects in a video using Single Shot Multibox Detector
☆24Feb 26, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
oppo-us-research / PoP-Net
View on GitHub
Pose over Parts Network for Multi-Person 3D Pose Estimation from a Depth Image
☆34Jun 1, 2023Updated 3 years ago
akq / Leaflet.DonutCluster
View on GitHub
Display donut statistic information instead of only a circle with marker cluster and leaflet.
☆14Apr 8, 2019Updated 7 years ago
chrepl / ds4
View on GitHub
Tools for working with DualShock 4
☆16Jan 20, 2017Updated 9 years ago
techpro-studio / MetalAudioShaders
View on GitHub
MPS like shaders for audio processing. Conv1d, Spectrogram.
☆19Apr 3, 2021Updated 5 years ago
blazerunner44 / survey
View on GitHub
Easy to use survey system in PHP
☆10Mar 7, 2021Updated 5 years ago
amnemonic / MacroSilicon
View on GitHub
MacroSilicon MS2109 research, code and information
☆19Dec 29, 2022Updated 3 years ago
SOUNDS-RESEARCH / complex_neural_source_localization
View on GitHub
Complex-valued neural networks for DOA estimation
☆31Jan 25, 2023Updated 3 years ago
maherharb / Autocomplete
View on GitHub
Next word prediction based on N-gram language model
☆11Jan 11, 2015Updated 11 years ago
amir-zeldes / rst2dep
View on GitHub
Converter for Rhetorical Structure Theory (RST) trees to dependency representation
☆17Aug 21, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
fischJan / CiRA
View on GitHub
System behavior is often expressed by causal relations in requirements (e.g. if event 1 then event 2). Automatically extracting this embe…
☆13Oct 24, 2021Updated 4 years ago
paysure / orinoco
View on GitHub
Functional composable pipelines allowing clean separation of the business logic and its implementation
☆11Sep 6, 2025Updated 10 months ago
wasimusu / RL-Chatbot
View on GitHub
Chatbot using reinforcement learning
☆19May 2, 2019Updated 7 years ago
Jasson-Chen / Add_noise_and_rir_to_speech
View on GitHub
The purpose of this code base is to add a specified signal-to-noise ratio noise from MUSAN dataset to a pure speech signal and to generat…
☆31Sep 21, 2021Updated 4 years ago
fanolabs / 0shot-classification
View on GitHub
Zero-shot Intent Classification and Unknown Intent Detection
☆19Sep 6, 2021Updated 4 years ago
hasithsura / Environmental-Sound-Classification
View on GitHub
☆31Nov 22, 2022Updated 3 years ago
sbera7 / Dialogue-act-classification
View on GitHub
Dialogue Act classification
☆18Jan 15, 2024Updated 2 years ago