A deep learning model for face detection
☆15May 15, 2017Updated 8 years ago
Alternatives and similar repositories for face_detection
Users that are interested in face_detection are comparing it to the libraries listed below
Sorting:
- ☆15Apr 7, 2025Updated 11 months ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- Emotion Classification on FerPlus Dataset☆10Dec 10, 2018Updated 7 years ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆15Oct 28, 2024Updated last year
- pytorch code for sound event localization and classification☆13Aug 12, 2021Updated 4 years ago
- CANTE: Automatic transcription of flamenco singing.☆14Feb 13, 2018Updated 8 years ago
- Localize to Binauralize: Audio Spatialization from Visual Sound Source Localization (ICCV 2021)☆10Oct 11, 2021Updated 4 years ago
- ☆13Aug 13, 2023Updated 2 years ago
- ☆13Jan 14, 2025Updated last year
- The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.☆16Aug 12, 2025Updated 6 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- The codes for 'Predicting Global Head-Related Transfer Functions From Scanned Head Geometry Using Deep Learning and Compact Representatio…☆14May 16, 2024Updated last year
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- Python tool for creating average images from faces☆14Jan 13, 2017Updated 9 years ago
- SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确,目前我还是初学者,如有问题请原谅我并指出,谢谢!☆15May 16, 2025Updated 9 months ago
- Experiments with noisy labels, to se the accuracy of classification under noise, effects of active learning, etc☆13Oct 1, 2017Updated 8 years ago
- A Python Library for Full Reference Binaural Fidelity Testing, Visualization & Feature Generation☆23Oct 30, 2025Updated 4 months ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- An implementation of Zhang and Sclaroff's Boolean Map Saliency algorithm. http://cs-people.bu.edu/jmzhang/BMS/BMS_iccv13_preprint.pdf.☆14Apr 22, 2025Updated 10 months ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆14Sep 25, 2023Updated 2 years ago
- Benchmark on interactive safety☆12Dec 4, 2019Updated 6 years ago
- ☆17Nov 22, 2022Updated 3 years ago
- ☆14May 12, 2017Updated 8 years ago
- Spoofing Speaker Verification Systems with Multi-speaker Text-to-speech Synthesis☆11Jun 21, 2022Updated 3 years ago
- A comapartive analysis of voice spoofing detection systems, based on a paper available at https://arxiv.org/abs/2210.00417.☆17Oct 24, 2022Updated 3 years ago
- This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…☆18Apr 30, 2022Updated 3 years ago
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 3 years ago
- Materials for "Multimedia Deepfake Detection" Tutorial @ ICME 2024☆17Aug 26, 2024Updated last year
- ☆18Jan 10, 2024Updated 2 years ago
- Python 3.5 and Windows version of Speech Enhancement using DNN by Yong Xu and Qiuqiang Kong☆15Mar 13, 2019Updated 6 years ago
- PyTorch implementation of a self-attentive speaker embedding☆17Sep 24, 2019Updated 6 years ago
- Implementation of the GaussianFace algorithm for TU Delft IN4393 Computer Vision 2016/2017☆18Sep 21, 2018Updated 7 years ago
- Source code for EAC-Net in Theano/Pytorch/Tensorflow☆20Jan 16, 2018Updated 8 years ago
- Implemeting Improving Facial Attribute Prediction using Semantic Segmentation using Pytorch☆15Nov 27, 2018Updated 7 years ago
- Python library for detecting faces and classifying emotions in images lightweight, efficient threading and object pooling for concurrent …☆17Oct 26, 2025Updated 4 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 9 months ago
- PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection☆35Sep 17, 2025Updated 5 months ago
- ☆20Dec 11, 2017Updated 8 years ago