arief25ramadhan / sound-source-localization
Four neural network architectures to classify sound source direction
☆11 · Oct 3, 2020 · Updated 5 years ago
Alternatives and similar repositories for sound-source-localization
Users interested in sound-source-localization are comparing it to the repositories listed below.
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp… ☆10 · Dec 16, 2017 · Updated 8 years ago
- Files for the paper: "Sound Source Localization using Deep Residual Learning" ☆24 · Nov 13, 2017 · Updated 8 years ago
- Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments. ☆19 · Nov 21, 2022 · Updated 3 years ago
- PyTorch implementation of BiFSMN, IJCAI 2022 ☆22 · Feb 10, 2023 · Updated 3 years ago
- PyTorch implementation of LiMuSE ☆32 · Oct 11, 2022 · Updated 3 years ago
- Audio activity detector based on per-channel energy normalization (PCEN) ☆29 · Nov 16, 2018 · Updated 7 years ago
- Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 · Jul 19, 2022 · Updated 3 years ago
- keras-regression-cnns ☆11 · Dec 23, 2019 · Updated 6 years ago
- A simple Python implementation of spectral subtraction. ☆42 · Apr 21, 2019 · Updated 6 years ago
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram… ☆10 · Sep 22, 2024 · Updated last year
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition" ☆11 · Dec 15, 2022 · Updated 3 years ago
- Python Phonetic Tools and Distance Metrics ☆13 · Apr 21, 2018 · Updated 7 years ago
- ☆10 · Sep 19, 2018 · Updated 7 years ago
- This repository contains the video files (download links) and corresponding annotations used in the paper "Long-Term Face Tracking for Cr… ☆14 · Dec 18, 2020 · Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge ☆21 · Jul 25, 2022 · Updated 3 years ago
- ☆12 · May 22, 2022 · Updated 3 years ago
- TensorFlow implementation of the Differentiable Neural Computer ☆12 · Feb 15, 2019 · Updated 7 years ago
- Compute the most likely permutation of a lattice given an LM ☆10 · Jan 3, 2013 · Updated 13 years ago
- A collection of basic text processing modules focused on Gujarati ☆10 · Oct 24, 2017 · Updated 8 years ago
- A custom frequency encoder for the HTM, an AI algorithm by @numenta. ☆11 · Mar 17, 2017 · Updated 8 years ago
- A Transformer-based Prediction Method for Depth of Anesthesia During Target-controlled Infusion of Propofol and Remifentanil. ☆15 · Feb 17, 2025 · Updated 11 months ago
- Analyzing Conditional Adversarial Networks to solve image recovery problems like shadow recovery, denoising and deblurring - CVIP 2019 ☆10 · Jun 9, 2020 · Updated 5 years ago
- Exploratory notebook. Techniques used: FFT, ARIMA, GARCH, Monte Carlo Simulations, fbprophet, LSTM, WaveNet. ☆11 · Jul 11, 2022 · Updated 3 years ago
- MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition ☆11 · Dec 4, 2021 · Updated 4 years ago
- Most Complete PyTorch Implementation of "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION" ☆10 · Mar 11, 2020 · Updated 5 years ago
- VAD + resampling | High resolution spectrogram ☆14 · Nov 29, 2022 · Updated 3 years ago
- Asymmetric Multi-Task Learning code. If you want to use it, please let me know and cite the AMTL paper ☆11 · Aug 3, 2016 · Updated 9 years ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement" ☆14 · Feb 13, 2022 · Updated 4 years ago
- ☆11 · Dec 31, 2019 · Updated 6 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation ☆10 · Nov 8, 2019 · Updated 6 years ago
- Data generators in Python ☆14 · Jun 10, 2019 · Updated 6 years ago
- Tsinghua University SPMI Lab array processing toolkit ☆18 · Nov 23, 2016 · Updated 9 years ago
- Awesome Quantization Paper lists with Codes ☆10 · Feb 24, 2021 · Updated 4 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network ☆10 · Dec 12, 2018 · Updated 7 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization ☆12 · Jul 5, 2022 · Updated 3 years ago
- ☆10 · Mar 21, 2018 · Updated 7 years ago
- A Python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022] ☆59 · Sep 28, 2024 · Updated last year
- py-webrtcvad wrapper for trimming speech clips ☆48 · Jul 3, 2022 · Updated 3 years ago
- ☆14 · May 9, 2022 · Updated 3 years ago