CaA23187 / VAD-based-on-LSTM
An LSTM for voice activity detection. In fact, this is a homework assignment that I didn't expect.
☆13 · Dec 3, 2020 · Updated 5 years ago
Alternatives and similar repositories for VAD-based-on-LSTM
Users who are interested in VAD-based-on-LSTM are comparing it to the libraries listed below.
- Pytorch implementation of "spectro-temporal attention-based voice activity detection" ☆13 · Jun 4, 2024 · Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi) ☆27 · Mar 20, 2021 · Updated 4 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch ☆42 · Feb 8, 2020 · Updated 6 years ago
- ConvLSTM-AE_VAD_ICME2017 (code reimplementation) ☆21 · Oct 10, 2020 · Updated 5 years ago
- ☆27 · Jul 9, 2022 · Updated 3 years ago
- ERB representation of an audio file implemented in Python ☆27 · Oct 21, 2018 · Updated 7 years ago
- microphone array speech generator (MASG) in room acoustic ☆39 · Jan 2, 2020 · Updated 6 years ago
- Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs ☆45 · May 19, 2022 · Updated 3 years ago
- acoustic interference (echo) cancellation project in summer internship ☆88 · Aug 31, 2018 · Updated 7 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning ☆18 · May 8, 2025 · Updated 9 months ago
- Running a tflite facial expression recognition model on Android ☆12 · Apr 7, 2021 · Updated 4 years ago
- ☆12 · Jun 17, 2019 · Updated 6 years ago
- ☆45 · Dec 5, 2019 · Updated 6 years ago
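- A collection of data simulation tools and methods for speech enhancement (continuously updated) ☆45 · Jul 11, 2024 · Updated last year
- A small cloud fitting calculator; the calculation method comes from Weibo user @摸发虱痒 ☆10 · Apr 10, 2022 · Updated 3 years ago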
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022 ☆48 · Jun 10, 2022 · Updated 3 years ago
- ☆17 · Jun 24, 2025 · Updated 7 months ago
- Translating Synthetic RIRs to Real RIRs ☆45 · Sep 15, 2023 · Updated 2 years ago
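- 2021 Digital China Innovation Contest: second place in crowd-intelligence optimization of morning-rush-hour shared-bike tidal points ☆12 · Apr 26, 2021 · Updated 4 years ago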
- ☆13 · Jan 12, 2023 · Updated 3 years ago
- ☆11 · Jun 15, 2022 · Updated 3 years ago
- ☆13 · Nov 16, 2020 · Updated 5 years ago
- [ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods ☆10 · Jan 12, 2020 · Updated 6 years ago
- Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation. ☆10 · Sep 12, 2024 · Updated last year
- Code for calculating DNS_MOS. ☆43 · Dec 18, 2022 · Updated 3 years ago
- A python implementation of “SRP-DNN: Learning Direct-Path Phase Difference for Multiple Moving Sound Source Localization” [ICASSP 2022] ☆59 · Sep 28, 2024 · Updated last year
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model ☆13 · Nov 25, 2019 · Updated 6 years ago
- a lightweight network for monaural speech enhancement ☆56 · Oct 12, 2023 · Updated 2 years ago
- A packaged convolutional voice activity detector for noisy environments. ☆14 · Jun 15, 2019 · Updated 6 years ago
- Codebase of the submitted work in ICASSP 2023 ☆14 · Nov 30, 2022 · Updated 3 years ago
- ☆13 · Jan 30, 2021 · Updated 5 years ago
- Open source data for data visualization enthusiasts. ☆22 · Dec 20, 2021 · Updated 4 years ago
- China Mobile consumer profiling: intelligent credit scoring, top 10 ☆13 · Jun 18, 2019 · Updated 6 years ago
- HRTF data preparation for machine learning by finding common measurement angles ☆12 · May 14, 2019 · Updated 6 years ago
- ☆11 · Nov 25, 2020 · Updated 5 years ago
- Generating non-stationary multi-sensor signals under a spatial coherence constraint (Python) ☆22 · Jan 14, 2025 · Updated last year
- This repository contains code for an acoustic simulation framework that can be used for acoustic/ultrasonic indoor positioning and/or dat… ☆13 · May 7, 2024 · Updated last year
- speech-enhacement ☆59 · Nov 5, 2019 · Updated 6 years ago
- Room acoustics simulator for multichannel microphone arrays ☆58 · Feb 1, 2024 · Updated 2 years ago