A LSTM for voice activity detection. In fact, this is a homework which I didn't expected.
☆13Dec 3, 2020Updated 5 years ago
Alternatives and similar repositories for VAD-based-on-LSTM
Users that are interested in VAD-based-on-LSTM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pytorch implementation of "spectro-temporal attention-based voice activity detection"☆13Jun 4, 2024Updated last year
- Pytorch version of Voice Activity Detection (VAD) based on Deep Learning (https://github.com/filippogiruzzi)☆27Mar 20, 2021Updated 5 years ago
- Simple DNN based Voice Activity Detection (VAD) using Pytorch☆42Feb 8, 2020Updated 6 years ago
- ConvLSTM-AE_VAD_ICME2017 (code reimplementation)☆21Oct 10, 2020Updated 5 years ago
- Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering☆24Oct 19, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- ☆28Jul 9, 2022Updated 3 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- acoustic interference (echo) cancellation project in summer internship☆88Aug 31, 2018Updated 7 years ago
- ☆17Jun 24, 2025Updated 9 months ago
- microphone array speech generator (MASG) in room acoustic☆39Jan 2, 2020Updated 6 years ago
- [ICLR 2020] Contrastive Representation Distillation (CRD), and benchmark of recent knowledge distillation methods☆10Jan 12, 2020Updated 6 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆20May 8, 2025Updated 10 months ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 🦠 COVID-19 Daily Data from Worldometers with Python☆13Feb 28, 2021Updated 5 years ago
- 在Android上运行人脸表情识别的tflite模型☆12Apr 7, 2021Updated 4 years ago
- Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021☆159Oct 26, 2021Updated 4 years ago
- TensorFlow and deep learning without a PhD, translated to Chinese☆17Feb 18, 2017Updated 9 years ago
- Repo for our pooling approach on the DCASE2018 task4☆15Jul 6, 2023Updated 2 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- ☆45Dec 5, 2019Updated 6 years ago
- Open source data for data visualization enthusiasts.☆22Dec 20, 2021Updated 4 years ago
- Benchmarking different VAD models on AVA-Speech dataset☆18May 21, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- 语音增强领域的相关数据仿真工具和方法汇总--持续更新☆45Jul 11, 2024Updated last year
- Code repository for the paper Direction of Arrival Estimation of Sound Sources Using Icosahedral CNNs☆45May 19, 2022Updated 3 years ago
- Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.☆11Sep 12, 2024Updated last year
- The implementation of TaylorBeamformer, which is in submission to Interspeech2022☆48Jun 10, 2022Updated 3 years ago
- ☆11Nov 25, 2020Updated 5 years ago
- Translating Synthetic RIRs to Real RIRs☆45Sep 15, 2023Updated 2 years ago
- ☆14Jan 12, 2023Updated 3 years ago
- 一个小小的云fitting计算器,计算方法来自微博@摸发虱痒☆10Apr 10, 2022Updated 3 years ago
- ☆12Jun 17, 2019Updated 6 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆13Nov 16, 2020Updated 5 years ago
- simple dnn based vad☆70Dec 2, 2018Updated 7 years ago
- Generating non-stationary multi-sensor signals under a spatial coherence constraint (Python)☆24Jan 14, 2025Updated last year
- Develop speaker recognition model based on i-vector using TIMIT database☆16Jul 4, 2019Updated 6 years ago
- Code for calculate DNS_MOS.☆43Dec 18, 2022Updated 3 years ago
- ☆11Jun 15, 2022Updated 3 years ago
- A curated list of awesome Voiceprint Recognition papers☆19Jul 9, 2021Updated 4 years ago