A project for VAD. It is a homework project for ASR
☆13Sep 25, 2017Updated 8 years ago
Alternatives and similar repositories for Voice-Activity-Detection
Users that are interested in Voice-Activity-Detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- FFTW is a C subroutine library for computing the discrete Fourier transform (DFT) in one or more dimensions, of arbitrary input size, and…☆17Sep 6, 2018Updated 7 years ago
- Specification for media☆16Jul 4, 2025Updated 9 months ago
- assignments for e6870 ASR class☆42Apr 23, 2019Updated 6 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆14Jan 14, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- dddd☆23Dec 24, 2025Updated 3 months ago
- Relative transmission function based multichannel speech enhancement or BSS in LCMV-GSC structure☆42Jun 12, 2019Updated 6 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- ☆16Apr 24, 2021Updated 4 years ago
- 3D Sound Effect on STM32F4☆10Oct 8, 2015Updated 10 years ago
- STT Service based on Kaldi ASR☆15Aug 17, 2018Updated 7 years ago
- uyghur text resource crawled from website☆12Dec 25, 2015Updated 10 years ago
- Library for real-time digital signal processing of microphone array signals. It is based on DSPONE adn WIPP and can perform binarula loca…☆16Mar 23, 2017Updated 9 years ago
- 防QQ变声功能(使用FMOD音频引擎)☆12Jun 1, 2017Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- A Real-time Voice Changer based on SensorTile (STM32L4).☆11Apr 24, 2017Updated 8 years ago
- Distributed Audio Array Matlab Toolbox for public viewing and use. For more information, read here: http://vis.uky.edu/distributed-audio-…☆12Mar 6, 2019Updated 7 years ago
- Simulation of the sound field in a room using Object-Oriented Programming☆10Jan 18, 2021Updated 5 years ago
- ☆17Oct 26, 2018Updated 7 years ago
- 豆瓣小组自动回复机器人☆22Dec 8, 2022Updated 3 years ago
- Attempt at writing a LCMV Beamformer based on Frosts 1972 paper☆11Apr 29, 2016Updated 9 years ago
- A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction☆68Dec 15, 2020Updated 5 years ago
- Wake-up-word(WUW)system is an emerging development in recent times. Voice interaction with systems have made life ease and aids in multi-…☆18Mar 11, 2019Updated 7 years ago
- Efficient Methods for BEamforming Deconvolution☆17Oct 26, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation of the offline method described in "Robust mvdr beamforming using time-frequency masks for online/offline asr in noise" (f…☆72Apr 16, 2018Updated 7 years ago
- Comfort Noise Generator Module Port From WebRTC☆22Mar 4, 2019Updated 7 years ago
- Modern audio compression for the internet.☆18Apr 22, 2018Updated 7 years ago
- ☆19Apr 1, 2020Updated 6 years ago
- 基于DNN和DTW算法配合VAD截取的微语音识别框架☆13Sep 27, 2017Updated 8 years ago
- https://ros.ai☆19Aug 28, 2019Updated 6 years ago
- ☆16Nov 13, 2017Updated 8 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Mar 19, 2024Updated 2 years ago
- ☆35Apr 8, 2019Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 科大讯飞语音唤醒、语音转换、语音识别的Node.js SDK,支持 win32和linux。☆15Mar 30, 2017Updated 9 years ago
- 总结了一些我的学习笔记,包括linux、C++、Java、Python、算法等,以及找工作时候的一些面经和笔记等。☆16Jun 12, 2019Updated 6 years ago
- 集成Webrtc的VAD,用于切分音频文件☆343Aug 26, 2020Updated 5 years ago
- This repo contains the scripts, models and required files for the Interspeech 2020 Deep Noise Suppression (DNS) Challenge. We are open so…☆15May 15, 2020Updated 5 years ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- 48-Channel Anechoic Audio Recordings of 3D Sources☆17Feb 4, 2020Updated 6 years ago
- A sample implementation of IMA-ADPCM wav encoder / decoder.☆18Aug 10, 2022Updated 3 years ago