VAD + resampling | High resolution spectrogram
☆14Nov 29, 2022Updated 3 years ago
Alternatives and similar repositories for preprocessing-of-speech
Users that are interested in preprocessing-of-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…☆10Dec 16, 2017Updated 8 years ago
- Asymmetric Multi-Task Learning code, If you want to use it, please let me know and cite AMTL paper☆11Aug 3, 2016Updated 9 years ago
- Convert Juniper configurations to 'set-style'☆12Sep 2, 2023Updated 2 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- Four neural network architectures to classify sound source direction☆11Oct 3, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Data generators in Python☆14Jun 10, 2019Updated 6 years ago
- Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'☆12Mar 22, 2023Updated 3 years ago
- Deploy a Node app on Elastic Beanstalk with AWS Codebuild using aws-cdk☆10Mar 13, 2026Updated 2 weeks ago
- voice active detection (python ver/simple and easy-to-use)☆12May 1, 2017Updated 8 years ago
- ☆13Mar 25, 2021Updated 5 years ago
- ☆10Sep 19, 2018Updated 7 years ago
- Blockchain Attack Simulator (BCASim) is an Open Source Blockchain Simulator for Attack Analysis☆17Dec 21, 2025Updated 3 months ago
- A wrapper shard for llama.cpp that acts as a client to work directly with AI models through llama.cpp from within Crystal applications☆19Jan 23, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Convert Numerical Representations to Korean Pronunciation☆14Apr 20, 2020Updated 5 years ago
- Goal is to estimate the location of sound source using microphones array. LMS method is used to estimate time delays. Steepest descent al…☆14Oct 27, 2017Updated 8 years ago
- Spoken Language Identification from Short Utterances☆13Jul 6, 2022Updated 3 years ago
- 끝없이 책읽는 스터디☆13Updated this week
- ☆10Apr 22, 2019Updated 6 years ago
- Simple implementation of MUltiple SIgnal Classification☆14Jan 30, 2016Updated 10 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- A curated list of blockchain security Capture the Flag (CTF) competitions☆15Jan 31, 2021Updated 5 years ago
- Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)☆16Dec 18, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Grand Challenge wrapper for whole-gland prostate segmentation with nnUNet☆16Nov 28, 2023Updated 2 years ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Aug 12, 2024Updated last year
- Typescript boiler plate for Next.js with Bulma using Typescript to quickly kick off a new project☆19Feb 2, 2022Updated 4 years ago
- Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.☆19Nov 21, 2022Updated 3 years ago
- Simple tutorials using Google's TensorFlow Framework☆42Jan 27, 2020Updated 6 years ago
- 48-Channel Anechoic Audio Recordings of 3D Sources☆17Feb 4, 2020Updated 6 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Aug 15, 2019Updated 6 years ago
- UDP multicast MPEG-TS streams on Linux☆10Mar 25, 2025Updated last year
- Speech denoiser model using Keras☆20Jan 23, 2019Updated 7 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆10Sep 13, 2025Updated 6 months ago
- ☆21Jul 11, 2023Updated 2 years ago
- DNN based binaural sound localization model, using GCC-PHAT as features☆22Jun 13, 2023Updated 2 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- Stream Proxy - A proxy for the livestreams (streamlink and ffmpeg)☆16Jan 21, 2026Updated 2 months ago
- Performance comparison of Leaflet markercluster and supercluster☆12Apr 17, 2021Updated 4 years ago
- a simple FIX Protocol client that can be controlled using Groovy script scenarios☆18Oct 17, 2024Updated last year