hwanyyy/preprocessing-of-speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hwanyyy/preprocessing-of-speech)

hwanyyy / preprocessing-of-speech

VAD + resampling | High resolution spectrogram

☆14

Alternatives and similar repositories for preprocessing-of-speech

Users that are interested in preprocessing-of-speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ghunkins / Binaural-Source-Localization-CNN
View on GitHub
A Deep Convolutional Neural Network (DCNN) designed for the task of localizing human speech to 168 location classes using binaural microp…
☆10Dec 16, 2017Updated 8 years ago
GWLee0524 / AMTL
View on GitHub
Asymmetric Multi-Task Learning code, If you want to use it, please let me know and cite AMTL paper
☆11Aug 3, 2016Updated 9 years ago
saidvandeklundert / juniper-config-converter
View on GitHub
Convert Juniper configurations to 'set-style'
☆13Sep 2, 2023Updated 2 years ago
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
arief25ramadhan / sound-source-localization
View on GitHub
Four neural network architectures to classify sound source direction
☆11Oct 3, 2020Updated 5 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
matousc89 / signalz
View on GitHub
Data generators in Python
☆14Jun 10, 2019Updated 7 years ago
yakouyang / VAD
View on GitHub
voice active detection (python ver/simple and easy-to-use)
☆12May 1, 2017Updated 9 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
jtkim-kaist / end-point-detection
View on GitHub
☆10Sep 19, 2018Updated 7 years ago
ryands17 / node-beanstalk-cdk
View on GitHub
Deploy a Node app on Elastic Beanstalk with AWS Codebuild using aws-cdk
☆10Updated this week
crimson-knight / llamero
View on GitHub
A wrapper shard for llama.cpp that acts as a client to work directly with AI models through llama.cpp from within Crystal applications
☆21Jul 21, 2026Updated last week
bcasim / bcasim
View on GitHub
Blockchain Attack Simulator (BCASim) is an Open Source Blockchain Simulator for Attack Analysis
☆17Dec 21, 2025Updated 7 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
JyothiM93 / Applied-Signal-Processing-to-SoundSourceLocalisation
View on GitHub
Goal is to estimate the location of sound source using microphones array. LMS method is used to estimate time delays. Steepest descent al…
☆14Oct 27, 2017Updated 8 years ago
AUSG / book-study
View on GitHub
끝없이 책읽는 스터디
☆14Updated this week
jjery2243542 / one-shot-vc-demo
View on GitHub
☆10Apr 22, 2019Updated 7 years ago
amjadsaadeh / pyMUSIC
View on GitHub
Simple implementation of MUltiple SIgnal Classification
☆14Jan 30, 2016Updated 10 years ago
cadia-lvl / kaldi-speaker-diarization
View on GitHub
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17Aug 12, 2024Updated last year
rogiervandenberg / nextjs-typescript-bulma-boilerplate
View on GitHub
Typescript boiler plate for Next.js with Bulma using Typescript to quickly kick off a new project
☆19Feb 2, 2022Updated 4 years ago
voxeet / voxeet-uxkit-ios
View on GitHub
☆11Jun 5, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
0xjeffsec / awesome-blocksec-ctf
View on GitHub
A curated list of blockchain security Capture the Flag (CTF) competitions
☆15Jan 31, 2021Updated 5 years ago
RickyMexx / 3D-Sound-Localization
View on GitHub
Quaternion Neural Networks for 3D Sound Source Localization in Reverberant Environments.
☆19Nov 21, 2022Updated 3 years ago
meelement / noise_adversarial_tacotron
View on GitHub
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Aug 15, 2019Updated 6 years ago
DIAGNijmegen / AbdomenMRUS-prostate-segmentation
View on GitHub
Grand Challenge wrapper for whole-gland prostate segmentation with nnUNet
☆16Nov 28, 2023Updated 2 years ago
fakufaku / pyramic-dataset
View on GitHub
48-Channel Anechoic Audio Recordings of 3D Sources
☆17Feb 4, 2020Updated 6 years ago
DebabrataPal7 / DAFOSNET
View on GitHub
Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)
☆18Dec 18, 2023Updated 2 years ago
urigott / Gait-events-identification-algorithm
View on GitHub
A python algorithm to detect foot contact and foot clearance using kinematic or inertial data during forward or backward walking
☆11Aug 3, 2021Updated 4 years ago
bill9800 / Speech-denoise-Autoencoder
View on GitHub
Speech denoiser model using Keras
☆20Jan 23, 2019Updated 7 years ago
bingo-todd / GCC-PHAT_DNN_Loc
View on GitHub
DNN based binaural sound localization model, using GCC-PHAT as features
☆22Jun 13, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
SyncfusionExamples / dotnet-maui-audio-recorder-and-player-
View on GitHub
☆10Sep 13, 2025Updated 10 months ago
lyarinet / multicast
View on GitHub
UDP multicast MPEG-TS streams on Linux
☆11Mar 25, 2025Updated last year
imyaman / kkeun-ma-rit-gi
View on GitHub
pull request 해보고 싶은 분들을 위한 놀이공간
☆26Mar 12, 2026Updated 4 months ago
bond005 / vad
View on GitHub
Various algorithms for voice activity detection
☆22Jan 31, 2017Updated 9 years ago
cmrfrd / DetectingDeepFakes_BlackHat2019
View on GitHub
Detect audio deep fakes with bispectral analysis
☆19Aug 6, 2019Updated 6 years ago
hudson-and-thames / example-notebooks
View on GitHub
☆21Jul 11, 2023Updated 3 years ago
alumae / voxlingua107_sb
View on GitHub
VoxLingua107 recipe for SpeechBrain
☆13Jul 3, 2021Updated 5 years ago