jtkim-kaist/end-point-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jtkim-kaist/end-point-detection)

jtkim-kaist / end-point-detection

☆10

Alternatives and similar repositories for end-point-detection

Users that are interested in end-point-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

danpovey / openslr
View on GitHub
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
☆27Jul 26, 2020Updated 6 years ago
jtkim-kaist / ram_modified
View on GitHub
"Recurrent Models of Visual Attention" in TensorFlow
☆41Apr 13, 2017Updated 9 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
yongxuUSTC / DNN-SpeechEnhancement
View on GitHub
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
☆17Aug 31, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cadia-lvl / kaldi-speaker-diarization
View on GitHub
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17Aug 12, 2024Updated last year
zouxinghao / MRCG
View on GitHub
a optional way to extract audio feature
☆14Jun 10, 2017Updated 9 years ago
language-agent-tutorial / language-agent-tutorial.github.io
View on GitHub
[EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks
☆10Nov 27, 2024Updated last year
jtkim-kaist / Speech-enhancement
View on GitHub
Deep neural network based speech enhancement toolkit
☆220Jun 14, 2019Updated 7 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
LCF2764 / autoKWS2021_1st_solution
View on GitHub
Auto-KWS 2021 Challenge 1st place solution.
☆11Jul 20, 2021Updated 5 years ago
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
WeblateOrg / siphashc
View on GitHub
python c-module for siphash
☆20Updated this week
sshh12 / Conv-VAD
View on GitHub
A packaged convolutional voice activity detector for noisy environments.
☆14Jun 15, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
MontrealCorpusTools / speechcorpustools
View on GitHub
Easier analysis of large speech corpora
☆24Jun 22, 2021Updated 5 years ago
hwanyyy / preprocessing-of-speech
View on GitHub
VAD + resampling | High resolution spectrogram
☆14Nov 29, 2022Updated 3 years ago
sooftware / jasper
View on GitHub
PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)
☆32Mar 4, 2021Updated 5 years ago
qcri / ArabicASRChallenge2016
View on GitHub
This repository
☆32Nov 13, 2022Updated 3 years ago
triplet02 / KoNPron
View on GitHub
Convert Numerical Representations to Korean Pronunciation
☆14Apr 20, 2020Updated 6 years ago
r9y9 / MelGeneralizedCepstrums.jl
View on GitHub
Mel-Generalized Cepstrum analysis
☆19Jul 21, 2017Updated 9 years ago
eurecom-asp / pc-darts-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper Partially-Connected Differentiable Architecture Search for Deepfake and Spoofing…
☆18Apr 30, 2022Updated 4 years ago
jihyun300 / Speech-Recognizer
View on GitHub
Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition
☆15Apr 1, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
PiSchool / spoken-language-id
View on GitHub
Spoken Language Identification from Short Utterances
☆13Jul 6, 2022Updated 4 years ago
AmirmohammadRostami / ASV-anti-spoofing-with-EABN
View on GitHub
☆15Feb 25, 2023Updated 3 years ago
tomkocse / sim-rir-preparation
View on GitHub
Script to simulate room impulse responses
☆16Sep 29, 2016Updated 9 years ago
tuanio / nextformer
View on GitHub
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10Dec 15, 2022Updated 3 years ago
akosiorek / akosiorek.github.io
View on GitHub
☆20Feb 8, 2026Updated 5 months ago
idiap / CNN_QbE_STD
View on GitHub
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32Sep 3, 2018Updated 7 years ago
DebabrataPal7 / DAFOSNET
View on GitHub
Official Implementation of "Domain Adaptive Few-Shot Open-Set Learning" in IEEE/CVF International Conference on Computer Vision (ICCV'23)
☆18Dec 18, 2023Updated 2 years ago
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
SiliconLabs / mltk
View on GitHub
A Python package with command-line utilities and scripts to aid the development of machine learning models for Silicon Lab's embedded pl…
☆62Aug 20, 2025Updated 11 months ago
jtkim-kaist / VAD
View on GitHub
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
☆869Jun 9, 2021Updated 5 years ago
apoorvnandan / speech-recognition-primer
View on GitHub
This repository contains code for a tutorial on end to end automatic speech recognition.
☆18Sep 10, 2019Updated 6 years ago
lovehyun / tutorial-kubernetes
View on GitHub
☆19Feb 17, 2023Updated 3 years ago
chenzhehuai / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆13Jun 5, 2019Updated 7 years ago
vrenkens / nabu
View on GitHub
Code for end-to-end ASR with neural networks, build with TensorFlow
☆110Jan 24, 2019Updated 7 years ago