patyork/AutomaticSpeechChunker

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/patyork/AutomaticSpeechChunker)

patyork / AutomaticSpeechChunker

From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pairs. For use with the Connectionist Temporal Classification (CTC) cost algorithm.

☆17

Alternatives and similar repositories for AutomaticSpeechChunker

Users that are interested in AutomaticSpeechChunker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
tiro-is / tiro-speech-core
View on GitHub
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15Jun 19, 2023Updated 3 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
danijel3 / SparrowhawkTest
View on GitHub
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14Oct 16, 2017Updated 8 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
projecte-aina / oTranscribe-plus
View on GitHub
A free & open tool for transcribing audio interviews with offline ASR support
☆25Dec 21, 2023Updated 2 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
FantSun / Speechflow
View on GitHub
Speechflow for emotion recognition related information decomposition
☆10Jul 27, 2021Updated 5 years ago
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
uhh-lt / kaldi-model-server
View on GitHub
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
☆35Feb 18, 2022Updated 4 years ago
DistantSpeechRecognition / sweethomelisten
View on GitHub
☆17Apr 8, 2016Updated 10 years ago
louiskirsch / speechT
View on GitHub
An opensource speech-to-text software written in tensorflow
☆160Oct 15, 2022Updated 3 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
souvikg544 / TTS_Data_Maker
View on GitHub
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s…
☆28Mar 14, 2023Updated 3 years ago
uiuc-sst / asr24
View on GitHub
24-hour Automatic Speech Recognition
☆27Jun 4, 2021Updated 5 years ago
yt605155624 / TTSAndroid
View on GitHub
TTS Android demo of PaddleSpeech, merged into https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos
☆28Nov 30, 2022Updated 3 years ago
SergMa / free-nross
View on GitHub
Free noise reduction of speech signals
☆12Jul 26, 2016Updated 10 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
usc-sail / barista
View on GitHub
Barista is an open-source framework for concurrent speech processing.
☆36Mar 19, 2014Updated 12 years ago
amaas / stanford-ctc
View on GitHub
Neural net code for lexicon-free speech recognition with connectionist temporal classification
☆250Feb 23, 2016Updated 10 years ago
articulateinstruments / DeepLabCut-for-Speech-Production
View on GitHub
Trained deep neural-net models for estimating articulatory keypoints from midsagittal ultrasound tongue videos and front-view lip camera …
☆25Jun 13, 2023Updated 3 years ago
dafyddg / RFA
View on GitHub
Implementation of the Rhythm Formant Analysis methodology for identifying speech rhythms and rhythm variation in the low frequency spectr…
☆17Apr 27, 2023Updated 3 years ago
fchest / Speech-Transformer-multi-GPUs
View on GitHub
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10Dec 25, 2019Updated 6 years ago
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
ICLR-DAP / Deep-Audio-Prior
View on GitHub
Anonymous ICLR Submission
☆14Sep 25, 2019Updated 6 years ago
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
motazsaad / ara-pronunciation-tool
View on GitHub
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15Sep 5, 2017Updated 8 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
gooofy / py-kaldi-asr
View on GitHub
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
☆169Feb 23, 2021Updated 5 years ago
espnet / espnet_tts_frontend
View on GitHub
Text frontend for ESPnet tts recipes
☆35Jun 1, 2021Updated 5 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
skinahan / DIVA_PyTorch
View on GitHub
Implementation of the DIVA model of speech acquisition and production using PyTorch
☆23Jan 18, 2023Updated 3 years ago