☆21Feb 5, 2018Updated 8 years ago
Alternatives and similar repositories for docker-kaldi
Users that are interested in docker-kaldi are comparing it to the libraries listed below
Sorting:
- Build kaldi inside docker containers with option for CUDA support☆12Feb 6, 2017Updated 9 years ago
- This repository contains the files used for our Interspeech 2017 paper.☆16May 30, 2017Updated 8 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- ASR library☆14Dec 3, 2018Updated 7 years ago
- Data preparation code for building Kaldi ASR system☆14Mar 18, 2017Updated 8 years ago
- Sisyphus recipies for ASR☆19Updated this week
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Nov 27, 2014Updated 11 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Experimenting with musically motivated convolutional neural networks☆16Jun 8, 2016Updated 9 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- A simple toolkit for speaker segmentation and identification☆31Jun 15, 2013Updated 12 years ago
- EESEN based offline transcriber VM using models trained on TEDLIUM and Cantab Research☆50Jun 4, 2019Updated 6 years ago
- Convolutional neural networks for sound classification☆20Dec 30, 2017Updated 8 years ago
- Zero-Resource Speech Discovery, Search, and Evaluation Tools☆29Aug 6, 2015Updated 10 years ago
- Pumilio: A Web-Based Management System for Ecological Recordings☆13Oct 29, 2018Updated 7 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- Audio Analysis by Conceptor☆30Aug 20, 2015Updated 10 years ago
- VoxSRC Challenge☆31Jun 11, 2019Updated 6 years ago
- Tensorflow with KenLM integrated for beam search scoring☆34Jul 28, 2017Updated 8 years ago
- Code, source data, examples, and audio excerpts for Flow: Expressive Rhythm in the Rapping Voice☆10Feb 13, 2020Updated 6 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- ☆36Feb 23, 2017Updated 9 years ago
- Github mirror of MediaWiki extension Wikispeech - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Develo…☆12Updated this week
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 9 years ago
- Listen to the weather using Sonic Pi and data from Mathematica☆11Dec 6, 2018Updated 7 years ago
- Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)☆80Dec 13, 2015Updated 10 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- ☆13Feb 21, 2026Updated last week
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.☆12Nov 17, 2023Updated 2 years ago
- Configuration Information for Qt + SGX on TI Platforms☆24Sep 14, 2013Updated 12 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 3 months ago
- Small compression utility☆38Jan 20, 2026Updated last month
- PyGun: Procedural Generation of Anechoic Gunshot Sounds☆14Oct 8, 2016Updated 9 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- A Tree-LSTM-based dependency tree sentiment labeler☆15May 9, 2019Updated 6 years ago