☆15Mar 15, 2022Updated 3 years ago
Alternatives and similar repositories for iclr22-wctc
Users that are interested in iclr22-wctc are comparing it to the libraries listed below
- ☆14Jun 12, 2015Updated 10 years ago
- Hack and Tell @ Saarland University☆19Dec 11, 2017Updated 8 years ago
- Convert Korean hangul to romanized syllables☆23Jun 18, 2024Updated last year
- Primer on CTC implementation in pure Python PyTorch code☆111Jul 27, 2024Updated last year
- TTS Android demo of PaddleSpeech, merged into https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos☆28Nov 30, 2022Updated 3 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 7 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- Speech Recognition implementation using Artificial Neural Networks☆10Sep 7, 2015Updated 10 years ago
- A JAX library for building lattice-based speech transducer models☆46Updated this week
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 4 months ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆41Dec 18, 2020Updated 5 years ago
- Simple implementation of TDOA localization algorithm.☆13Oct 12, 2016Updated 9 years ago
- Hadoop-based tool for extraction of large scale synchronous grammars for paraphrasing and machine translation☆15Dec 2, 2016Updated 9 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Machine learning for control systems engineers☆11Jul 19, 2023Updated 2 years ago
- Using acceleration and heart rate data to classify awake, deep, and light sleep☆10Dec 21, 2017Updated 8 years ago
- A signal processing library, currently sufficient for basic speech recognition stuff like mel frequency cepstrum☆19Mar 15, 2012Updated 13 years ago
- This is now the official location of the Kaldi project.☆10Aug 22, 2019Updated 6 years ago
- This is a telegram bot for correcting language mistakes in group chats☆10Jun 29, 2021Updated 4 years ago
- A package for crawling audio from YouTube☆42Aug 8, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- ☆46Nov 2, 2023Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Visualization for hidden Markov model computations☆14Dec 19, 2014Updated 11 years ago
- Multi-layer perceptron, Autoencoder, and Restricted Boltzmann Machine☆10Sep 15, 2018Updated 7 years ago
- Audio source separation using CASA approaches in Python.☆11Apr 2, 2015Updated 10 years ago
- NER as a (Micro)Service☆10Jan 11, 2017Updated 9 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- Statistical WHOIS parser☆10Apr 17, 2017Updated 8 years ago
- ☆12Oct 7, 2020Updated 5 years ago
- How to generate full-contextual labels from unseen text for HMM-based speech synthesis (HTS)☆12Nov 22, 2019Updated 6 years ago
- Python bindings for the htmd Rust library, a fast HTML to Markdown converter☆11Feb 23, 2026Updated last week
- ☆10Aug 3, 2020Updated 5 years ago
- Implementation of joint bayesian model, written in python.☆11Aug 2, 2021Updated 4 years ago
- Music segmentation by ordinal linear discriminant analysis☆18Nov 10, 2017Updated 8 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- ☆17Jul 29, 2018Updated 7 years ago