☆19Aug 27, 2018Updated 7 years ago
Alternatives and similar repositories for deepsphinx
Users that are interested in deepsphinx are comparing it to the libraries listed below
- text to speech☆10Mar 19, 2024Updated last year
- ☆10Apr 17, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- An extension of the Kaldi speech recognition software that allows decoding speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- The Cantonese Wordnet☆14Dec 4, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆33Feb 23, 2026Updated last week
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- A Tensorflow SqueezeNet implementation☆14Oct 1, 2018Updated 7 years ago
- ☆14Aug 19, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆23Aug 14, 2025Updated 6 months ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Sep 19, 2017Updated 8 years ago
- Extraction de LExique par Variation d'Entropie - Lexicon extraction based on the variation of entropy☆14Oct 25, 2020Updated 5 years ago
- ☆14Jul 24, 2025Updated 7 months ago
- A library that infers the pronunciation of English words.☆26Nov 8, 2025Updated 3 months ago
- A lightweight audio codec based on a single quantizer☆32Sep 4, 2025Updated 5 months ago
- Dynamic Entity Summarization (DynES)☆20May 10, 2019Updated 6 years ago
- A family of efficient speech models for multilingual phone recognition☆45Feb 12, 2026Updated 2 weeks ago
- Visual Speech Recognition☆19Dec 24, 2024Updated last year
- Dynamic memory networks in tensorflow with demo to visualize attention.☆22Mar 8, 2018Updated 7 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- Seq2Seq Chatbot with attention mechanism☆19Apr 27, 2017Updated 8 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Chinese Wordnet v.2☆22Aug 15, 2016Updated 9 years ago
- Search-based-Neural-Structured-Learning-for-Sequential-Question-Answering☆33Jun 12, 2023Updated 2 years ago
- Shanghainese TTS☆27Jul 30, 2023Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Implementation of the AoA model from the paper "Attention-over-Attention Neural Networks for Reading Comprehension"☆57Jun 28, 2017Updated 8 years ago
- Attention_CopyNet☆29Aug 18, 2016Updated 9 years ago
- Very Simple Question Answer System using Chinese Wikipedia Data☆24May 18, 2024Updated last year
- The official implementation of the DIFFA series for dLLM-based large audio language model☆59Feb 2, 2026Updated last month
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆28Mar 14, 2025Updated 11 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Jul 2, 2024Updated last year
- Implementation of VAE and Style-GAN Architecture Achieving State of the Art Reconstruction☆29Mar 24, 2023Updated 2 years ago
- Spoken Cantonese from Hong Kong.☆30Nov 12, 2025Updated 3 months ago