Using Kaldi (Automatic Speech Recognition) and Gentle (Forced Word Aligner), this script finds both rhymes and alliteration in speeches with matching audio and text.
☆13May 4, 2018Updated 7 years ago
Alternatives and similar repositories for Oral-Poetics-Detection
Users that are interested in Oral-Poetics-Detection are comparing it to the libraries listed below
Sorting:
- This is application for dysarthria to improve their pronunciation by using deep learning☆10Dec 29, 2020Updated 5 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- ☆13Apr 9, 2021Updated 4 years ago
- A python wrapper for kaldi-online-decoder using Cython☆12Sep 1, 2017Updated 8 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- python wrap for hts engine☆14Jan 30, 2018Updated 8 years ago
- Deploy Kaldi models using grpc for bidirectional streaming.☆17Sep 30, 2024Updated last year
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- Latest SRILM = 1.7☆18Apr 22, 2016Updated 9 years ago
- ☆25Jun 14, 2022Updated 3 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- SpeakerVoiceIdentifier can recognize the voice of a speaker by learning.☆35Feb 20, 2017Updated 9 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- Tacotron text to speech in C++(synthesize only)☆77Oct 17, 2019Updated 6 years ago
- 基于RNN、CNN、XGboost的问答系统意图识别模块☆35Jun 25, 2018Updated 7 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Computation Graph framework implemented using only NumPy☆10Mar 31, 2024Updated last year
- 基于kaldi的ios本地语音识别(本地实时流)Kaldi-based ios native speech recognition (local real-time streaming)☆74Sep 13, 2021Updated 4 years ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Jan 17, 2024Updated 2 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Jan 26, 2020Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- Tool to translate numbers to spanish strings, with tests!☆11Nov 16, 2016Updated 9 years ago
- Split ELAN Annotation Files and corresponding speech files into a corpus format for common ASR and Forced Aligners☆11Oct 15, 2018Updated 7 years ago
- EduAction is an educational content generation application powered by GenAI developed during the Encode Club AI Hackathon London 2024.☆12Mar 24, 2024Updated last year
- A voice spoofing detection system, based on paper presented at ICSPIS 2021☆10Feb 11, 2022Updated 4 years ago
- 在Android上运行人脸表情识别的tflite模型☆12Apr 7, 2021Updated 4 years ago
- A full featured tab component for Angular (2 and above, including 4).☆12Oct 21, 2021Updated 4 years ago
- Keep your public and secret information grounded in your browser!☆10Jan 6, 2023Updated 3 years ago
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Generalizable Cas9/sgRNA prediction models for multiple Cas9 variants☆10Jan 17, 2026Updated last month
- Document Scanner with OCR iOS app written in Swift☆10Nov 22, 2021Updated 4 years ago
- ☆40Aug 15, 2021Updated 4 years ago
- Separate vocals and accompaniment from audio file. Uses spleeter from Deezer.☆11Jan 27, 2026Updated last month
- HTML5 Piano Roll: MIDI file visualizer, player, editor☆14Sep 21, 2014Updated 11 years ago
- Learning Svelte by building a simple personal website☆10Dec 4, 2022Updated 3 years ago
- Classifies Emotion in Speech Signal☆10May 30, 2016Updated 9 years ago
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- an ML & deep learning algorithms/models to assess spoken English language proficiency +++ it transforms sounds/language in a 3-dimension …☆16Jun 11, 2019Updated 6 years ago
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago