bootphon / seshat
The Seshat audio annotation management platform
☆13Updated 3 years ago
Related projects: ⓘ
- A library to create and load tfrecord files as tf.data.Dataset☆9Updated 4 months ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆10Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆23Updated 3 years ago
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 5 years ago
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆25Updated last year
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…☆17Updated 9 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆35Updated last week
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Updated 4 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated last year
- Simple and clean Python implementation of TextRank as per seminal paper by Rada Mihalcea and Paul Tarau. This implementation performs bot…☆11Updated 3 years ago
- speech engine training projects☆28Updated 3 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆11Updated 11 months ago
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- NEAL (Nature+Energy Audio Labeller) is an open-source interactive audio data annotation tool.☆12Updated 2 weeks ago
- Featurize words into orthographic and phonological vectors.☆39Updated last year
- Collaborative audio annotation tool☆17Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆31Updated 2 years ago
- Scansion tool for Spanish texts☆10Updated 9 months ago
- OPUS (opus.nlpl.eu) Python3 API☆14Updated this week
- Gamma Agreement in Python☆43Updated 6 months ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- Python module for syllabifying English ARPABET transcriptions☆63Updated 5 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated last year
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆15Updated last week
- 🐍 Coqui's machine learning job scheduler☆31Updated 3 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- Evaluation of STT models for german language☆15Updated 2 years ago