IPS-LMU / octra
OCTRA is a web-application for the orthographic transcription of audio files.
☆37 Updated this week
Alternatives and similar repositories for octra:
Users who are interested in octra are comparing it to the repositories listed below:
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated last year
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated last year
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆24 Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated last year
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox ☆13 Updated 6 years ago
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- An even smaller speech recognizer / force aligner ☆32 Updated last month
- OpenAI Whisper Prompt Examples ☆50 Updated last year
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data. ☆51 Updated 4 months ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech ☆25 Updated last year
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated last year
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 Updated 4 years ago
- An in-browser app for labeling audio clips at random, using Docker and Flask. ☆53 Updated 7 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- Forced Alignments for Common Voice ☆31 Updated 4 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆30 Updated last year
- Labeled data for homograph disambiguation ☆54 Updated last year
- phone inventory library ☆16 Updated last year
- A curated list of awesome voice activity detection ☆29 Updated 2 months ago
- Gentle and praatio scripts for easy forced alignment ☆18 Updated 2 years ago
- Code for AccentDB. ☆20 Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- ☆23 Updated 2 years ago
- Tools to create your own voice dataset for TTS training ☆65 Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆46 Updated 7 months ago