IPS-LMU / octra
OCTRA is a web application for the orthographic transcription of audio files.
☆38 Updated last week
Alternatives and similar repositories for octra:
Users who are interested in octra are comparing it to the repositories listed below.
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated last year
- An in-browser app for labeling audio clips at random, using Docker and Flask. ☆53 Updated 7 years ago
- The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data. ☆51 Updated 6 months ago
- automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated last year
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- Simple text to phonemes converter for multiple languages ☆20 Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 Updated last year
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox". ☆27 Updated 3 years ago
- Simple PyTorch Denoisers for Waveform Audio ☆34 Updated 3 weeks ago
- Gentle and praatio scripts for easy forced alignment ☆18 Updated 2 years ago
- An even smaller speech recognizer / force aligner ☆32 Updated 3 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ☆25 Updated last month
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- Breaks a word into syllables using an LSTM-based neural network. ☆19 Updated last year
- 🫠 check your data, before you wreck your model ☆16 Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. A web API built with the… ☆47 Updated 2 years ago
- Web app to annotate word onsets and offsets on spectrograms ☆28 Updated 2 years ago
- Python library for audio augmentation ☆83 Updated last year
- Interface for using TTS and vocoder models in the form of a text editor ☆20 Updated 2 years ago
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated last year
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago