IPS-LMU / octra
OCTRA is a web-application for the orthographic transcription of audio files.
β39Updated 2 weeks ago
Alternatives and similar repositories for octra
Users that are interested in octra are comparing it to the libraries listed below
Sorting:
- A free & open tool for transcribing audio interviews with offline ASR supportβ24Updated last year
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- β11Updated 9 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Pythonβ18Updated last year
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) paβ¦β17Updated 10 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ33Updated this week
- πΉ pyannote + π notebook = pyannotebookβ26Updated last year
- An even smaller speech recognizer / force alignerβ32Updated 4 months ago
- Unicode Standard tokenization routines and orthography profile segmentationβ37Updated 2 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- Uses machine learning to denoise audio containing speechβ33Updated 10 months ago
- Gentle and praatio scripts for easy forced alignmentβ18Updated 2 years ago
- Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".β27Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ20Updated 2 years ago
- Easily turn large sets of audio urls to an audio dataset.β21Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languagesβ13Updated 2 years ago
- Voice activity detection and speaker gender segmentation audiovisual corpusβ13Updated 3 months ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audioβ35Updated 3 weeks ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarityβ12Updated 5 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- β13Updated 2 months ago