clarinsi / Slovene_ASR_e2eLinks
Automatic Speech Recognition tool
☆20Updated 2 years ago
Alternatives and similar repositories for Slovene_ASR_e2e
Users that are interested in Slovene_ASR_e2e are comparing it to the libraries listed below
Sorting:
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
 - Forced alignment decoder for Whisper.☆14Updated last year
 - Using OpenVINO to speed up MeloTTS inference☆13Updated last year
 - Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆37Updated 9 months ago
 - (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 6 months ago
 - Zero-shot voice cloning text-to-speech (TTS) with explicit emotion class conditioning built on F5-TTS☆20Updated 2 months ago
 - Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆18Updated 6 months ago
 - Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
 - ☆17Updated 4 years ago
 - A composition of offline tools to achieve high quality multilingual speech to text transcription☆22Updated 2 months ago
 - Evaluation of STT models for german language☆15Updated 3 years ago
 - Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated last year
 - Official PyTorch implementation of "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-Speech Synthesis…☆16Updated 7 months ago
 - BBB plugin for automatic subtitles in conference calls☆29Updated 3 years ago
 - Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Updated last year
 - ☆11Updated last month
 - Deploy Kaldi models using grpc for bidirectional streaming.☆17Updated last year
 - ☆14Updated last year
 - S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 8 months ago
 - A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆15Updated 11 months ago
 - MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Updated 2 years ago
 - ☆11Updated 2 years ago
 - ☆54Updated 2 years ago
 - Hebrew grapheme to phoneme (G2P)☆72Updated 2 weeks ago
 - ☆15Updated 4 months ago
 - A simple, accessible and offline real-time transcription app for Android.☆12Updated last year
 - ☆11Updated 3 years ago
 - AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆20Updated 3 years ago
 - ☆19Updated 7 months ago
 - Python implementation of a few speech intelligibility prediction algorithms☆14Updated last year