clarinsi / Slovene_ASR_e2e
Automatic Speech Recognition tool
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Slovene_ASR_e2e
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆12Updated 2 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆22Updated 3 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- brainless concatenative text to speech☆11Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆13Updated 2 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- proof of concept conversation orchestrator with a speech-language model☆13Updated 3 weeks ago
- ☆10Updated 2 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 7 months ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build.☆25Updated last year
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref…☆4Updated 2 months ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated last year
- [Early Alpha] A unified framework for text-to-speech, voice conversion, automatic speech recognition, audio classification, voice activit…☆21Updated 5 months ago
- Real-time end-to-end singing voice convertion☆18Updated last week
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆25Updated 3 years ago
- Aligner for text-to-speech☆15Updated 3 months ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆12Updated 6 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆22Updated 3 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean C…☆13Updated 10 months ago
- ☆19Updated 2 weeks ago
- ☆12Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- ☆22Updated 3 years ago
- zero-shot realtime TTS system, fully offline, free and open source☆14Updated this week
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆16Updated last year
- BurrMill core☆21Updated 3 years ago
- ☆12Updated 3 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆28Updated 6 months ago