clarinsi / Slovene_ASR_e2eLinks
Automatic Speech Recognition tool
☆20Updated 2 years ago
Alternatives and similar repositories for Slovene_ASR_e2e
Users that are interested in Slovene_ASR_e2e are comparing it to the libraries listed below
Sorting:
- Whisper finetuning☆15Updated 9 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Updated 9 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Updated last year
- Crawling and creating a German language model resource☆18Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- BBB plugin for automatic subtitles in conference calls☆29Updated 3 years ago
- Forced alignment decoder for Whisper.☆14Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Updated 11 months ago
- All-in-one Speech Transcription☆10Updated this week
- Free Dutch voice dataset☆12Updated 5 years ago
- One command to start a streaming ASR server.☆12Updated last year
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆22Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Updated 2 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆19Updated 3 years ago
- ☆11Updated 4 months ago
- Wenet speech to text for react native☆10Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- ☆23Updated last month
- Pybind11 bindings for Kaldi☆15Updated 2 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Updated 9 months ago
- A simple, accessible and offline real-time transcription app for Android.☆14Updated last year
- This is the experimental description of MnTTS2.☆11Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Updated last year
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆38Updated last year
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- ☆13Updated last year
- Using OpenVINO to speed up MeloTTS inference☆15Updated last year
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Updated last year