daanzu / speech-training-recorderLinks
Simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition or speech synthesis.
☆41Updated 3 years ago
Alternatives and similar repositories for speech-training-recorder
Users that are interested in speech-training-recorder are comparing it to the libraries listed below
Sorting:
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- a python library for speech enhancement☆80Updated 11 months ago
- Kaldi API for Android, Python and Node. Forked from vosk-api with minimal modifications.☆16Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning☆217Updated this week
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆29Updated 11 months ago
- python wrapper for rnnoise library☆48Updated 2 years ago
- An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)☆123Updated 10 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- An unofficial pytorch implementation of "STREAMVC: REAL-TIME LOW-LATENCY VOICE CONVERSION".☆67Updated last month
- 🐸STT integration examples☆128Updated 2 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆23Updated 2 months ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- An online speech recognition extension toolkit of Kaldi☆56Updated 3 years ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆127Updated 6 months ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆62Updated 2 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆21Updated 6 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.☆14Updated 5 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Filtering and Noise Adding Tool☆29Updated 3 years ago
- Keyword Search Recipe for Subword ASR☆30Updated 5 years ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated 2 years ago
- ☆12Updated 4 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- wake word spotting with kaldi☆19Updated 4 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies☆14Updated 6 months ago
- ☆33Updated 3 years ago