speechsuper / SpeechSuper-API-Samples
Deep learning based speech and pronunciation assessment API for 8 languages.
☆30 Updated 5 months ago
Related projects
Alternatives and complementary repositories for SpeechSuper-API-Samples
- A non-native English corpus for pronunciation scoring task ☆110 Updated 3 months ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment". ☆150 Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆84 Updated 6 months ago
- Timething is a library for aligning text transcripts with their audio recordings. ☆101 Updated 11 months ago
- Real-Time Whisper Voice Recognition with Vosk model feedback. ☆105 Updated last year
- Text to speech alignment using CTC forced alignment ☆130 Updated 2 weeks ago
- ☆65 Updated 3 weeks ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated 8 months ago
- ☆19 Updated 7 months ago
- This tool uses AI to evaluate your pronunciation. ☆151 Updated last year
- ONNX Inference of Pyannote Segmentation ☆65 Updated 2 months ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆114 Updated last year
- Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three differen… ☆205 Updated 2 years ago
- A testing repo to share code and thoughts on diarisation ☆51 Updated 7 months ago
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆54 Updated 5 months ago
- Community framework for training tortoise ☆38 Updated 2 years ago
- Flask webapp/endpoint that compares the user's speech with different accents and assigns similarity scores based on speed, voice (DTW/MFC… ☆16 Updated 7 years ago
- Application of MB-iSTFT-VITS components to vits2_pytorch ☆114 Updated this week
- 😎 Awesome lists about Speech Emotion Recognition ☆66 Updated last week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆253 Updated 2 months ago
- A simple Python package to easily use Meta's Massively Multilingual Speech (MMS) project ☆52 Updated last year
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆251 Updated last year
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to s… ☆27 Updated last year
- Python forced alignment ☆73 Updated 7 months ago
- Awesome TTS ☆54 Updated 3 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆153 Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆99 Updated last year
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆59 Updated this week
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event … ☆320 Updated 8 months ago