linto-ai / linto-stt
An automatic speech recognition API
☆60 Updated this week
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below
- On-device speaker diarization powered by deep learning ☆46 Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 Updated 3 months ago
- Tunable pipelines ☆34 Updated 3 months ago
- Various speech datasets made available to the public ☆118 Updated 5 months ago
- Open models for Coqui STT ☆139 Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆217 Updated this week
- Advanced data structures for handling temporal segments with attached labels. ☆113 Updated 3 months ago
- ☆103 Updated last week
- Create an LJSpeech structured voice dataset on wave input ☆30 Updated 8 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆83 Updated last year
- ☆40 Updated last year
- ONNX Inference of Pyannote Segmentation ☆90 Updated 5 months ago
- A curated list of awesome voice activity detection ☆54 Updated 6 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆127 Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆114 Updated 2 years ago
- Speaker diarization service ☆23 Updated last month
- Model for recasing and repunctuating ASR transcripts ☆133 Updated last year
- On-device noise suppression powered by deep learning ☆70 Updated last month
- ☆38 Updated 3 years ago
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 Updated 2 years ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆43 Updated 2 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆210 Updated 3 months ago
- ☆17 Updated 2 years ago
- Diarization scoring tools. ☆247 Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework ☆60 Updated 2 years ago
- ☆54 Updated last year
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆136 Updated 3 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆140 Updated 3 weeks ago
- Add n-gram and large language model support to Whisper models. ☆19 Updated last month
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆163 Updated 3 weeks ago