linto-ai / linto-stt
An automatic speech recognition API
☆54 · Updated last week
Alternatives and similar repositories for linto-stt:
Users who are interested in linto-stt are comparing it to the repositories listed below:
- On-device speaker diarization powered by deep learning ☆39 · Updated last week
- Model for recasing and repunctuating ASR transcripts ☆133 · Updated 11 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆98 · Updated last month
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 · Updated last year
- ☆39 · Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Updated 3 years ago
- Various speech datasets made available to the public ☆114 · Updated 3 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆146 · Updated 10 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆202 · Updated last week
- A curated list of awesome voice activity detection ☆44 · Updated 4 months ago
- ☆10 · Updated this week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆82 · Updated last year
- Open models for Coqui STT ☆134 · Updated last year
- 🐸STT integration examples ☆126 · Updated 2 years ago
- Tunable pipelines ☆31 · Updated last month
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆110 · Updated 2 years ago
- On-device noise suppression powered by deep learning ☆68 · Updated last week
- Automatically align transcribed audio and generate a wav2letter training corpus ☆36 · Updated last year
- ONNX Inference of Pyannote Segmentation ☆81 · Updated 3 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated 2 years ago
- Colab notebooks for Next-gen Kaldi ☆26 · Updated last month
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆68 · Updated 6 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆48 · Updated 8 months ago
- ☆78 · Updated this week
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆75 · Updated 3 years ago
- ONNX wrapper for ESPnet inference model ☆161 · Updated 5 months ago
- OpenAI Whisper Prompt Examples ☆52 · Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆204 · Updated last month
- Online streaming speaker change detection model in Pytorch ☆38 · Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆154 · Updated 2 weeks ago