linto-ai / linto-stt
An automatic speech recognition API
☆49 · Updated this week
Alternatives and similar repositories for linto-stt
Users who are interested in linto-stt are comparing it to the repositories listed below:
- On-device speaker diarization powered by deep learning · ☆38 · Updated last week
- Various speech datasets made available to the public · ☆113 · Updated 2 months ago
- ☆10 · Updated last week
- ☆17 · Updated last year
- Create an LJSpeech structured voice dataset on wave input · ☆26 · Updated 4 months ago
- Speaker diarization service · ☆21 · Updated this week
- Reproducible experimental protocols for multimedia (audio, video, text) database · ☆96 · Updated last week
- ☆43 · Updated 2 years ago
- ☆25 · Updated 2 years ago
- Crawling and creating a German language model resource · ☆19 · Updated 2 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent · ☆74 · Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC · ☆69 · Updated last year
- Tools to create your own voice dataset for TTS training · ☆66 · Updated 4 years ago
- On-device voice activity detection (VAD) powered by deep learning · ☆198 · Updated this week
- Open models for Coqui STT · ☆129 · Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory · ☆16 · Updated 5 years ago
- ☆39 · Updated last year
- Scripts to align a given wave to its transcription using trained models by Kaldi · ☆32 · Updated 5 years ago
- An even smaller speech recognizer / force aligner · ☆32 · Updated 2 months ago
- ☆80 · Updated 8 months ago
- Speaker diarization model · ☆23 · Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline · ☆11 · Updated last year
- Online streaming speaker change detection model in PyTorch · ☆38 · Updated last year
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… · ☆17 · Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language · ☆24 · Updated last week
- Coqui Inference Engine · ☆38 · Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages · ☆132 · Updated 10 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone · ☆36 · Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023 · ☆80 · Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines · ☆92 · Updated 9 months ago