linto-ai / linto-stt
An automatic speech recognition API
☆68Updated this week
Alternatives and similar repositories for linto-stt
Users who are interested in linto-stt are comparing it to the libraries listed below.
- On-device speaker diarization powered by deep learning☆53Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆75Updated 2 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆227Updated last week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- A Python package for deep multilingual punctuation prediction.☆130Updated last year
- ONNX Inference of Pyannote Segmentation☆92Updated 8 months ago
- Model for recasing and repunctuating ASR transcripts☆137Updated last year
- 🐸STT integration examples☆129Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- Real-Time Whisper Voice Recognition with Vosk model feedback.☆118Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 8 months ago
- Tunable pipelines☆36Updated 6 months ago
- Add n-gram and large language model (LLM) support to Whisper models.☆31Updated 3 months ago
- Create an LJSpeech-structured voice dataset from WAV input☆33Updated 10 months ago
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- C++ version of pyannote audio speaker diarization pipeline☆21Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- OpenVINO version of openai/whisper☆172Updated last year
- Various speech datasets made available to the public☆128Updated 8 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model.☆363Updated last year
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments☆102Updated 5 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Streaming speech-to-text server using Whisper☆94Updated 2 years ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆254Updated last year