linto-ai / linto-stt
An automatic speech recognition API
☆76 · Updated 2 weeks ago
Alternatives and similar repositories for linto-stt
Users interested in linto-stt are comparing it to the libraries listed below.
- Tunable pipelines ☆40 · Updated 2 months ago
- On-device noise suppression powered by deep learning ☆77 · Updated last week
- On-device voice activity detection (VAD) powered by deep learning ☆233 · Updated 2 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆107 · Updated 2 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆153 · Updated last year
- A curated list of awesome voice activity detection ☆69 · Updated last year
- Timething is a library for aligning text transcripts with their audio recordings. ☆126 · Updated last year
- On-device speaker diarization powered by deep learning ☆57 · Updated last week
- Model for recasing and repunctuating ASR transcripts ☆142 · Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 · Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations ☆298 · Updated this week
- ☆44 · Updated last year
- Deep Learning - one shot learning for speaker recognition using Filter Banks ☆170 · Updated last year
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines … ☆62 · Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 · Updated 2 years ago
- 🐸STT integration examples ☆129 · Updated 3 years ago
- Live speech recognition using Facebook's wav2vec 2.0 model. ☆375 · Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆119 · Updated 2 years ago
- Open models for Coqui STT ☆148 · Updated 2 years ago
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆28 · Updated last year
- Goodness of Pronunciation using Kaldi on Epa-DB database ☆35 · Updated last year
- Advanced data structures for handling temporal segments with attached labels. ☆122 · Updated 2 months ago
- Various speech datasets made available to the public ☆129 · Updated 11 months ago
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆265 · Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆107 · Updated 2 years ago
- ONNX Inference of Pyannote Segmentation ☆97 · Updated 11 months ago
- Speaker diarization system in Python based on binary key speaker modelling ☆60 · Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆328 · Updated last year
- A non-native English corpus for the pronunciation scoring task ☆161 · Updated last month
- ☆362 · Updated last month