linto-ai / linto-sttLinks
An automatic speech recognition API
☆71Updated last month
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below
Sorting:
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆151Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆124Updated 10 months ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆231Updated last month
- 🐸STT integration examples☆129Updated 3 years ago
- On-device noise suppression powered by deep learning☆75Updated 2 months ago
- On-device speaker diarization powered by deep learning☆56Updated 2 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆298Updated 2 years ago
- Open models for Coqui STT☆146Updated 2 years ago
- ☆43Updated last year
- Tunable pipelines☆40Updated last month
- A live speech recognition using Facebooks wav2vec 2.0 model.☆372Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆107Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆32Updated 5 months ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆326Updated 11 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- How to create your own model for vosk☆75Updated 4 years ago
- Simple diarization model☆52Updated 4 months ago
- ONNX Inference of Pyannote Segmentation☆94Updated 10 months ago
- Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.☆213Updated last year
- Various speech datasets made available to the public☆131Updated 10 months ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆34Updated 6 months ago
- A simple Python wrapper for audio noise reduction RNNoise. Simplifies work with it, adds new trained models and detailed instructions for…☆175Updated last year
- python wrapper for rnnoise library☆48Updated 2 years ago
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event …☆404Updated last year
- Model for recasing and repunctuating ASR transcripts☆140Updated last year
- Web app for keyword spotting using TensorflowJS☆74Updated 2 years ago