linto-ai / linto-stt
An automatic speech recognition API
☆73 Updated this week
Alternatives and similar repositories for linto-stt
Users who are interested in linto-stt are comparing it to the libraries listed below
- Simplified diarization pipeline using pretrained models: audio file to diarized segments in a few lines of code ☆152 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆233 Updated last month
- On-device noise suppression powered by deep learning ☆76 Updated 3 months ago
- Model for recasing and repunctuating ASR transcripts ☆141 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆107 Updated last month
- On-device speaker diarization powered by deep learning ☆57 Updated 3 months ago
- A curated list of awesome voice activity detection resources ☆68 Updated 11 months ago
- Real-time Whisper voice recognition with Vosk model feedback ☆119 Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆119 Updated 2 years ago
- 🐸STT integration examples ☆129 Updated 3 years ago
- Timething is a library for aligning text transcripts with their audio recordings ☆125 Updated 11 months ago
- Open models for Coqui STT ☆146 Updated 2 years ago
- ONNX inference of pyannote segmentation ☆95 Updated 10 months ago
- ☆43 Updated last year
- A model that predicts the punctuation of English, Italian, French, and German texts ☆81 Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels ☆122 Updated last month
- A tokenizer, text cleaner, and phonemizer for many human languages ☆328 Updated last year
- Tunable pipelines ☆40 Updated 2 months ago
- Experiments to test different speech recognition systems for the SEPIA Framework ☆63 Updated 2 years ago
- Deep learning: one-shot learning for speaker recognition using filter banks ☆170 Updated last year
- Live speech recognition using Facebook's wav2vec 2.0 model ☆373 Updated last year
- Mirror of hf.co/pyannote/speaker-diarization-3.1 ☆27 Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram ☆260 Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published at Interspeech 2023 ☆91 Updated 2 years ago
- Speaker diarization for the Hindi language ☆52 Updated 4 years ago
- OpenAI Whisper prompt examples ☆52 Updated 2 years ago
- Speaker diarization system using an LSTM ☆50 Updated 2 years ago
- OpenVINO version of openai/whisper ☆176 Updated 2 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆69 Updated 2 years ago
- Universal multilingual automatic speech transcription into IPA ☆70 Updated 8 months ago