linto-ai / linto-sttLinks
An automatic speech recognition API
☆63Updated this week
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below
Sorting:
- On-device speaker diarization powered by deep learning☆51Updated 3 weeks ago
- On-device voice activity detection (VAD) powered by deep learning☆219Updated last week
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆104Updated 5 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 7 months ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆168Updated last year
- A curated list of awesome voice activity detection☆59Updated 7 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆29Updated 2 months ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆21Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- OpenAI Whisper Prompt Examples☆52Updated 2 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆52Updated last month
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- Various speech datasets made available to the public☆123Updated 7 months ago
- Model for recasing and repunctuating ASR transcripts☆135Updated last year
- VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram☆251Updated 11 months ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Advanced data structures for handling temporal segments with attached labels.☆114Updated 5 months ago
- A non-native English corpus for pronunciation scoring task☆143Updated last year
- Gecko - A Tool for Effective Annotation of Human Conversations☆291Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆320Updated 8 months ago
- 🐸STT integration examples☆129Updated 2 years ago
- Tunable pipelines☆34Updated 4 months ago
- This repository creates speaker diarization recipes to be used within the egs folder of kaldi.☆17Updated 11 months ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆29Updated last year
- ☆40Updated last year