linto-ai / linto-sttLinks
An automatic speech recognition API
☆69Updated 3 weeks ago
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below
Sorting:
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆150Updated last year
- Model for recasing and repunctuating ASR transcripts☆138Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆228Updated last month
- On-device speaker diarization powered by deep learning☆53Updated last month
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 9 months ago
- 🐸STT integration examples☆129Updated 2 years ago
- Tools to create your own voice dataset for TTS training☆68Updated 4 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated this week
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆118Updated 2 years ago
- openvino version of openai/whisper☆175Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- ☆124Updated last month
- Create an LJSpeech structured voice dataset on wave input☆34Updated 11 months ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆325Updated 10 months ago
- Open models for Coqui STT☆141Updated 2 years ago
- A live speech recognition using Facebooks wav2vec 2.0 model.☆364Updated last year
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- Goodness of Pronunciation using Kaldi on Epa-DB database☆35Updated last year
- Text to speech alignment using CTC forced alignment☆354Updated last month
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- On-device noise suppression powered by deep learning☆74Updated last month
- Gecko - A Tool for Effective Annotation of Human Conversations☆297Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆169Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆234Updated 3 weeks ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆174Updated this week
- A python package for deep multilingual punctuation prediction.☆131Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆251Updated last year