linto-ai / linto-sttLinks
An automatic speech recognition API
☆66Updated 2 weeks ago
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below
Sorting:
- On-device voice activity detection (VAD) powered by deep learning☆222Updated 2 weeks ago
- Model for recasing and repunctuating ASR transcripts☆136Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- On-device speaker diarization powered by deep learning☆52Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆106Updated 5 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Timething is a library for aligning text transcripts with their audio recordings.☆122Updated 8 months ago
- 🐸STT integration examples☆130Updated 2 years ago
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆92Updated 7 months ago
- Tunable pipelines☆35Updated 5 months ago
- Open models for Coqui STT☆142Updated 2 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Create an LJSpeech structured voice dataset on wave input☆33Updated 10 months ago
- A curated list of awesome voice activity detection☆59Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆117Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆128Updated 11 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆116Updated 2 years ago
- The human speaks a language with an accent. A particular accent necessarily reflects a person's linguistic background. The model defines …☆62Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆171Updated last month
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆423Updated 4 months ago
- Various speech datasets made available to the public☆126Updated 7 months ago
- Gecko - A Tool for Effective Annotation of Human Conversations☆293Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆31Updated 3 months ago
- openvino version of openai/whisper☆170Updated last year
- Tools to create your own voice dataset for TTS training☆67Updated 4 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.☆320Updated 8 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆248Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago