linto-ai / linto-stt
An automatic speech recognition API
☆56Updated this week
Alternatives and similar repositories for linto-stt:
Users that are interested in linto-stt are comparing it to the libraries listed below
- On-device speaker diarization powered by deep learning☆43Updated last month
- On-device voice activity detection (VAD) powered by deep learning☆206Updated last week
- Model for recasing and repunctuating ASR transcripts☆133Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆111Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- A curated list of awesome voice activity detection☆48Updated 4 months ago
- ONNX Inference of Pyannote Segmentation☆84Updated 3 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Various speech datasets made available to the public☆116Updated 4 months ago
- Timething is a library for aligning text transcripts with their audio recordings.☆116Updated 4 months ago
- Diarization scoring tools.☆240Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆111Updated 2 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago
- C++ version of pyannote audio speaker diarizaiton pipeline☆20Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Speaker diarization service☆21Updated this week
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated last year
- Tunable pipelines☆32Updated last month
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆410Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆69Updated this week
- Variational Bayes HMM over x-vectors diarization☆268Updated last year
- Evaluate results from ASR/Speech-to-Text quickly☆37Updated 3 years ago
- Advanced data structures for handling temporal segments with attached labels.☆111Updated 2 months ago
- ☆39Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆135Updated last year
- An even smaller speech recognizer / force aligner☆32Updated 4 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆147Updated 11 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 9 months ago