linto-ai/linto-stt

An automatic speech recognition API

☆84

Alternatives and similar repositories for linto-stt

Users that are interested in linto-stt are comparing it to the libraries listed below.

linto-ai / linto-os-generator
Build a LinTO OS Image which boots on Raspberry Pi3
☆14 · Jul 8, 2020 · Updated 6 years ago
linto-ai / linto-studio
Transcription and annotation interface for recorded audio or video files
☆58 · Updated this week
linto-ai / linto-diarization
Speaker diarization service
☆27 · Jul 2, 2026 · Updated 2 weeks ago
linto-ai / linto-desktoptools-hmg
GUI Tool to create, manage and test Keyword Spotting models using TF 2.0
☆13 · Feb 1, 2021 · Updated 5 years ago
OpenLLM-France / Lit-Claire
Continual pretraining of foundation LLM using ⚡ Lightning Fabric
☆37 · Nov 27, 2024 · Updated last year
linto-ai / WebVoiceSDK
Building blocks for voice-enabled applications in the browser
☆37 · Feb 5, 2026 · Updated 5 months ago
pilot7747 / VoxDIY
This repository provides data and code for the paper "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription".
☆16 · Jul 22, 2021 · Updated 4 years ago
OpenLLM-France / Lucie-Training
Code for continual pretraining of LUCIE
☆52 · Jun 2, 2026 · Updated last month
guokan-shang / ami-and-icsi-corpora
AMI and ICSI Corpora in JSON format.
☆38 · Sep 29, 2023 · Updated 2 years ago
alxmamaev / ultimate_tts
☆13 · Aug 7, 2021 · Updated 4 years ago
xinjli / phonepiece
phone inventory library
☆17 · May 15, 2023 · Updated 3 years ago
keplerlab / enhant
enhan(t) is an open source toolkit which enables you to enhance the web experience of existing video conferencing solutions like Zoom, MS…
☆15 · Apr 28, 2022 · Updated 4 years ago
PyThaiNLP / thai-g2p-wiktionary-corpus
Thai Grapheme to Phoneme (G2P) Wiktionary Corpus
☆13 · Jul 25, 2022 · Updated 3 years ago
sil-ai / tts-singlish
TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.
☆11 · Jan 11, 2020 · Updated 6 years ago
alea-institute / nupunkt
Next-generation Punkt sentence boundary detection with zero dependencies
☆32 · Nov 18, 2025 · Updated 8 months ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
alumae / streaming-punctuator
☆17 · Apr 14, 2023 · Updated 3 years ago
speechcatcher-asr / speechcatcher-data
☆11 · Sep 5, 2025 · Updated 10 months ago
benob / recasepunc
Model for recasing and repunctuating ASR transcripts
☆141 · Apr 10, 2024 · Updated 2 years ago
braze-inc / braze-expo-plugin
☆15 · Updated this week
zhu-han / SpeechLLM
LLM-based ASR recipe with Zipformer encoder and Qwen LLM
☆34 · Sep 25, 2025 · Updated 9 months ago
levtelyatnikov / radiomixer
radiomixer
☆14 · Feb 16, 2022 · Updated 4 years ago
ASLP-lab / Smart-Glass-Challenge
☆17 · Jun 16, 2026 · Updated last month
morrisalp / taatiknet
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
☆16 · Jun 27, 2023 · Updated 3 years ago
mozilla / murmur
DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training
☆20 · May 23, 2019 · Updated 7 years ago
mush42 / mantoq
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16 · Mar 15, 2025 · Updated last year
pigzach / MagicSpeechASR
magicspeech competition recipe
☆18 · Jun 29, 2020 · Updated 6 years ago
LAION-AI / scaled-echo-tts
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24 · Mar 29, 2026 · Updated 3 months ago
WhissleAI / PromptingNemo
All-in-one Speech Transcription
☆11 · Jun 5, 2026 · Updated last month
emirdemirel / DALI-TestSet4ALT
This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.
☆12 · Nov 30, 2021 · Updated 4 years ago
cyfer0618 / kaldi-pytorch-rnnlm
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12 · Jun 5, 2020 · Updated 6 years ago
SKaplanOfficial / AppleScript-Libraries
Custom AppleScript libraries providing a variety of utilities
☆18 · Sep 11, 2023 · Updated 2 years ago
ASLP-lab / M7-TTS
M7-TTS: A Mini-Scale Multilingual and Multi-Dialect Text-to-Speech Language Model with Mimi codec and Multi Token Prediction
☆20 · Mar 19, 2026 · Updated 4 months ago
SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11 · Sep 4, 2023 · Updated 2 years ago
KrishnaDN / E2E_ASR_Confidence_Estimation
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16 · May 9, 2021 · Updated 5 years ago
averkij / multipunct
Train punctuation and capitalization models for different languages
☆26 · Apr 2, 2022 · Updated 4 years ago
hi-paris / CosyVoice2-EU
Europeanized CosyVoice2 for French & German
☆17 · Mar 30, 2026 · Updated 3 months ago
NTRLab / MediaSpeech
☆22 · Jul 22, 2022 · Updated 3 years ago
clarinsi / Slovene_ASR_e2e
Automatic Speech Recognition tool
☆20 · Aug 5, 2023 · Updated 2 years ago