An automatic speech recognition API
☆81Mar 26, 2026Updated this week
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Build a LinTO OS Image which boots on Raspberry Pi3☆14Jul 8, 2020Updated 5 years ago
- Transcription and annotation interface for recorded audio or video files☆52Updated this week
- Tools for speech processing, keyword spotting☆17Mar 11, 2020Updated 6 years ago
- ☆18Jul 3, 2025Updated 8 months ago
- Speaker diarization service☆28Feb 24, 2026Updated last month
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 2 years ago
- Custom AppleScript libraries providing a variety of utilities☆17Sep 11, 2023Updated 2 years ago
- enhan(t) is an open source toolkit which enables you to enhance the web experience of existing video conferencing solutions like Zoom, MS…☆15Apr 28, 2022Updated 3 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- ☆17Jun 30, 2020Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- brainless concatenative text to speech☆14May 11, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Model for recasing and repunctuating ASR transcripts☆142Apr 10, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆20Apr 10, 2025Updated 11 months ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- joplin paperless file importer☆15Apr 25, 2023Updated 2 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 4 years ago
- High accuracy code-switching whisper / qwen3 transcription☆24Updated this week
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Page de préconfiguration de la communauté OpenLLM-France☆51Feb 2, 2024Updated 2 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…☆19Jun 14, 2021Updated 4 years ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆31Jul 20, 2022Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras☆71Nov 20, 2017Updated 8 years ago
- All-in-one Speech Transcription☆10Jan 25, 2026Updated 2 months ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- Automatic Speech Recognition tool☆20Aug 5, 2023Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- Train punctuation and capitalization models for different languages☆26Apr 2, 2022Updated 3 years ago
- MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 6 years ago