An automatic speech recognition API
☆84Jun 26, 2026Updated this week
Alternatives and similar repositories for linto-stt
Users that are interested in linto-stt are comparing it to the libraries listed below.
- Build a LinTO OS Image which boots on Raspberry Pi3☆14Jul 8, 2020Updated 5 years ago
- Speaker diarization service☆26Updated this week
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- phone inventory library☆17May 15, 2023Updated 3 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- enhan(t) is an open source toolkit which enables you to enhance the web experience of existing video conferencing solutions like Zoom, MS…☆15Apr 28, 2022Updated 4 years ago
- TTS for Singlish using Tacotron2, the IMDA corpus, and Pachyderm.☆11Jan 11, 2020Updated 6 years ago
- ☆17Jun 30, 2020Updated 6 years ago
- brainless concatenative text to speech☆16May 11, 2021Updated 5 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆17Apr 14, 2023Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆140Apr 10, 2024Updated 2 years ago
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Apr 10, 2025Updated last year
- This project contains translation of AFINN-165 keyword list (saved as AFINN-165-en.json) originally available at https://github.com/fniel…☆12Jan 6, 2017Updated 9 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 7 years ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆16Jun 27, 2023Updated 3 years ago
- Fine-tune the chain model based on the CVTE open-source model without training any GMM for frame alignment☆12Aug 6, 2020Updated 5 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 6 years ago
- homepage of DreamActor-M1☆64Jun 26, 2025Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16May 9, 2021Updated 5 years ago
- ☆22Jul 22, 2022Updated 3 years ago
- Target speaker automatic speech recognition (TS-ASR)☆14Oct 14, 2023Updated 2 years ago
- FreeSWITCH is a Software Defined Telecom Stack enabling the digital transformation from proprietary telecom switches to a versatile softw…☆31Jul 20, 2022Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆75Oct 9, 2020Updated 5 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆23Jul 26, 2021Updated 4 years ago
- mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras☆71Nov 20, 2017Updated 8 years ago
- All-in-one Speech Transcription☆11Jun 5, 2026Updated 3 weeks ago
- High accuracy code-switching whisper / qwen3 transcription☆39Jun 17, 2026Updated last week
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- Automatic Speech Recognition tool☆20Aug 5, 2023Updated 2 years ago
- Train punctuation and capitalization models for different languages☆26Apr 2, 2022Updated 4 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- MirasVoice is a data set consisting of speech samples from bilinguals to train neural network for optimization of speaker verification algor…☆19Mar 15, 2020Updated 6 years ago
- Evaluation of STT models for the German language☆15Jan 22, 2022Updated 4 years ago
- On-device speaker diarization powered by deep learning☆74Jun 22, 2026Updated last week
- ☆13May 23, 2024Updated 2 years ago
- OpenAI Whisper Prompt Examples☆53Jul 17, 2023Updated 2 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 4 months ago