Top level code to transcribe English audio/video files into text/subtitles
☆21Jun 12, 2018Updated 7 years ago
Alternatives and similar repositories for srvk-eesen-offline-transcriber
Users that are interested in srvk-eesen-offline-transcriber are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- EESEN based offline transcriber VM using models trained on TEDLIUM and Cantab Research☆50Jun 4, 2019Updated 6 years ago
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- Demo WebApp using Kaldi DNN engine to convert speech to text☆11Jun 12, 2016Updated 9 years ago
- Text normalization scripts from IRISA lab☆14Jun 1, 2018Updated 7 years ago
- Offline transcription system for Estonian using Kaldi☆228Aug 4, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An open-source tool for automatic speech recognition ASR quality estimation.☆23Dec 12, 2019Updated 6 years ago
- Adapting your own Language Model for Kaldi☆63Jan 8, 2019Updated 7 years ago
- This repository is now obsolete. Please go to https://github.com/idlak/idlak instead.☆39Feb 26, 2018Updated 8 years ago
- Research_speech_speaker_verification_nist_sre2010☆12Mar 1, 2016Updated 10 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Sep 22, 2017Updated 8 years ago
- Visualize videos, groundtruth annotations, and predictions☆18Oct 17, 2022Updated 3 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- NWJS os x desktop based application that given a video/audio file returns a transcription using IBM Watson Speech to text API☆41Jan 9, 2017Updated 9 years ago
- Speech recognition using webrtc for FirefoxOS☆59Feb 10, 2014Updated 12 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Deep Learning for Speech Recogntion based on Theano☆15Jul 28, 2017Updated 8 years ago
- Person Re-ID framework in development during my PhD with the collaboration of Prof. Dr. William Robson Schwartz (UFMG)☆21Dec 13, 2016Updated 9 years ago
- DEPRECATED - A webapp for collecting speech samples for voice recognition testing and training☆20May 23, 2019Updated 6 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Web server to connect Kaldi speech recognizers to real-time web clients☆17Jul 9, 2014Updated 11 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Minute Meeting Bot☆19Mar 4, 2023Updated 3 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- ☆15Jan 24, 2017Updated 9 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 5 years ago
- A Python package for audio annotation and classifier training. Developed in collaboration with the WGBH Foundation and the American Archi…☆17Jun 2, 2018Updated 7 years ago
- Ontology of Snips NLU☆60Mar 8, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Part-of-Speech Tagging Models in Python☆16Oct 7, 2019Updated 6 years ago
- simple energy vad☆19Jun 3, 2017Updated 8 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- NNSVS向けの教師データのラベル作成支援ツールです。☆10Apr 5, 2023Updated 2 years ago