Zoom Audio Transcription offline
β34Sep 30, 2020Updated 5 years ago
Alternatives and similar repositories for zoom_audio_transcribe
Users that are interested in zoom_audio_transcribe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ20Jul 21, 2020Updated 5 years ago
- π LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.β22Jul 12, 2019Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Mar 30, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- CMU multilingual speech repositoryβ30Apr 15, 2022Updated 4 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Zβ¦β32Apr 8, 2022Updated 4 years ago
- Unsupervised speech activity detection system.β11Jul 2, 2018Updated 7 years ago
- Benchmarking different VAD models on AVA-Speech datasetβ18May 21, 2023Updated 2 years ago
- pyMUSHRA is a python web application which hosts webMUSHRA experiments and collects the data with python.β48Apr 18, 2025Updated last year
- β14Jun 12, 2015Updated 10 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)β11Dec 4, 2023Updated 2 years ago
- β17Apr 14, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Evaluation of STT models for german languageβ15Jan 22, 2022Updated 4 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ35Feb 18, 2022Updated 4 years ago
- My solutions for "CSS Grid Garden," a game for learning CSS Grid layout.β13Nov 21, 2017Updated 8 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.β16Jul 22, 2021Updated 4 years ago
- an tutorial implement of voice conversion using pytorchβ34Mar 30, 2018Updated 8 years ago
- Multistream CNN for Robust Acoustic Modelingβ40Jun 17, 2021Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_ttsβ61Feb 2, 2023Updated 3 years ago
- β20Jul 22, 2022Updated 3 years ago
- Example workflow for our data-centric speech benchmarkβ17Jul 6, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterancesβ50Sep 16, 2024Updated last year
- Voice conversion training with 109 speakers with limited training samplesβ35Dec 21, 2020Updated 5 years ago
- Automatic Speech Recognition (ASR) system for the SamrΓ³mur speech corpus using Kaldiβ12Sep 30, 2022Updated 3 years ago
- β21Sep 24, 2018Updated 7 years ago
- Vim Speech Recognition Experimentsβ20May 30, 2025Updated 11 months ago
- Scripts for training Kaldi for German speech recognition (ASR).β27Feb 11, 2021Updated 5 years ago
- β32Nov 24, 2024Updated last year
- Analyzes signal, finds fundamental frequency, HNR etcβ15Aug 23, 2017Updated 8 years ago
- β36Sep 20, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Resources for "Simple Speech Representation Learning from Perceptual Data".β11Sep 18, 2023Updated 2 years ago
- Interface for Controllable Expressive Talking Machineβ40Sep 20, 2025Updated 7 months ago
- Simple JUCE sampler plugin which demonstrates the use of the freesound-juce API clientβ12Aug 26, 2020Updated 5 years ago
- Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speechβ22Mar 21, 2022Updated 4 years ago
- A GUI automation tool to export an Ableton Live set with the send effects printed to each stem.β14Feb 28, 2017Updated 9 years ago
- Detect emotion from audioβ14Nov 20, 2018Updated 7 years ago
- REST api for mozilla deepspeech voice recognition engineβ20Nov 1, 2021Updated 4 years ago