harvard-edge/dataperf-speech-example

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/harvard-edge/dataperf-speech-example)

harvard-edge / dataperf-speech-example

Example workflow for our data-centric speech benchmark

☆17

Alternatives and similar repositories for dataperf-speech-example

Users that are interested in dataperf-speech-example are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
mikex86 / DeepSpeech-Java-Bindings
View on GitHub
Java Bindings for the C++ library DeepSpeech
☆10Jun 4, 2020Updated 6 years ago
sarahjuan / iban
View on GitHub
☆14Jun 12, 2015Updated 11 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
cleanlab / cleanlab-codex
View on GitHub
Python client to integrate Cleanlab Codex with your AI Agent
☆19Nov 19, 2025Updated 8 months ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 4 years ago
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
german-asr / kaldi-german
View on GitHub
Scripts for training Kaldi for German speech recognition (ASR).
☆27Feb 11, 2021Updated 5 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
artie-inc / artie-bias-corpus
View on GitHub
Artie Bias Corpus: an audio corpus + code for detecting demographic bias
☆20Jul 21, 2020Updated 6 years ago
cnlinxi / speech_emotion
View on GitHub
Detect emotion from audio
☆14Nov 20, 2018Updated 7 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
zelo / deepspeech-rest-api
View on GitHub
REST api for mozilla deepspeech voice recognition engine
☆20Nov 1, 2021Updated 4 years ago
rmarcacini / ser-coraa-pt-br
View on GitHub
Emotion Recognition from Brazilian Portuguese Informal Spontaneous Speech
☆22Mar 21, 2022Updated 4 years ago
miras-tech / MirasVoice
View on GitHub
MirasVoice is a data set consisting speech samples from bilinguals to train neural network for optimization of speaker verification algor…
☆19Mar 15, 2020Updated 6 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
speechio / asr-noises
View on GitHub
A handy dataset of noises for ASR
☆22May 29, 2019Updated 7 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
revdotcom / words2num
View on GitHub
Convert words to numbers
☆21Apr 13, 2022Updated 4 years ago
vadimkantorov / readaudio
View on GitHub
Read audio with FFmpeg into NumPy/PyTorch via ctypes (standard library module)
☆11Aug 12, 2020Updated 5 years ago
ruslan-corpus / ruslan-corpus.github.io
View on GitHub
☆22Aug 29, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kbabilinski / deep-speech-unity
View on GitHub
A Unity implementation of DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on …
☆26Sep 22, 2022Updated 3 years ago
Digital-Umuganda / text_normalization_tts_rw
View on GitHub
☆11Apr 24, 2024Updated 2 years ago
besacier / mboshi-french-parallel-corpus
View on GitHub
☆23Apr 8, 2022Updated 4 years ago
jqhoogland / remark-tangle
View on GitHub
A remark plugin for making interactive markdown documents with Tangle.
☆13Oct 25, 2021Updated 4 years ago
ooshyun / Speech-Enhancement-Pytorch
View on GitHub
Pytorch Models for Speech Enhancement
☆23Mar 31, 2023Updated 3 years ago
wavlab-speech / cmu_multilingual_speech
View on GitHub
CMU multilingual speech repository
☆30Apr 15, 2022Updated 4 years ago
jdvala / zoom_audio_transcribe
View on GitHub
Zoom Audio Transcription offline
☆34Sep 30, 2020Updated 5 years ago
Yangyangii / TPGST-Tacotron
View on GitHub
Google's TPGST reimplementation.
☆34Dec 11, 2019Updated 6 years ago
Picovoice / text-to-speech-benchmark
View on GitHub
Text-to-Speech Benchmark
☆26Apr 2, 2026Updated 3 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
dodgejesse / show_your_work
View on GitHub
☆11Jan 21, 2020Updated 6 years ago
JmlrOrg / dmlr-style-file
View on GitHub
☆12Nov 21, 2023Updated 2 years ago
falabrasil / ufpalign
View on GitHub
👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro
☆13Jul 18, 2025Updated last year
interactiveaudiolab / voogle
View on GitHub
This is code for an audio search engine that uses vocal imitations of the desired sound
☆38May 16, 2023Updated 3 years ago
arda-num / SFSRNet
View on GitHub
Reproduction of the paper SFSRNet: Super-resolution for single-channel Audio Source Separation by me (@arda-num) and @dritx16. Navigate P…
☆12Jul 7, 2022Updated 4 years ago
VITA-Group / Audio-Lottery
View on GitHub
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32Apr 8, 2022Updated 4 years ago
sony / evsCluster
View on GitHub
Python scripts to process EVS (Event-based vision sensor) data
☆12Jan 30, 2024Updated 2 years ago