bunyaminergen/awesome-speech-dataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bunyaminergen/awesome-speech-dataset)

bunyaminergen / awesome-speech-dataset

Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quality speech data covering various domains such as conversational, academic, political, and more.

☆28

Alternatives and similar repositories for awesome-speech-dataset

Users that are interested in awesome-speech-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qinxiaoyi / TimeVarying_ASV
View on GitHub
☆12Oct 17, 2024Updated last year
bunyaminergen / Callytics
View on GitHub
Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…
☆83Apr 7, 2025Updated last year
aranciokov / FSMMDA_VideoRetrieval
View on GitHub
☆10Nov 23, 2023Updated 2 years ago
kadirnar / fast-dacvae
View on GitHub
☆20Mar 17, 2026Updated 4 months ago
nikhilraghav29 / diarizen-tutorial
View on GitHub
DiariZen Explained: A Tutorial for the Open Source State-of-the-Art Speaker Diarization Pipeline.
☆22Apr 24, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Naozumi520 / g2pW-Cantonese
View on GitHub
Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW
☆15Dec 10, 2024Updated last year
sveinnpalsson / sourceseparation
View on GitHub
☆12Oct 9, 2025Updated 9 months ago
phate09 / SafeDRL
View on GitHub
Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"
☆11Aug 29, 2022Updated 3 years ago
IDEA-Emdoor-Lab / UniTTS
View on GitHub
A TTS Trained on Universal Audio.
☆41Jun 6, 2025Updated last year
hlt-mt / Speech-MASSIVE
View on GitHub
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25Oct 8, 2025Updated 9 months ago
abap34 / almo
View on GitHub
ALMOは拡張Markdownパーサ・静的サイトジェネレータです。WebAssemblyを使ってブラウザ上で完結する実行環境を提供し、サーバを必要としないサンプルコードの実行環境やジャッジシステムを提供するページの構築を可能にします。
☆16Apr 14, 2026Updated 3 months ago
kyawaway / kyasual
View on GitHub
A smart-casual LaTeX Beamer theme
☆14Jan 21, 2025Updated last year
kimho1wq / MR-RawNet
View on GitHub
This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.
☆17Jun 12, 2024Updated 2 years ago
onolab-tmu / asp-tutorial-2022
View on GitHub
Ono laboratory audio signal processing exercise for beginners.
☆19May 10, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
DMnBI / ViBE
View on GitHub
ViBE: a hierarchical BERT model to identify viruses using metagenome sequencing data
☆11Sep 6, 2022Updated 3 years ago
autumn-DL / SpeechSynthesisMeMe
View on GitHub
☆11Nov 2, 2024Updated last year
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
dodohow1011 / TS-VAD
View on GitHub
☆55Jan 15, 2021Updated 5 years ago
mengzaiqiao / awesome-natural-language-reasoning
View on GitHub
A collection of research papers related to Natural Language Reasoning
☆10May 27, 2022Updated 4 years ago
TUT-ARG / DCASE2016-baseline-system-matlab
View on GitHub
☆13Jan 10, 2017Updated 9 years ago
phanxuanphucnd / wav2kws
View on GitHub
Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.
☆13Jun 11, 2021Updated 5 years ago
llm-lab-org / CLASP
View on GitHub
CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval
☆13Jun 27, 2025Updated last year
v-iashin / VoxCeleb
View on GitHub
An attempt to replicate the results of [1706.08612] VoxCeleb: a large-scale speaker identification dataset
☆12Dec 11, 2019Updated 6 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 11 months ago
hyounghk / CoSIm
View on GitHub
Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohi…
☆16Nov 26, 2022Updated 3 years ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
kooBH / DSS
View on GitHub
[WIP]Direction based Multi-Channel Speech Separation
☆14Jan 25, 2024Updated 2 years ago
image-charts / python
View on GitHub
⚡️Official Image-charts Python library
☆12Jun 25, 2026Updated last month
andreasvlachos / arow_csc
View on GitHub
Cost-sensitive multiclass classification with Adaptive Regularization of Weights
☆16Sep 12, 2016Updated 9 years ago
takecy / vertx3-api-server
View on GitHub
RESTful API server template of Vert.x 3.x
☆13Oct 12, 2020Updated 5 years ago
amazon-science / adaptive-in-context-learning
View on GitHub
AdaICL: Which Examples to Annotate of In-Context Learning? Towards Effective and Efficient Selection
☆19Oct 30, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
JianmengYu / f4k
View on GitHub
Fish4Knowledge dataset cleaning, UOE 4th Year Honours Project.
☆11Jun 13, 2018Updated 8 years ago
tarun360 / SpeakerProfiling
View on GitHub
Estimating the Age, Height, and Gender of a speaker with their speech signal.
☆15Sep 19, 2022Updated 3 years ago
tli725 / JL-Corpus
View on GitHub
For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…
☆11Oct 29, 2018Updated 7 years ago
JaesungHuh / VoxSRC2022
View on GitHub
VoxSRC2022 workshop development kit
☆19Jul 21, 2022Updated 4 years ago
lijian-ml / CS373-Programming-a-Robotic-Car
View on GitHub
机器人人工智能，优达学城cs373作业。　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　　Artificial Intelligence for Robotics, this repository contains all the homework…
☆12Nov 12, 2017Updated 8 years ago
UKPLab / maps
View on GitHub
Multicultural Proverbs and Sayings
☆13Jan 11, 2025Updated last year
Helsinki-NLP / shroom
View on GitHub
☆12Jul 17, 2026Updated last week