Awesome Speech Dataset: a list of speech datasets with download links and a brief explanation of each resource. The datasets provide diverse, high-quality speech data covering domains such as conversational, academic, and political speech.
☆26Jul 4, 2025Updated 8 months ago
Alternatives and similar repositories for awesome-speech-dataset
Users who are interested in awesome-speech-dataset are comparing it to the repositories listed below.
- This repository contains the official PyTorch implementation and pre-trained models for MR-RawNet.☆17Jun 12, 2024Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆53Updated this week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 5 months ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language model (LLM) technologies to analy…☆78Apr 7, 2025Updated 11 months ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆46Dec 1, 2025Updated 3 months ago
- Punch Out Model Synthesis - a program for constraint based tiling generation☆19Feb 1, 2026Updated last month
- A custom n8n node for integrating with ikas e-commerce platform. This node enables seamless automation workflows between ikas and other s…☆20Oct 12, 2025Updated 4 months ago
- ☆11Updated this week
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- ☆10Oct 9, 2025Updated 5 months ago
- Name☆10Jul 21, 2017Updated 8 years ago
- [NeurIPS'22] PyTorch library to compare similarity between NN representations☆13Feb 27, 2025Updated last year
- ☆23Jan 25, 2026Updated last month
- SocksSharp provides support for Socks4/4a/5 proxy servers to HttpClient☆12Feb 3, 2021Updated 5 years ago
- A Solution Accelerator bringing together the latest AI agentic patterns and Azure services to automate the first line of review for docum…☆23Jan 17, 2025Updated last year
- Code repo for "S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal" (NTIRE workshop @ CVPR 2024)☆11Jun 15, 2024Updated last year
- Zero-alloc structured logging for .NET - fast formatters, rich terminal visuals, production-ready file & JSON sinks.☆49Mar 1, 2026Updated last week
- Tool for signing and countersigning iXBRL or other XML files☆12Mar 3, 2023Updated 3 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- A .NET class library to obfuscate and deobfuscate save files from the game No Man's Sky.☆10Feb 9, 2026Updated last month
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- BibTeX style file for the Journal of the Acoustical Society of Japan☆11Jan 24, 2022Updated 4 years ago
- C# port of stb_dxt.h☆10Aug 9, 2020Updated 5 years ago
- UDIM Shader Material for Godot☆11Apr 2, 2024Updated last year
- ⚡️Official Image-charts Python library☆12Updated this week
- Sample code to demonstrate how to get started with the .NET MAUI Community Toolkit DrawingView☆12Jan 2, 2023Updated 3 years ago
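- [⚠️ WIP] ALMO is an extended Markdown parser and static site generator. It uses WebAssembly to provide an execution environment that runs entirely in the browser, making it possible to build pages that offer serverless sample-code execution and judge systems.☆16Feb 28, 2026Updated last week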
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- ☆11Oct 15, 2022Updated 3 years ago
- A modularised, multi-threaded C# game engine based on the Entity Component System architecture. Uses OpenGL for rendering and contains ma…☆12May 12, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆14Dec 11, 2024Updated last year
- Efficient kernel for RMS normalization with fused operations; includes both forward and backward passes and is compatible with PyTorch.☆12Jun 5, 2024Updated last year
- ☆28Dec 31, 2025Updated 2 months ago
- ☆11Nov 2, 2024Updated last year
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago