bunyaminergen / awesome-speech-dataset
Awesome Speech Dataset: a list of speech datasets, with download links and a brief explanation of each resource. The datasets provide diverse, high-quality speech data covering domains such as conversational, academic, and political speech.
☆26Jul 4, 2025Updated 7 months ago
Alternatives and similar repositories for awesome-speech-dataset
Users who are interested in awesome-speech-dataset are comparing it to the libraries listed below.
- This repository contains the official PyTorch implementation and pre-trained models for MR-RawNet.☆17Jun 12, 2024Updated last year
- DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast…☆52Updated this week
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…☆24Oct 8, 2025Updated 4 months ago
- Callytics is an advanced call analytics solution that leverages speech recognition and large language model (LLM) technologies to analy…☆77Apr 7, 2025Updated 10 months ago
- Speech to Phoneme, Bandwidth Extension and Speaker Verification using the Vibravox dataset.☆45Dec 1, 2025Updated 2 months ago
- SocksSharp provides support for Socks4/4a/5 proxy servers to HttpClient☆12Feb 3, 2021Updated 5 years ago
- A smart-casual LaTeX Beamer theme☆13Jan 21, 2025Updated last year
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 10 months ago
- ☆11Jan 31, 2025Updated last year
- ☆10Oct 9, 2025Updated 4 months ago
- Punch Out Model Synthesis - a program for constraint-based tiling generation☆18Feb 1, 2026Updated 2 weeks ago
- PASE: Phonologically Anchored Speech Enhancer☆37Dec 10, 2025Updated 2 months ago
- ☆22Jan 25, 2026Updated 3 weeks ago
- Name☆10Jul 21, 2017Updated 8 years ago
- ☆27Dec 31, 2025Updated last month
- A Solution Accelerator bringing together the latest AI agentic patterns and Azure services to automate the first line of review for docum…☆23Jan 17, 2025Updated last year
- Code repo for "S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal" (NTIRE workshop @ CVPR 2024)☆11Jun 15, 2024Updated last year
- Develop macOS apps on Windows with seamless cross-platform tools.☆15Jun 5, 2025Updated 8 months ago
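- [⚠️ WIP] ALMO is an extended Markdown parser and static site generator. It provides an execution environment that runs entirely in the browser using WebAssembly, making it possible to build pages that offer serverless sample-code execution environments and judge systems.☆16Oct 28, 2025Updated 3 months ago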
- Sample code to demonstrate how to get started with the .NET MAUI Community Toolkit DrawingView☆12Jan 2, 2023Updated 3 years ago
- A .NET class library to obfuscate and deobfuscate save files from the game No Man's Sky.☆10Feb 9, 2026Updated last week
- ⚡️Official Image-charts Python library☆12Updated this week
- A modularised, multi-threaded C# game engine based on the Entity Component System architecture. Uses OpenGL for rendering and contains ma…☆12May 12, 2024Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation☆15Aug 1, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- Tool for signing and countersigning iXBRL or other XML files☆12Mar 3, 2023Updated 2 years ago
- C# port of stb_dxt.h☆10Aug 9, 2020Updated 5 years ago
- REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR☆14Dec 11, 2024Updated last year
- UDIM Shader Material for Godot☆10Apr 2, 2024Updated last year
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Ono laboratory audio signal processing exercise for beginners.☆19May 10, 2023Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- To further the understanding of the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- ☆11Nov 2, 2024Updated last year
- ☆11Oct 15, 2022Updated 3 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- Efficient kernel for RMS normalization with fused operations, including both forward and backward passes and compatibility with PyTorch.☆12Jun 5, 2024Updated last year