hollygrimm/voice-dataset-creation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hollygrimm/voice-dataset-creation)

hollygrimm / voice-dataset-creation

Community-controlled voice data collection for language preservation and AI development. Companion to 'AI Techniques for Indigenous Cultural Expression' in Envisioning Indigenous Methods in Digital Media and Ecologies.

☆71

Alternatives and similar repositories for voice-dataset-creation

Users that are interested in voice-dataset-creation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

padmalcom / ttsdatasetcreator
View on GitHub
☆23Sep 30, 2025Updated 9 months ago
babua / TTSDatasetRecorder
View on GitHub
A simple app for recording speech datasets.
☆26Jun 27, 2022Updated 4 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
janetacarr / masala
View on GitHub
Currying in Clojure for fun and learning.
☆11Feb 14, 2024Updated 2 years ago
thepowerfuldeez / rvc-trainer
View on GitHub
☆12Mar 28, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆35Jul 31, 2024Updated last year
Edresson / SC-GlowTTS
View on GitHub
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆107Sep 10, 2021Updated 4 years ago
Tomiinek / Blizzard2013_Segmentation
View on GitHub
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
dunky11 / voicesmith
View on GitHub
[WIP] VoiceSmith makes training text to speech models easy.
☆231Oct 10, 2022Updated 3 years ago
georgid / lakh_vocal_segments_dataset
View on GitHub
singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
☆20Dec 30, 2019Updated 6 years ago
nipponjo / arabic-vocalization
View on GitHub
Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch
☆15May 30, 2023Updated 3 years ago
JacobLinCool / zero-rvc
View on GitHub
Run Retrieval-based Voice Conversion training and inference with ease.
☆12Jan 24, 2025Updated last year
dynilib / dynitag
View on GitHub
Collaborative audio annotation tool
☆17Sep 16, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ORI-Muchim / AudioSR-Upsampling
View on GitHub
AudioSR-Upsampling (any -> 48kHz)
☆42Feb 13, 2024Updated 2 years ago
dipjyoti92 / speaker_embeddings_GE2E
View on GitHub
PyTorch Implementation of Generalized End-to-End Loss for Speaker Verification
☆28Jan 23, 2021Updated 5 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
Scarfmonster / HiFiPLN
View on GitHub
Multispeaker Community Vocoder Model for DiffSinger
☆39Aug 11, 2025Updated 11 months ago
LAION-AI / Text-to-speech
View on GitHub
☆61Nov 4, 2023Updated 2 years ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
youmebangbang / TTS-dataset-tools
View on GitHub
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…
☆52Apr 17, 2022Updated 4 years ago
danklabs / tts_dataset_maker
View on GitHub
A gui to help make a text to speech dataset.
☆18Dec 10, 2022Updated 3 years ago
shartoo / BeADataScientist
View on GitHub
BeADataScientist
☆13Sep 4, 2020Updated 5 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
rishikksh20 / TFGAN
View on GitHub
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
☆88Feb 23, 2021Updated 5 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
rhoposit / multilingual_VQVAE
View on GitHub
☆37May 8, 2021Updated 5 years ago
YatingMusic / ddsp-singing-vocoders
View on GitHub
Official implementation of SawSing (ISMIR'22)
☆275Aug 28, 2022Updated 3 years ago
gumblex / whisper_vad
View on GitHub
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆22Nov 3, 2024Updated last year
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
Kikyo-16 / airgen
View on GitHub
Official source codes of airsep
☆39Mar 26, 2024Updated 2 years ago
JazminVidal / gop-pykaldi
View on GitHub
Goodness of Pronunciation algorithm using PyKaldi
☆18Jun 12, 2022Updated 4 years ago
cpdu / vallt
View on GitHub
☆36Mar 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hs-oh-prml / DurFlexEVC
View on GitHub
☆81Jan 22, 2025Updated last year
jakeoneijk / FlashSR_Inference
View on GitHub
☆78Jan 25, 2025Updated last year
thu-spmi / CTC-TTS
View on GitHub
Code for CTC-TTS: LLM-based dual-streaming text-to-speech with CTC alignment, Interspeech 2026.
☆20Jun 9, 2026Updated last month
lukassteinwender / avatair
View on GitHub
A pipeline to generate user-preferred photo-realistic avatars using stable-diffusion and bayesian-optimization.
☆18Jun 12, 2026Updated last month
ShovalMessica / NAST
View on GitHub
Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…
☆46Jul 2, 2024Updated 2 years ago
andrewsilva9 / tune_tortoise_autoregressor
View on GitHub
Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.
☆15Nov 25, 2023Updated 2 years ago
Bli-AIk / souprune
View on GitHub
A distinctive modern framework designed specifically for RPG & STG games like Deltarune and Undertale.
☆17May 27, 2026Updated last month