dscripka/synthetic_speech_dataset_generation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dscripka/synthetic_speech_dataset_generation)

dscripka / synthetic_speech_dataset_generation

This repository contains text-to-speech (TTS) models and utilities designed produce synthetic training datasets for other speech-related models.

☆30

Alternatives and similar repositories for synthetic_speech_dataset_generation

Users that are interested in synthetic_speech_dataset_generation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

labmlai / dashboard
View on GitHub
Experiments dashboard for LabML
☆17Dec 11, 2022Updated 3 years ago
lugan113 / SynTTS-Commands-Official
View on GitHub
SynTTS-Commands is a large-scale, multilingual (English & Chinese) synthetic speech command dataset designed for low-power Keyword Spotti…
☆17Feb 5, 2026Updated 5 months ago
JanLunge / orbit
View on GitHub
A modular platform to build voice based LLM Assistants
☆13Dec 14, 2023Updated 2 years ago
Seeed-Studio / Seeed_Arduino_IR
View on GitHub
Library for receiving, decoding, and sending infrared signals using Arduino
☆10Jan 8, 2025Updated last year
customink / lambda-python-nltk-layer
View on GitHub
Lambda layer to enable using famous NLTK python package with AWS lambda
☆10Mar 21, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
patrickvonplaten / audio-gen-dreambooth
View on GitHub
☆23Jun 13, 2023Updated 3 years ago
mrusci / ondevice-learning-kws
View on GitHub
Test Framework for few-shot open set KWS
☆45Nov 8, 2024Updated last year
sandersn / nltk
View on GitHub
NLTK ported to Javascript
☆13Sep 3, 2017Updated 8 years ago
amrit-das / custom_image_classifier_pytorch
View on GitHub
PyTorch Based Image Classifier for image prediction and segregation https:/…
☆14Jan 21, 2020Updated 6 years ago
cpii-cai / PunCantonese
View on GitHub
A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts
☆15Dec 3, 2024Updated last year
voxeet / comms-sdk-cpp
View on GitHub
The Dolby.io Communications C++ SDK provides both Client and Server applications the ability to create HD voice and video for fully immer…
☆13Aug 30, 2024Updated last year
Celemony / ARA_Examples
View on GitHub
Examples demonstrating proper usage of the ARA Audio Random Access API
☆15Updated this week
DongKeon / webrtc-whisper-asr
View on GitHub
WebRTC-based real-time audio streaming with Faster Whisper ASR integration for live speech-to-text transcription.
☆13Sep 27, 2024Updated last year
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
drscotthawley / fad_pytorch
View on GitHub
Frechet Audio Distance evaluation in PyTorch
☆36Jun 9, 2023Updated 3 years ago
WangHelin1997 / Aty-TTS
View on GitHub
Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech
☆11May 14, 2025Updated last year
RoverRobotics-archive / ros1_roverpro_auto_dock
View on GitHub
This package allows you to autonomously dock any of the open rovers using just a camera.
☆19Dec 17, 2019Updated 6 years ago
maum-ai / sane-tts
View on GitHub
SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
☆11Jun 30, 2023Updated 3 years ago
freds0 / kabooks
View on GitHub
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…
☆13Mar 24, 2023Updated 3 years ago
may- / joeys2t
View on GitHub
Minimalist Speech-to-Text toolkit for educational purposes
☆13Feb 1, 2024Updated 2 years ago
DigitalPhonetics / VoicePAT
View on GitHub
VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.
☆59May 14, 2024Updated 2 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
andreyd41 / lux3-bot
View on GitHub
3rd Place Solution in Lux AI Season 3 (NeurIPS 2024) Competition
☆14Mar 16, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
L16H7 / lux-3-comets
View on GitHub
Multi-agent Reinforcement Learning, 14th in 701 teams - NeurIPS 2024 Competition
☆15Mar 13, 2025Updated last year
HolgerBovbjerg / data2vec-KWS
View on GitHub
This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…
☆32Mar 6, 2025Updated last year
TTS-Research / PEL-TTS
View on GitHub
☆14Aug 16, 2023Updated 2 years ago
Adventureeee / multi-modal-sentiment
View on GitHub
multi-modal sentiment
☆16Nov 19, 2024Updated last year
Seeed-Studio / Seeed_Arduino_mbedtls
View on GitHub
☆13Oct 18, 2024Updated last year
stanfordnlp / huggingface-models
View on GitHub
Scripts for pushing models to huggingface repos
☆15Updated this week
Seeed-Studio / Seeed_Arduino_atWiFi
View on GitHub
☆11Nov 20, 2020Updated 5 years ago
audiodemo / voice-conversion
View on GitHub
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17Aug 18, 2023Updated 2 years ago
SherifAbdulatif / CMGAN
View on GitHub
Conformer-based Metric GAN for speech enhancement
☆27May 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
chriskiefer / libcccrt
View on GitHub
A collection of signal analysis functions related to complexity, chaos and causality. Optimised for realtime signal processing.
☆18Jan 15, 2024Updated 2 years ago
noritsuna / JPEGEncoder4Cortex-M
View on GitHub
JPEG Encoder for Cortex-M
☆14Sep 2, 2016Updated 9 years ago
sanchit-gandhi / seq2seq-speech
View on GitHub
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆39Feb 23, 2023Updated 3 years ago
SubrataMaji / Chatbot-using-Deep-Learning
View on GitHub
Building a chatbot with bidirectional LSTM and attention mechanism with tensorflow and keras
☆13Oct 2, 2020Updated 5 years ago
zoenguyenramirez / arc-prize-2024
View on GitHub
☆21Feb 22, 2025Updated last year
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
Seeed-Studio / Seeed_Arduino_LvGL
View on GitHub
This library is a free and open-source graphics library that has a demo to tests the performance in various cases. For example rectangle,…
☆15Jan 8, 2025Updated last year