neonbjb/ocotillo

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neonbjb/ocotillo)

neonbjb / ocotillo

Performant and accurate speech recognition built on Pytorch

☆253

Alternatives and similar repositories for ocotillo

Users that are interested in ocotillo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
neonbjb / DL-Art-School
View on GitHub
DLAS - A configuration-driven trainer for generative models
☆142Oct 11, 2022Updated 3 years ago
wetdog / wavenext_pytorch
View on GitHub
Unofficial implementation of wavenext vocoder
☆59Aug 28, 2024Updated last year
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,863Nov 19, 2024Updated last year
maum-ai / univnet
View on GitHub
Unofficial PyTorch Implementation of UnivNet Vocoder (https://arxiv.org/abs/2106.07889)
☆285Oct 8, 2021Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
neonbjb / pyfastmp3decoder
View on GitHub
A fast MP3 decoder for python, using minimp3
☆32Sep 20, 2022Updated 3 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
dunky11 / voicesmith
View on GitHub
[WIP] VoiceSmith makes training text to speech models easy.
☆231Oct 10, 2022Updated 3 years ago
xinjli / transphone
View on GitHub
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆174Jun 9, 2023Updated 3 years ago
p1an-lin-jung / wv_tts
View on GitHub
☆19Mar 22, 2024Updated 2 years ago
axelspringer / DeepPhonemizer
View on GitHub
Grapheme to phoneme conversion with deep learning.
☆428Dec 8, 2023Updated 2 years ago
youmebangbang / TTS-dataset-tools
View on GitHub
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…
☆52Apr 17, 2022Updated 4 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
ydqmkkx / Respiro-en
View on GitHub
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…
☆43Sep 18, 2024Updated last year
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆268Jan 13, 2025Updated last year
Tomiinek / Blizzard2013_Segmentation
View on GitHub
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
152334H / tortoise-tts-fast
View on GitHub
Fast TorToiSe inference (5x or your money back!)
☆827Jul 10, 2024Updated last year
talhanai / kaldi-diar-latte
View on GitHub
steps to perform text-based speaker diarization with kaldi toolkit
☆12Nov 2, 2018Updated 7 years ago
jamesparsloe / llm.speech
View on GitHub
Trying to build an all in one speech-text language model - a bit like GPT-4o
☆22Jun 1, 2024Updated 2 years ago
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
b04901014 / MQTTS
View on GitHub
☆260May 15, 2023Updated 3 years ago
152334H / DL-Art-School
View on GitHub
TorToiSe fine-tuning with DLAS
☆224Aug 1, 2024Updated last year
indri-voice / audiotoken
View on GitHub
Audio tokenization, in the fastest way possible!
☆54Aug 26, 2024Updated last year
huggingface / dataspeech
View on GitHub
☆401Sep 3, 2024Updated last year
lifeiteng / SoundStorm
View on GitHub
☆71Jul 13, 2023Updated 2 years ago
anonymous-pits / pits
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280Jul 16, 2023Updated 2 years ago
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
ZehuaKcrissLi / GTR-Voice
View on GitHub
☆16Nov 11, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
mechanicalsea / lighthubert
View on GitHub
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆73Sep 26, 2022Updated 3 years ago
Edresson / YourTTS
View on GitHub
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
☆1,057Nov 4, 2024Updated last year
MiscellaneousStuff / PhoneLM
View on GitHub
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48Sep 4, 2023Updated 2 years ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
keonlee9420 / Comprehensive-E2E-TTS
View on GitHub
A Non-Autoregressive End-to-End Text-to-Speech (text-to-wav), supporting a family of SOTA unsupervised duration modelings. This project g…
☆147Jun 6, 2022Updated 4 years ago
kaiidams / voice100
View on GitHub
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregr…
☆28Nov 23, 2023Updated 2 years ago