ozdefir/finetuneas

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ozdefir/finetuneas)

ozdefir / finetuneas

An HTML interface for finetuning the sync map output from aeneas

☆53

Alternatives and similar repositories for finetuneas

Users that are interested in finetuneas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aolney / manual-subtitle-speech-alignment
View on GitHub
Postprocess SRT derived speech alignments for creating clean datasets for machine learning
☆17Jan 4, 2023Updated 3 years ago
readbeyond / lachesis
View on GitHub
lachesis automates the segmentation of a transcript into closed captions
☆35Jan 26, 2017Updated 9 years ago
vadimkantorov / inferspeech
View on GitHub
PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant
☆10Aug 12, 2019Updated 6 years ago
NTRLab / MediaSpeech
View on GitHub
☆22Jul 22, 2022Updated 3 years ago
carlfm01 / my-speech-datasets
View on GitHub
My public domain speech index
☆13Sep 19, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
readbeyond / aeneas
View on GitHub
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
☆2,852Jun 22, 2024Updated 2 years ago
egorsmkv / qirimtatar-tts-datasets
View on GitHub
Open Source Crimean Tatar Text-to-Speech datasets
☆14Feb 23, 2025Updated last year
klintan / swedish-asr-dataset
View on GitHub
Jupyter Notebooks for creating Speech datasets
☆46Mar 3, 2019Updated 7 years ago
domcross / german-stt-evaluation
View on GitHub
Evaluation of STT models for german language
☆16Jan 22, 2022Updated 4 years ago
thorstenMueller / cTTS
View on GitHub
TTS Client for Coqui TTS server
☆13Jan 7, 2023Updated 3 years ago
vliu15 / adversarial-tts
View on GitHub
End-to-end Text-to-Speech with Generative Adversarial Networks
☆20Feb 6, 2021Updated 5 years ago
morelen17 / tts-papers
View on GitHub
List of papers about TTS / Список статей о TTS
☆10Dec 16, 2017Updated 8 years ago
symblai / speech-recognition-evaluation
View on GitHub
Evaluate results from ASR/Speech-to-Text quickly
☆41Dec 28, 2021Updated 4 years ago
litrl / litrl_code
View on GitHub
litrl browser and detectors
☆10Oct 5, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago
mayukhnair / deepspeech-colab
View on GitHub
Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory
☆16Mar 18, 2019Updated 7 years ago
Idlak / Living-Audio-Dataset
View on GitHub
A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …
☆43Aug 3, 2022Updated 3 years ago
hyperaudio / hyperaudio
View on GitHub
☆14Mar 31, 2023Updated 3 years ago
bhuwanadhikari / covid19nepal-map
View on GitHub
👨‍💻 A simple app that shows the corona 😷😷 statistics of Nepal, district and province wise. https://covidmapnepal.web.app/
☆11May 3, 2021Updated 5 years ago
erogol / FFTNet
View on GitHub
FFTNet vocoder implementation
☆81Sep 28, 2018Updated 7 years ago
cschaefer26 / StyleMelGAN
View on GitHub
☆10Apr 8, 2024Updated 2 years ago
miccio-dk / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Apr 13, 2022Updated 4 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
View on GitHub
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
☆36Oct 4, 2019Updated 6 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
CSTR-Edinburgh / ophelia
View on GitHub
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆61Feb 2, 2023Updated 3 years ago
IBM / Train-Custom-Speech-Model
View on GitHub
Create a custom Watson Speech to Text model using specialized domain data
☆61Aug 31, 2021Updated 4 years ago
ryanrudes / YTTTS
View on GitHub
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53Apr 1, 2021Updated 5 years ago
averkij / Word-to-Number-Russian
View on GitHub
Проект для перевода чисел, записанных в текстовом виде на русском языке.
☆11Apr 5, 2022Updated 4 years ago
marytts / pavoque-data
View on GitHub
PAVOQUE Corpus of Expressive Speech
☆12Aug 2, 2016Updated 9 years ago
nicolalandro / train_coqui_tts_ita
View on GitHub
My guide to create an italian TTS with Coqui
☆14Feb 2, 2022Updated 4 years ago
collabnix / docker-cctv-raspbian
View on GitHub
Docker Image for Low-cost HD surveillance Camera Module on Raspberry Pi 3
☆22Jun 3, 2020Updated 6 years ago
MlWoo / WaveRNN-TF
View on GitHub
☆15Oct 11, 2019Updated 6 years ago
rishikksh20 / iSTFT-Avocodo-pytorch
View on GitHub
Ultrafast GAN based Vocoder for Text to Speech
☆50Jul 16, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jplusplus / skolstatistik
View on GitHub
A collection of datasets from Skolverket
☆11Sep 1, 2020Updated 5 years ago
sushant-t / tts-trainer
View on GitHub
Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…
☆30May 27, 2023Updated 3 years ago
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago
daanzu / kaldi-fork-active-grammar
View on GitHub
☆10Updated this week
kusha / voiceid
View on GitHub
Speaker recognition/identification system in Python. Python3 port.
☆14May 2, 2015Updated 11 years ago
diego-fustes / asr-rescoring
View on GitHub
Rescoring methods for end-to-end Automatic Speech Recognition
☆27Sep 23, 2020Updated 5 years ago
aildnont / HIFIS-model
View on GitHub
Machine learning models for prediction of chronic homelessness using the HIFIS Application.
☆19Jul 10, 2024Updated 2 years ago