ozdefir / finetuneasView external linksLinks
An HTML interface for finetuning the sync map output from aeneas
☆53Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for finetuneas
Users that are interested in finetuneas are comparing it to the libraries listed below
Sorting:
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Jan 4, 2023Updated 3 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- My public domain speech index☆13Sep 19, 2019Updated 6 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated 11 months ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- FFTNet vocoder implementation☆81Sep 28, 2018Updated 7 years ago
- lachesis automates the segmentation of a transcript into closed captions☆35Jan 26, 2017Updated 9 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)☆2,804Jun 22, 2024Updated last year
- ☆32Jul 27, 2022Updated 3 years ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Speaker recognition/identification system in Python. Python3 port.☆14May 2, 2015Updated 10 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Dippy Synthetic Speech Subnet☆17Sep 11, 2025Updated 5 months ago
- qup: a Single-Node Job Scheduler with NVIDIA GPU support☆15Jan 10, 2023Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- ☆14Mar 31, 2023Updated 2 years ago
- My guide to create an italian TTS with Coqui☆14Feb 2, 2022Updated 4 years ago
- ☆15Oct 11, 2019Updated 6 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Jan 2, 2020Updated 6 years ago
- Examples of cleaning up raw voices☆18Mar 2, 2022Updated 3 years ago
- JSON schema and JavaScript model classes for dealing with time-aligned transcripts of speech.☆16Aug 20, 2018Updated 7 years ago
- Example workflow for our data-centric speech benchmark☆17Jul 6, 2023Updated 2 years ago
- ☆37May 8, 2021Updated 4 years ago
- Text-based media editing interface☆16Aug 9, 2017Updated 8 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 6 years ago
- Docker Image for Low-cost HD surveillance Camera Module on Raspberry Pi 3☆21Jun 3, 2020Updated 5 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Jul 21, 2020Updated 5 years ago