An HTML interface for finetuning the sync map output from aeneas
☆53Jul 5, 2022Updated 3 years ago
Alternatives and similar repositories for finetuneas
Users that are interested in finetuneas are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Postprocess SRT derived speech alignments for creating clean datasets for machine learning☆17Jan 4, 2023Updated 3 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant☆10Aug 12, 2019Updated 6 years ago
- ☆20Jul 22, 2022Updated 3 years ago
- My public domain speech index☆13Sep 19, 2019Updated 6 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)☆2,834Jun 22, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Open Source Crimean Tatar Text-to-Speech datasets☆14Feb 23, 2025Updated last year
- Jupyter Notebooks for creating Speech datasets☆46Mar 3, 2019Updated 7 years ago
- Evaluation of STT models for german language☆15Jan 22, 2022Updated 4 years ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".☆11Sep 18, 2023Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Feb 6, 2021Updated 5 years ago
- List of papers about TTS / Список статей о TTS☆10Dec 16, 2017Updated 8 years ago
- Evaluate results from ASR/Speech-to-Text quickly☆41Dec 28, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.☆36Oct 4, 2019Updated 6 years ago
- litrl browser and detectors☆10Oct 5, 2023Updated 2 years ago
- FFTNet vocoder implementation☆81Sep 28, 2018Updated 7 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Examples of cleaning up raw voices☆18Mar 2, 2022Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Mar 18, 2019Updated 7 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆37May 14, 2026Updated last week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Mar 31, 2023Updated 3 years ago
- Repo for NYPL's 2016 Event, Open Audio Weekend☆14Jun 30, 2016Updated 9 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 4 years ago
- ☆10Apr 8, 2024Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 4 years ago
- Narration Studio, your all in one TTS Solution!☆34Apr 15, 2026Updated last month
- Home Assistant integration for Hoymiles Cloud API, primarily developed for HYT inverters with battery storage systems. This integration p…☆21Apr 4, 2026Updated last month
- NYPL Oral History Project☆16Mar 4, 2020Updated 6 years ago
- ☆262Dec 8, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Apr 1, 2021Updated 5 years ago
- AWS Transcribe evaluation pipeline: bulk-process audio files and view the results☆17Oct 13, 2023Updated 2 years ago
- A collection of datasets from Skolverket☆11Sep 1, 2020Updated 5 years ago
- transcribe audio feeds into public web ui☆45Aug 31, 2022Updated 3 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- Проект для перевода чисел, записанных в текстовом виде на русском языке.☆11Apr 5, 2022Updated 4 years ago
- My guide to create an italian TTS with Coqui☆14Feb 2, 2022Updated 4 years ago