aolney/manual-subtitle-speech-alignment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aolney/manual-subtitle-speech-alignment)

aolney / manual-subtitle-speech-alignment

Postprocess SRT derived speech alignments for creating clean datasets for machine learning

☆17

Alternatives and similar repositories for manual-subtitle-speech-alignment

Users that are interested in manual-subtitle-speech-alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ozdefir / finetuneas
View on GitHub
An HTML interface for finetuning the sync map output from aeneas
☆53Jul 5, 2022Updated 4 years ago
lukereichold / SpeechTimestamper
View on GitHub
Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.
☆21Aug 16, 2020Updated 5 years ago
Yeongtae / tacotron2
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆30May 28, 2020Updated 6 years ago
zhaoyi2 / CVTE_chain_model_finetune
View on GitHub
finetune the chain model based on cvte open source model without traing any GMM for frame alignment
☆12Aug 6, 2020Updated 5 years ago
redsk / neo_concept
View on GitHub
ConceptNet to neo4j 2.2
☆10Nov 6, 2015Updated 10 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Hongtao-Lin / NER
View on GitHub
Named entity recognition system using multi-stage CRF and statistical rules
☆11Oct 3, 2016Updated 9 years ago
chenzhehuai / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆13Jun 5, 2019Updated 7 years ago
amirharati / kaldi-alligner
View on GitHub
scripts to align a given wave to its transcription using trained models by Kaldi
☆37Aug 15, 2019Updated 6 years ago
fssqawj / QAServer
View on GitHub
QA Server Based Chinese CQA Site
☆12Jul 14, 2021Updated 5 years ago
webaverse / LJSpeechTools
View on GitHub
Tools to isolate speaker and transcribe unstructured audio clips
☆11Dec 4, 2022Updated 3 years ago
jpuigcerver / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆81Jun 10, 2019Updated 7 years ago
GaoQ1 / rasa_nlu_cn
View on GitHub
Turn Chinese natural language into structured data 中文自然语言理解，并支持spacy
☆13Jul 9, 2024Updated 2 years ago
matteocontrini / amazon-ssml-cheatsheet
View on GitHub
Amazon SSML cheatsheet
☆16Nov 9, 2018Updated 7 years ago
YanWenqiang / MedicalNER
View on GitHub
医疗命名实体识别， CRF，
☆13Jun 26, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wassname / phoneme2grapheme
View on GitHub
Teaching machines to spell with deep learning (acc=>80%) e.g. a model hears "pɹˈaʊd˺ɚ" and writes "prowder" (but it should be "prouder")
☆19Jun 1, 2017Updated 9 years ago
hslh / pie-detection
View on GitHub
Automatic Detection of Potentially Idiomatic Expressions
☆12Feb 19, 2021Updated 5 years ago
NeoTeo / fingerprinter-chromaprint
View on GitHub
Swift rewrite of audio fingerprinting using the Chromaprint C++ library.
☆10Feb 6, 2017Updated 9 years ago
shinhyeokoh / rwen
View on GitHub
☆14Jun 16, 2023Updated 3 years ago
toaq / toadua
View on GitHub
The online collaborative dictionary for the constructed language Toaq.
☆14Updated this week
amunategui / Read-and-Process-Files-Larger-Than-RAM
View on GitHub
Using the function read.table() to break file into chunks to loop and process them. This allows processing files of any size beyond what …
☆10Aug 19, 2014Updated 11 years ago
pop123123123 / CLI_sentence_mixing
View on GitHub
☆15Jan 14, 2024Updated 2 years ago
jhasegaw / phonecodes
View on GitHub
python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.
☆44Jun 18, 2026Updated last month
Tomiinek / Blizzard2013_Segmentation
View on GitHub
Transcripts and segmentation for the Blizzard 2013 audiobooks also known as the Lessac or Blizzard 2013 dataset.
☆45Nov 13, 2019Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FrankGrimm / node-germansentiment
View on GitHub
german sentiment analysis
☆13Mar 8, 2017Updated 9 years ago
oxygen-dioxide / OpenUtau
View on GitHub
Open source UTAU editing environment.
☆11Updated this week
Utopiah / googlepoly-load-component
View on GitHub
A-Frame VR component to load a 3D model from Google Poly
☆13Mar 8, 2019Updated 7 years ago
cabal-club / commons
View on GitHub
high level thoughts and issues for the future of cabal
☆14Jan 9, 2024Updated 2 years ago
veniamin-ilmer / better-standards
View on GitHub
Personal opinions of Standards and Policies
☆14Aug 22, 2021Updated 4 years ago
Richienb / Richienb
View on GitHub
☆18Sep 18, 2022Updated 3 years ago
nickzoic / word-list
View on GitHub
A 512 word list for passphrases etc
☆13Jun 26, 2026Updated last month
ArgLab / writing_observer
View on GitHub
Writing Observer and Learning Observer: A system for monitoring learning process data, with an initial focus on writing process data from…
☆12Updated this week
DDR0 / fuseblk-filename-fixer
View on GitHub
A small program to fix filename issues when copying to different filesystems, licenced under GNU GPLv3.
☆11Nov 17, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
proycon / spacy2folia
View on GitHub
Use spaCy for NLP and output to the FoLiA XML format.
☆12Feb 27, 2024Updated 2 years ago
MarginaliaSearch / PublicData
View on GitHub
Public data sets for Marginalia Search
☆15Jul 11, 2026Updated 2 weeks ago
dbpedia / neural-rdf-verbalizer
View on GitHub
🗣 Multilingual RDF Verbalizer – Google Summer of Code 2019
☆21Mar 24, 2023Updated 3 years ago
bronichern / DeepFry
View on GitHub
☆13Jun 29, 2025Updated last year
ruathudo / post-ocr-correction
View on GitHub
☆11Nov 14, 2021Updated 4 years ago
howl-anderson / rasa_contrib
View on GitHub
rasa_contrib is a addon package for rasa. It provide some useful/powerful addition components
☆21Dec 8, 2022Updated 3 years ago
camenduru / DiffSketcher-colab
View on GitHub
☆16Dec 18, 2023Updated 2 years ago