JaesungHuh/simple-subtitling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JaesungHuh/simple-subtitling)

JaesungHuh / simple-subtitling

Character-aware audio-only subtitling

☆31

Alternatives and similar repositories for simple-subtitling

Users that are interested in simple-subtitling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

callee2006 / HGUNeuralNetworks
View on GitHub
Multi-layer perceptron, Autoencoder, and Restricted Boltzmann Machine
☆10Sep 15, 2018Updated 7 years ago
Yangyangii / ProgressiveGAN
View on GitHub
Implementation of NVIDIA's Progressive Growing of GANs
☆11Jan 6, 2020Updated 6 years ago
Mrugalla / JIF
View on GitHub
a VST3 Plugin (Windows, 64bit), that loops a GIF to the tempo of your beat.
☆21Sep 15, 2025Updated 10 months ago
ftshijt / speech_evaluation
View on GitHub
A toolkit dedicate for speech evaluation.
☆23Sep 26, 2024Updated last year
BUTSpeechFIT / mt-asr-data-prep
View on GitHub
☆25Feb 26, 2026Updated 4 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Mrugalla / Overdrive-ReNEO
View on GitHub
an mda overdrive tribute project
☆20Sep 15, 2025Updated 10 months ago
MaikeZuefle / f-actor
View on GitHub
☆28Jul 17, 2026Updated last week
plnguyen2908 / UniTalk-ASD-code
View on GitHub
[Interspeech 2026] Revisiting Active Speaker Detection: An In-the-Wild Benchmark for Generalization and Robustness
☆22Jun 25, 2026Updated last month
Yangyangii / DeepConvolutionalTTS-pytorch
View on GitHub
Deep Convolutional TTS pytorch implementation
☆27Jul 2, 2019Updated 7 years ago
landonviator / Poletti-Class-B-Amplifier
View on GitHub
☆11Aug 7, 2024Updated last year
MTG / carnatic-separation-ismir23
View on GitHub
Carnatic singing voice separation trained with in-domain data with leakage
☆11Nov 5, 2023Updated 2 years ago
Mddct / transformer-vocos
View on GitHub
☆35Sep 6, 2025Updated 10 months ago
shansongliu / HumTrans
View on GitHub
☆13Sep 26, 2023Updated 2 years ago
xjuspeech / YOLOPitch
View on GitHub
☆10Jun 11, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mysee1989 / GraphJigsaw
View on GitHub
Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition
☆10Jul 1, 2022Updated 4 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
carlosabalde / mobiledetect2vcl
View on GitHub
Python script to transform the Mobile Detect JSON database into an UA-based mobile detection VCL subroutine easily integrable in any Varn…
☆14Nov 13, 2023Updated 2 years ago
andywiggins / ddsp-guitar-synth
View on GitHub
A Differentiable Acoustic Guitar Model for String-Specific Polyphonic Synthesis
☆18Nov 16, 2023Updated 2 years ago
ArtemisWang / blind_movies
View on GitHub
为视障人群生成电影，输入是电影剧本和mkv格式电影，输出为带有解说的电影
☆12Jul 28, 2019Updated 6 years ago
carpedm20 / HeXA-Bot
View on GitHub
KakaoTalk robot which automatically answer to your command
☆11Feb 24, 2014Updated 12 years ago
MSD-IRIMAS / Augmenting-TSC-Elastic-Averaging
View on GitHub
Augmenting Time Series Datasets with Weighted Elastic Barycenter Averaging
☆10Jun 2, 2025Updated last year
shanwangshan / TAU-urban-audio-visual-scenes
View on GitHub
☆12Oct 23, 2021Updated 4 years ago
masanbasa3k / Generate_Your_Own_Music
View on GitHub
This repository contains code for generating new music using Generative Adversarial Networks (GANs). GANs are a type of deep learning mod…
☆29Aug 26, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ryylcc / OWSOL
View on GitHub
☆15Feb 18, 2024Updated 2 years ago
chunan / libdtw
View on GitHub
An implementation of DTW for spoken term detection. Including non-constrained, segmental DTW, slope-constrained versions. For more detail…
☆16Jun 2, 2019Updated 7 years ago
chimechallenge / C8DASR-Baseline-NeMo
View on GitHub
NeMo: a toolkit for conversational AI
☆13May 4, 2024Updated 2 years ago
wavlab-speech / shinjiwlab.github.io
View on GitHub
☆18Updated this week
ajin12 / tooldetection
View on GitHub
☆14Mar 16, 2019Updated 7 years ago
SRPOL-AUI / spectrum-correction
View on GitHub
Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"
☆13Feb 22, 2022Updated 4 years ago
iamcam / ai-wordpress-rag-demo
View on GitHub
This small project demonstrates how to integrate WordPress blog entries into queries for a RAG-based (Retriever-Augmented Generation) lan…
☆11Apr 2, 2024Updated 2 years ago
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 6 months ago
ZhaoZeyu1995 / BenNevis
View on GitHub
A Diffrentiable WFST-based End-to-End Automatic Speech Recognition toollkit with flexible topology support
☆12Feb 15, 2026Updated 5 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
wiragotama / TIARA-annotationTool
View on GitHub
An Interactive Tool for Annotating Discourse Structure and Text Improvement
☆16Sep 15, 2021Updated 4 years ago
TheAudioProgrammer / stateVariableFilter
View on GitHub
☆11Jan 27, 2018Updated 8 years ago
md-mohaiminul / TranS4mer
View on GitHub
☆34Jun 2, 2023Updated 3 years ago
Mrugalla / Absorb
View on GitHub
a sidechain plugin that mixes up the texture of the colliding input signals
☆50Sep 12, 2025Updated 10 months ago
JaesungHuh / VoxSRC2022
View on GitHub
VoxSRC2022 workshop development kit
☆19Jul 21, 2022Updated 4 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
keonlee9420 / Robust_Fine_Grained_Prosody_Control
View on GitHub
PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
☆41Feb 20, 2022Updated 4 years ago