saurabhshri / CCAligner
🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
☆170Oct 27, 2019Updated 6 years ago
Alternatives and similar repositories for CCAligner
Users who are interested in CCAligner are comparing it to the libraries listed below.
- Real-time Audio-to-audio Karaoke Generation System for Monaural Music☆42May 24, 2021Updated 4 years ago
- ☆13Aug 23, 2024Updated last year
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- Generate an accurate, timestamped transcript given an audio file and its text using Google Cloud's Speech-to-Text API via gRPC.☆21Aug 16, 2020Updated 5 years ago
- Fast and differentiable hidden Markov model in C++☆19Jan 20, 2023Updated 3 years ago
- A desktop app to speed up, streamline and simplify the process of creating custom karaoke videos.☆17Jul 5, 2025Updated 7 months ago
- Libsms is an open source C library that implements SMS techniques for the analysis, transformation and synthesis of musical sounds based …☆15Sep 9, 2024Updated last year
- Sisyphus recipes for ASR☆18Updated this week
- A collection of links and notes on forced alignment tools☆935Nov 10, 2021Updated 4 years ago
- aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)☆2,804Jun 22, 2024Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- Tools for parsing the audio track in television news programs☆19Apr 24, 2021Updated 4 years ago
- WaveNet implementation using tf.estimator☆21Jul 6, 2023Updated 2 years ago
- Automatic Detection of Potentially Idiomatic Expressions☆12Feb 19, 2021Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Hybrid speech synthesiser☆28Feb 18, 2019Updated 6 years ago
- An extensible speech synthesis system, built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery☆26Jun 24, 2019Updated 6 years ago
- ☆17Jan 8, 2026Updated last month
- A simple lyrics editor (generator and organizer as well) for .LRC files.☆11Oct 27, 2023Updated 2 years ago
- A Grapheme to Phoneme model using LSTM implemented in pytorch☆13Jul 6, 2022Updated 3 years ago
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 6 years ago
- ☆15Mar 4, 2017Updated 8 years ago
- Unsupervised speech activity detection system.☆11Jul 2, 2018Updated 7 years ago
- A multi-platform Unix CLI that prints a symlink's complete chain of targets using absolute paths.☆13Dec 27, 2022Updated 3 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 4 months ago
- Pale Moon add-on that exports and imports passwords☆11Nov 6, 2021Updated 4 years ago
- ☆11Nov 7, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- ☆22Apr 8, 2022Updated 3 years ago
- A fast, flexible CD+Graphics (CD+G) renderer☆27Feb 7, 2026Updated last week
- ☆80Aug 8, 2025Updated 6 months ago
- Implements some visual effects using GPUImage☆11Dec 18, 2015Updated 10 years ago
- ☆12Oct 2, 2020Updated 5 years ago
- A conda-smithy repository for nvcc.☆13Jan 23, 2025Updated last year
- ☆14Aug 1, 2025Updated 6 months ago