Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
☆19Aug 13, 2024Updated last year
Alternatives and similar repositories for forced-alignment-chinese
Users that are interested in forced-alignment-chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- ☆25Jun 19, 2025Updated 10 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- ☆16Mar 12, 2024Updated 2 years ago
- pku nlp toolkit☆10Jun 5, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆48Apr 5, 2026Updated last month
- disruptor c++ implementation for IPC (arbitrary length of data)☆17Nov 1, 2025Updated 6 months ago
- A fuzzy file picker in a tmux popup for selecting files with terminal-based AI coding assistants☆38Apr 26, 2026Updated last week
- Source code for “Neural RST-based Evaluation of Discourse Coherence” (AACL-IJCNLP 2020)☆28Aug 29, 2021Updated 4 years ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆112Dec 20, 2025Updated 4 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Share your clipboard across the devices☆19Sep 10, 2017Updated 8 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A smart goto Telescope extension☆11Feb 26, 2024Updated 2 years ago
- ☆36Sep 6, 2025Updated 8 months ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- ☆17Apr 5, 2026Updated last month
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆23Feb 26, 2026Updated 2 months ago
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆53Apr 16, 2025Updated last year
- Video Jitter Buffer derived from WebRTC☆16Nov 29, 2018Updated 7 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Apr 29, 2026Updated last week
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Text-to-text alignment algorithm for speech recognition error analysis.☆29Apr 6, 2026Updated 3 weeks ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆94Dec 28, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 10 months ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 8 months ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆69Apr 27, 2026Updated last week
- Anatomy of a linux kernel development☆27Mar 30, 2017Updated 9 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆24Jan 19, 2026Updated 3 months ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in…☆13Jun 30, 2021Updated 4 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 8 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Apr 13, 2026Updated 3 weeks ago
- Attach and save files in NeoMutt using Ranger or Vifm as your file picker☆18Sep 3, 2025Updated 8 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 6 months ago
- A ECMAScript proposal to introduce a built-in parser for ES☆15Sep 14, 2019Updated 6 years ago
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆36Mar 30, 2026Updated last month