Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
☆19Aug 13, 2024Updated last year
Alternatives and similar repositories for forced-alignment-chinese
Users that are interested in forced-alignment-chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated last year
- ☆25Jun 19, 2025Updated 9 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 3 months ago
- ☆16Mar 12, 2024Updated 2 years ago
- pku nlp toolkit☆10Jun 5, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆48Apr 5, 2026Updated last week
- disruptor c++ implementation for IPC (arbitrary length of data)☆17Nov 1, 2025Updated 5 months ago
- A fuzzy file picker in a tmux popup for selecting files with terminal-based AI coding assistants☆36Apr 4, 2026Updated last week
- Source code for “Neural RST-based Evaluation of Discourse Coherence” (AACL-IJCNLP 2020)☆28Aug 29, 2021Updated 4 years ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆110Dec 20, 2025Updated 3 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Share your clipboard across the devices☆19Sep 10, 2017Updated 8 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆36Sep 6, 2025Updated 7 months ago
- A smart goto Telescope extension☆11Feb 26, 2024Updated 2 years ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- ☆17Apr 5, 2026Updated last week
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last month
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆53Apr 16, 2025Updated 11 months ago
- Video Jitter Buffer derived from WebRTC☆16Nov 29, 2018Updated 7 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆36Mar 23, 2026Updated 3 weeks ago
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Text-to-text alignment algorithm for speech recognition error analysis.☆28Apr 6, 2026Updated last week
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆11Sep 30, 2024Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆93Dec 28, 2024Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 7 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 9 months ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆68Jan 27, 2026Updated 2 months ago
- Anatomy of a linux kernel development☆27Mar 30, 2017Updated 9 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆24Jan 19, 2026Updated 2 months ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated 3 weeks ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 7 months ago
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in…☆13Jun 30, 2021Updated 4 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 17, 2026Updated 3 weeks ago
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆33Mar 30, 2026Updated 2 weeks ago
- Attach and save files in NeoMutt using Ranger or Vifm as your file picker☆18Sep 3, 2025Updated 7 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 5 months ago
- A ECMAScript proposal to introduce a built-in parser for ES☆15Sep 14, 2019Updated 6 years ago