Mandarin Chinese audio datasets aligned with Montreal Forced Aligner
☆17Aug 13, 2024Updated last year
Alternatives and similar repositories for forced-alignment-chinese
Users that are interested in forced-alignment-chinese are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- ☆25Jun 19, 2025Updated 9 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- ☆15Mar 12, 2024Updated 2 years ago
- pku nlp toolkit☆10Jun 5, 2018Updated 7 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- ☆42Nov 4, 2025Updated 4 months ago
- disruptor c++ implementation for IPC (arbitrary length of data)☆17Nov 1, 2025Updated 4 months ago
- A fuzzy file picker in a tmux popup for selecting files with terminal-based AI coding assistants☆32Mar 13, 2026Updated last week
- Source code for “Neural RST-based Evaluation of Discourse Coherence” (AACL-IJCNLP 2020)☆28Aug 29, 2021Updated 4 years ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆109Dec 20, 2025Updated 3 months ago
- faster inference☆28Jan 20, 2025Updated last year
- Share your clipboard across the devices☆19Sep 10, 2017Updated 8 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 4 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆36Sep 6, 2025Updated 6 months ago
- A smart goto Telescope extension☆11Feb 26, 2024Updated 2 years ago
- A package used to test webrtc apm functions, such as aec, ns☆17Feb 21, 2019Updated 7 years ago
- ☆17Jan 4, 2026Updated 2 months ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last month
- Command-line interface and Python library to transcribe pinyin to IPA. The tones are attached to the vowel of the syllable.☆53Apr 16, 2025Updated 11 months ago
- Video Jitter Buffer derived from WebRTC☆16Nov 29, 2018Updated 7 years ago
- Aligntune : A Modular Toolkit for Post Training Alignment of LLMs☆37Updated this week
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆67Jan 27, 2026Updated last month
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆93Dec 28, 2024Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 6 months ago
- This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…☆29Feb 8, 2026Updated last month
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…☆166Mar 6, 2026Updated 3 weeks ago
- Anatomy of a linux kernel development☆27Mar 30, 2017Updated 8 years ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks☆23Jan 19, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆15May 11, 2025Updated 10 months ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last week
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆35Aug 30, 2025Updated 6 months ago
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in…☆13Jun 30, 2021Updated 4 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 17, 2026Updated last week
- Attach and save files in NeoMutt using Ranger or Vifm as your file picker☆18Sep 3, 2025Updated 6 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 5 months ago