jingzhunxue / flow_mirrorLinks
flow mirror models from JZX AI Labs
☆45Updated 8 months ago
Alternatives and similar repositories for flow_mirror
Users that are interested in flow_mirror are comparing it to the libraries listed below
Sorting:
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆99Updated 5 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆20Updated last month
- FastThresholdClustering is an efficient vector clustering algorithm based on FAISS, particularly suitable for large-scale vector data clu…☆24Updated 5 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆102Updated last week
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆93Updated 8 months ago
- ☆19Updated 7 months ago
- ☆24Updated 5 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆100Updated 2 years ago
- 基于FunASR实现语音识别,包含常规版和ONNX版(推荐)。☆39Updated 7 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated last month
- Official release of StyleTalk dataset.☆64Updated 11 months ago
- Official Code for ParrotTTS☆51Updated 7 months ago
- ☆108Updated last month
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆71Updated 7 months ago
- ☆65Updated last year
- A enterprise-grade Chinese-English code switch punctuator from funasr.☆24Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆49Updated 10 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆75Updated this week
- 单独维护的中文TTS☆35Updated 2 years ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆195Updated 3 months ago
- BLSP-Emo: Towards Empathetic Large Speech-Language Models☆45Updated 11 months ago
- ☆198Updated 8 months ago
- ☆40Updated 3 months ago
- ☆56Updated 11 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆98Updated 2 months ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆73Updated 3 weeks ago
- A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5☆31Updated 2 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆58Updated 7 months ago
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆31Updated 9 months ago