personal blog
☆18Jun 8, 2022Updated 3 years ago
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python version of PEAQ(Perceptual Evaluation of Audio Quality)☆14Jul 24, 2025Updated 8 months ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆67Jan 27, 2026Updated 2 months ago
- 🔥 语音合成(TTS),语音克隆教程: https://dataxujing.github.io/TTS-paper/#/☆11Oct 29, 2024Updated last year
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆26May 29, 2024Updated last year
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 这是一个基于杰杰大佬mqttclient进行封装的精简调用接口版本,进一步降低了使用者的门槛,杰杰大佬的Github: https://github.com/jiejieTop/mqttclient)☆16Aug 8, 2022Updated 3 years ago
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual☆55Mar 22, 2026Updated last week
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- Baseline system for SVDD 2024 Challenge CtrSVDD track☆28Nov 16, 2024Updated last year
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI☆13May 20, 2023Updated 2 years ago
- ☆11Jun 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Unofficial implementation of wavenext vocoder☆60Aug 28, 2024Updated last year
- ☆13Jan 2, 2025Updated last year
- ☆12May 5, 2017Updated 8 years ago
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆278Sep 10, 2023Updated 2 years ago
- a motion detector for video; written with OpenCV☆12Nov 3, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆54Aug 13, 2024Updated last year
- Kalman filtering for speech signal enhancement☆20May 25, 2016Updated 9 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- gypified libfaad C library☆15Apr 12, 2013Updated 12 years ago
- 语音唤醒☆13Dec 12, 2018Updated 7 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 5 years ago
- Weird autoencoder experiments☆24Jan 26, 2026Updated 2 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- C++版本的汉字转拼音 Transfer chinese character to pinyin☆15Aug 31, 2018Updated 7 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- speaker-disentangled speech linguistic content quantizer☆24Mar 19, 2025Updated last year
- C++ 11 algorithm implementation for voice conversion using harmonic plus stochastic models☆55May 12, 2021Updated 4 years ago
- Chorale Music Separation Dataset and Model Framework☆40Dec 5, 2022Updated 3 years ago
- ☆11Apr 3, 2024Updated last year
- LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search☆95Sep 1, 2021Updated 4 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Feb 16, 2024Updated 2 years ago
- An android app makes real time object detection via TensorFlow for Mobile and warns the user verbally about them and their locations.☆13Dec 18, 2017Updated 8 years ago