personal blog
☆18Jun 8, 2022Updated 3 years ago
Alternatives and similar repositories for blog
Users that are interested in blog are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python version of PEAQ(Perceptual Evaluation of Audio Quality)☆14Jul 24, 2025Updated 9 months ago
- Extract your SlidesLive presentation.☆15Apr 19, 2024Updated 2 years ago
- Official Repository of Paper: "SynParaSpeech: Automated Synthesis of Paralinguistic Datasets for Speech Generation and Understanding" (IC…☆69Apr 27, 2026Updated last week
- 🔥 语音合成(TTS),语音克隆教程: https://dataxujing.github.io/TTS-paper/#/☆11Oct 29, 2024Updated last year
- WCH 32-bit chip series cmake collection repo☆15Mar 3, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- The simplest way to demix stereo content with decent quality and low latency.☆19Apr 11, 2019Updated 7 years ago
- [WIP]Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).☆28May 29, 2024Updated last year
- Project of Singing Voice Conversion.☆16Oct 27, 2023Updated 2 years ago
- 这是一个基于杰杰大佬mqttclient进行封装的精简调用接口版本,进一步降低了使用者的门槛,杰杰大佬的Github: https://github.com/jiejieTop/mqttclient)☆17Aug 8, 2022Updated 3 years ago
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- Clone of the mp3gain sources from svn on sourceforge (http://mp3gain.sourceforge.net/)☆11Jan 3, 2013Updated 13 years ago
- tts fronted-end☆11Dec 19, 2018Updated 7 years ago
- NUEDC 2021 G by OpenMV4☆13Nov 19, 2021Updated 4 years ago
- Baseline system for SVDD 2024 Challenge CtrSVDD track☆29Nov 16, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI☆13May 20, 2023Updated 2 years ago
- 一个将豆包 ASR 能力封装为 OpenAI 兼容接口的小项目,支持 Docker 启动,并提供一份可配合 Spokenly 使用的参考修正提示词,实现和 Typeless 类似的语音修正效果。☆35Feb 28, 2026Updated 2 months ago
- ☆11Jun 14, 2024Updated last year
- ☆13Oct 27, 2021Updated 4 years ago
- Unofficial implementation of wavenext vocoder☆60Aug 28, 2024Updated last year
- frameworks_base for Geeksphone Peak and Keon☆12Jan 13, 2015Updated 11 years ago
- ☆14Jan 2, 2025Updated last year
- Finding the most similar tone/color in a large collection of audio. 在一大堆音频中寻找最相似的音色。☆13Jun 17, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Dataset, code and results repository for SBA-Net.☆14Sep 23, 2022Updated 3 years ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆281Sep 10, 2023Updated 2 years ago
- a motion detector for video; written with OpenCV☆12Nov 3, 2022Updated 3 years ago
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆56Aug 13, 2024Updated last year
- ☆46Feb 10, 2021Updated 5 years ago
- QT快速入门笔记☆21Jul 31, 2018Updated 7 years ago
- Kalman filtering for speech signal enhancement☆20May 25, 2016Updated 9 years ago
- a automatic script to change a Keil project to makefile project☆22May 11, 2022Updated 3 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- gypified libfaad C library☆16Apr 12, 2013Updated 13 years ago
- ☆13May 5, 2017Updated 9 years ago
- 语音唤醒☆13Dec 12, 2018Updated 7 years ago
- speech enhancement algorithms for microphone arrays☆15May 12, 2020Updated 5 years ago
- Weird autoencoder experiments☆24Apr 24, 2026Updated 2 weeks ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- Triton kernel fusion for Qwen3-TTS 1.7B inference acceleration — RMSNorm, SwiGLU, M-RoPE, Norm+Residual☆76Apr 17, 2026Updated 3 weeks ago