nilboy / vc-lm
Transform any person's voice into thousands of different voices (voice converter)
☆6Updated last year
Alternatives and similar repositories for vc-lm:
Users that are interested in vc-lm are comparing it to the libraries listed below
- VITS2 for Chinese speech | 最新VITS2中文语音合成☆131Updated last year
- Mac和Windows一键安装Stable Diffusion WebUI,LamaCleaner,SadTalker,ChatGLM2-6B,等AI工具,使用国内镜像,无需魔法。☆240Updated last year
- 基于深度学习的语音增强工具(Speech Enhancement Tools Based on Deep Learning)☆120Updated last year
- 将音频或视频中的中文语音识别并导出为srt字幕,基于魔塔社区Paraformer模型☆102Updated 7 months ago
- The img demo repo for multidiffusion, to avoid large images automatic downloading by webui☆157Updated last year
- AutoCut Client☆338Updated 4 months ago
- 10000 chatTTS voices !chatTTS 音色库,再也不为音色抽卡烦恼啦。这是我第一个项目,熬夜龟速生产10000条音色并上传Github,给点鼓励呗哈!主域名:https://www.TTSlist.com 备用:http://ttslist.aiqb…☆155Updated 7 months ago
- xiaoyuzhou fm audio downloder.☆30Updated last week
- 跨平台基于云平台(阿里云、讯飞等)语音合成 API 的文字转语音助手。支持单文本快速合成和批量合成。支持windows、macOS、Linux。☆290Updated this week
- 一站式短视频拼接软件 无依赖,点击即用,自动去黑边,自动帧同步,自动调整分辨率,批量变更视频为横屏/竖屏☆396Updated 4 months ago
- ⚡ 一款用于自动语音识别 (ASR)、翻译的高性能异步 API。不需要购买Whisper API,使用本地运行的Whisper模型进行推理,并支持多GPU并发,针对分布式部署进行设计。还内置了包括TikTok、抖音等社交媒体平台的爬虫,可实现来自多个社交平台的无缝媒体处理,…☆320Updated last month
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆89Updated 10 months ago
- Bert-vits2转写和标注独立整合Webui,整合阿里FunAsr,必剪Asr以及Whisper大模型☆170Updated 7 months ago
- 基于whisper的实时语音识别 网页和桌面客户端☆160Updated 5 months ago
- 青梧字幕是一款基于whisper的AI字幕提取工具☆454Updated 6 months ago
- 通过LLM进行进行字幕断句分割,处理和优化字幕文件,将自动语音识别(ASR)数据的分段合并与拆分,☆93Updated 2 months ago
- video to video translation with voice clone and lip synchronization|带有语音克隆和口型同步的视频翻译,支持中英互换☆123Updated 10 months ago
- ☆118Updated 9 months ago
- 一个简约的音乐下载工具☆188Updated last year
- SummerAsr 是一个基于C++的可独立编译且几乎没有额外依赖库的本地中文语音识别器。 Summer Asr is a Chinese automatic speech recognize project written with C++ that can be eas…☆87Updated 2 months ago
- chinese NLP corpus of chinese science fiction, chinese science fiction corpus: Archive of the Ark Plan of Ula Science Fiction Website 乌拉科…☆104Updated 2 years ago
- ☆154Updated 3 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆19Updated 3 months ago
- ☆32Updated 7 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆47Updated last year
- 拯救你的英语发音,告别因发音错误带来的尴尬!☆235Updated 2 years ago
- [IP&M 2022] Telegram地下市场中文黑话识别语料集。Telegram Underground Market Chinese Corpus. Paper: Identification of Chinese Dark Jargons in Telegram U…☆183Updated last year
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆414Updated 3 months ago
- Your Companion for Multilingual Reading☆129Updated last year
- A toolkit for speaker diarization.☆171Updated 3 months ago