Jack-Cherish / dsi
Short for "Do Something Interesting": do some interesting things.
☆259 Jan 10, 2025 Updated last year
Alternatives and similar repositories for dsi
Users who are interested in dsi are comparing it to the libraries listed below
- This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion ☆5,018 Jan 21, 2025 Updated last year
- Documentation for Bert-VITS2 ☆22 Nov 29, 2023 Updated 2 years ago
- ☆311 Dec 10, 2023 Updated 2 years ago
- Easy to use and open-source unknown stealer ☆22 Jul 24, 2023 Updated 2 years ago
- vits2 backbone with multilingual-bert ☆8,687 Updated this week
- A desktop application built with Tauri + React + TypeScript that helps creators quickly generate short-video scripts and editing plans. ☆22 Jun 7, 2025 Updated 8 months ago
- InstantID for StableDiffusion 1.5. ☆11 Jul 6, 2024 Updated last year
- ☆607 Jan 8, 2024 Updated 2 years ago
- A lightweight key-value service written in Go ☆13 Jun 9, 2020 Updated 5 years ago
- ☆33 Nov 27, 2024 Updated last year
- Bark Voice Cloning and Voice Cloning for Chinese Speech ☆2,965 Dec 19, 2025 Updated last month
- Annotation tool for semantic segmentation using GrabCut() of OpenCV. It can be used to create datasets for semant… ☆38 Dec 13, 2021 Updated 4 years ago
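- This project builds on SadTalkers to implement Wav2lip video lip synthesis. Speech drives lip generation from a video file, and a configurable enhancement of the face region sharpens the synthesized lip (face) area to improve the clarity of the generated lips. The DAIN deep-learning frame-interpolation algorithm fills in frames of the generated video, smoothing the motion transitions of the synthesized lips between frames, so that the synthesized lip… ☆2,003 Jun 4, 2023 Updated 2 years ago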
- Chat with any character you like: ChatGLM2+SadTalker+Voice Cloning | Have an immersive conversation with a character you like: ChatGLM2 + voice cloning + video chat ☆613 Aug 11, 2023 Updated 2 years ago
- Speech AI training and inference tools ☆36 Jun 25, 2023 Updated 2 years ago
- Code for the AAAI 2025 paper "VIoTGPT: Learning to Schedule Vision Tools in LLMs towards Intelligent Video Internet of Things" ☆15 Jan 16, 2025 Updated last year
- ☆16 Apr 11, 2024 Updated last year
- A fork of images browse for stable-diffusion-webui ☆15 Jun 15, 2023 Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning) ☆54,918 Updated this week
- A Universal Framework for AI Video Watermark Removal ☆49 Dec 5, 2025 Updated 2 months ago
- AI Vtuber is a virtual streamer [Live2D/UE/xuniren] driven by [ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/Wenda/Qianwen/kimi/ollama] that can, on [Bilibili/Douyin/… ☆4,266 Jul 29, 2025 Updated 6 months ago
- Use stable diffusion to outpaint around an image and uncrop it ☆21 Feb 8, 2023 Updated 3 years ago
- text-to-video ☆74 Jul 9, 2023 Updated 2 years ago
- Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out! ☆1,227 Feb 5, 2024 Updated 2 years ago
- Development and usage documentation for the flowmix multimodal editor. ☆21 Nov 21, 2024 Updated last year
- Voice conversion with just linear regression. ☆33 Sep 25, 2025 Updated 4 months ago
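- Real-time STT connected to the OpenAI API/Zhipu AI (streaming LLM) and GPT-SOVITS/Edge-TTS, with service calls made across networks through a web page to achieve real-time conversation ☆431 Dec 31, 2024 Updated last year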
- SoftVC VITS Singing Voice Conversion ☆27,992 Nov 11, 2023 Updated 2 years ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild ☆7,212 Aug 5, 2024 Updated last year
- A Stable Diffusion webUI extension for managing trigger words for LoRA or other models ☆258 Jun 5, 2024 Updated last year
- A model download node for ComfyUI that supports downloading models from civitai and huggingface. ☆26 Jan 3, 2025 Updated last year
- Crawler and data analysis for Lianjia housing data ☆20 Sep 28, 2019 Updated 6 years ago
- ChatGLM2-6B: An Open Bilingual Chat LLM ☆15,663 Jun 27, 2024 Updated last year
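- DeepParseX is a powerful multimodal document parsing and knowledge management platform. It supports intelligent parsing of many file formats, including PDF, Word, Excel, PPT, images, video, and audio, automatically extracts key information, and builds retrieval-augmented generation (RAG) and knowledge graph systems, enabling, for structured data, intelligent… ☆56 Updated this week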
- How to use tensorboard in fastai ☆21 Jul 10, 2019 Updated 6 years ago
- Binance quantitative trading ☆23 Apr 2, 2022 Updated 3 years ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ☆7,821 Dec 6, 2023 Updated 2 years ago
- [CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation ☆13,587 Jun 26, 2024 Updated last year
- Parameter-efficient fine-tuning of ChatGLM-6B based on LoRA and P-Tuning v2 ☆55 May 17, 2023 Updated 2 years ago