fishaudio / fish-speech
Brand new TTS solution
☆11,190Updated this week
Related projects: ⓘ
- A generative speech model for daily dialogue.☆30,703Updated 2 weeks ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,398Updated last month
- A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提 取。☆11,253Updated this week
- Enjoy the magic of Diffusion models!☆6,349Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆4,768Updated last week
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。☆9,486Updated this week
- Open-Sora: Democratizing Efficient Video Production for All☆21,609Updated last month
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆7,201Updated last month
- Instant voice cloning by MIT and MyShell.☆28,390Updated 3 weeks ago
- 利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.☆16,112Updated last month
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆32,567Updated this week
- Bring portraits to life!☆11,729Updated last week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆13,879Updated this week
- Open source real-time translation app for Android that runs locally☆6,326Updated last week
- MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone☆11,907Updated this week
- Inference and training library for high-quality TTS models.☆4,193Updated 3 weeks ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,459Updated 2 months ago
- FreeAskInternet is a completely free, PRIVATE and LOCALLY running search aggregator & answer generate using MULTI LLMs, without GPU neede…☆8,451Updated 5 months ago
- Next generation face swapper and enhancer☆17,808Updated this week
- RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.☆17,176Updated this week
- Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音☆9,955Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,553Updated 7 months ago
- Your image is almost there!☆7,207Updated last month
- Convert PDF to markdown quickly with high accuracy☆16,438Updated last week
- A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频☆7,202Updated 3 weeks ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆5,259Updated 2 months ago
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆10,156Updated last week
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,376Updated last month
- Question and Answer based on Anything.☆11,376Updated this week
- 一个简单的本地网页界面,使用ChatTTS将文字合成为语音,同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…☆5,868Updated 2 weeks ago