lewangdev / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆9Updated 7 months ago
Alternatives and similar repositories for CosyVoice:
Users that are interested in CosyVoice are comparing it to the libraries listed below
- Chrome extension to add a link from each Arxiv page to the corresponding HF Paper page☆25Updated last year
- ☆32Updated last year
- ☆33Updated 9 months ago
- g1: Using GPT-4o to create o1-like reasoning chains☆20Updated 5 months ago
- WhisperMesh is an advanced chatbot that integrates voice and text interactions, delivering personalized responses through LLM models and …☆14Updated 3 weeks ago
- coze api to openai☆13Updated 5 months ago
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆35Updated last week
- Conversational Retrieval Evaluation Dataset☆98Updated last week
- 智能视频处理系统☆44Updated last month
- Datalore is an AI-powered Data Analysis tool that integrates Anthropic's Claude API with various data analysis libraries and custom funct…☆39Updated this week
- Chooat is an open-source project designed to provide a seamless and powerful AI chat experience.☆19Updated last month
- A real-time AI development framework leveraging WebRTC for audio and video transmission.☆102Updated 2 weeks ago
- 用文本编辑器剪视频☆36Updated last year
- ASR using OpenAI capability API `v1/audio/transcriptions` like Groq, SiliconFlow☆27Updated 5 months ago
- This repo is built for showing how to generate PPT use python☆40Updated 6 months ago
- 一个简单的音频降噪工具,提高web UI界面和api接口☆19Updated 3 months ago
- 基于 OpenAI Realtime Console 修改的语音聊天应用。支持定义 api base url。☆32Updated 4 months ago
- TEaR framework for paper "TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement"☆45Updated 6 months ago
- The inference code of RVC-Boss/GPT-SoVITS that can be developer-friendly.☆12Updated 4 months ago
- 一个基于Together AI的强大图像生成工具,支持文生图、图生图和提示词分析功能。☆20Updated 2 months ago
- A transformer-based multimodal model for music.☆28Updated 6 months ago
- 02. Enabling various applications to be AI-enabled or used by AI.☆28Updated 5 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆13Updated last week
- Perplexity style AI Search engine clone built with Gemini 2.0 Flash and Grounding☆12Updated last month