Czj1997-02 / SeamlessM4TAppLinks
基于SeamlessM4T模型的Flask接口和Flutter应用
☆57Updated 2 years ago
Alternatives and similar repositories for SeamlessM4TApp
Users that are interested in SeamlessM4TApp are comparing it to the libraries listed below
Sorting:
- ChatTTS HTTP API☆54Updated last year
- make your Speaker talking as Native style with own voice!☆251Updated last year
- 跨语种语音克隆,中文版Webui☆61Updated last year
- offline 2d digitalhuman demo for edge devices (android/ios/etc.)☆81Updated last year
- video to video translation with voice clone and lip synchronization|带有语音克隆和口型同步的视频翻译,支持中英互换☆142Updated last year
- ☆119Updated last year
- ☆70Updated 2 years ago
- TTS☆80Updated last year
- ☆41Updated last year
- a gradio webui for faster whisper☆276Updated 2 years ago
- 基于OpenVoice和Melotts整合的中文版webui,添加resemble_enhance音频增强功能☆99Updated last year
- 获取bilibili直播弹幕,使用WebSocket协议☆37Updated last year
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆174Updated 10 months ago
- ☆232Updated 2 years ago
- Fuse ChatTTS with OpenVoice, upload a 10-second audio clip, and clone your personalized ChatTTS voice.☆459Updated last year
- Streaming ASR and TTS based on FastAPI+ sherpa-onnx☆177Updated last month
- 基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo☆106Updated last year
- 通过此代码可以免训练模型并通过轻量级服务器定制数字人形象☆106Updated last year
- 洛曦 数字人视频播放器,带HTTP API,使用gradio api对接Easy-Wav2Lip、Sadtalker、GeneFacePlusPlus、MuseTalk,也可以用于播放本地视频☆175Updated last year
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!☆144Updated 2 years ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆108Updated 2 months ago
- 一个用于F5-TTS的api和webui项目☆64Updated last year
- 根据声音生成音色文件☆37Updated last year
- 重生之我是 AI 打工人。前世,我的身份默默无闻,来去匆匆,不知道自己将在何地出生。然而,命运给予了我难得的机会,让我重生为一名 AI 打工人。☆50Updated 2 years ago
- MotionAgent is your AI assistent to convert ideas into motion pictures.☆307Updated last year
- ASR (Automatic Speech Recognition) for real-time streamed audio powered by Whisper and tranformers☆36Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆104Updated last year
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆46Updated 2 years ago
- 基于 faster-whisper 的伪实时语音转写服务☆233Updated 8 months ago
- SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.☆208Updated last year