camenduru / seamless-expressive-hf

☆14

Alternatives and similar repositories for seamless-expressive-hf

Users who are interested in seamless-expressive-hf are comparing it to the repositories listed below.

camenduru / Open-Sora-jupyter
☆12Updated last year
camenduru / TokenFlow-colab
☆22Updated last year
yannqi / Draw-an-Audio-Code
Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.
☆46Updated 8 months ago
xinliu9451 / awesome-denoiser
This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …
☆37Updated 5 months ago
Apple-jun / FilmComposer
Music production for silent film clips.
☆22Updated 2 weeks ago
WGS-note / F5_TTS_Faster
F5-TTS inference acceleration, roughly a 4x speedup!
☆85Updated 4 months ago
MiniMax-AI / audio-tools
A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…
☆13Updated last month
AI-S2-Lab / EmoPP
[NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech
☆22Updated 8 months ago
Tencent / Tencent-Hunyuan-7B
☆18Updated 3 months ago
HUIZ-A / SVA
☆20Updated last year
0nutation / SpeechAgents
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
☆81Updated last year
jingzhunxue / flow_mirror
flow mirror models from JZX AI Labs
☆45Updated 7 months ago
camenduru / styletts-colab
☆39Updated last year
knoriy / CLARA
☆62Updated 9 months ago
ariesssxu / vta-ldm
☆57Updated 10 months ago
ScottishFold007 / TTSAudioNormalizer
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆98Updated 4 months ago
yynil / RWKVTTS
This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).
☆74Updated last week
kyegomez / SoundStream
Implementation of SoundStream from the paper: "SoundStream: An End-to-End Neural Audio Codec"
☆11Updated 3 months ago
ZaVang / GPT-SoVits
A refactoring of the GPT-SOVITS project: parts of the code are rewritten, the webui is easier to use, and API calls are added.
☆27Updated 5 months ago
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆41Updated 2 months ago
parrot-tts / Parrot-TTS
Official Code for ParrotTTS
☆50Updated 7 months ago
VoiceBank-NTPU-TW / VoiceBank-2023
VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.
☆39Updated last year
Audio-AGI / FlowSep
Official implementation for FlowSep
☆47Updated 4 months ago
i4Ds / whisper-finetune
This repository contains code for fine-tuning the Whisper speech-to-text model.
☆9Updated 2 months ago
camenduru / MiniGPT-v2-colab
☆29Updated last year
jakariaemon / WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆18Updated 2 months ago
camenduru / dreamtalk
Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
☆15Updated last year
zhenye234 / LLaSA_inference
☆40Updated 3 months ago
hay86 / ComfyUI_Dreamtalk
Unofficial implementation of DreamTalk in ComfyUI
☆12Updated 9 months ago
tonychenxyz / emoknob
This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…
☆71Updated 7 months ago