HWTeng-Teaching / 202409-StatLinks
☆10Updated 9 months ago
Alternatives and similar repositories for 202409-Stat
Users that are interested in 202409-Stat are comparing it to the libraries listed below
Sorting:
- AWS AI Stack – A ready-to-use, full-stack boilerplate project for building serverless AI applications on AWS☆991Updated 10 months ago
- SOTA Open Source TTS☆23,011Updated this week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.☆16,591Updated last week
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆4,266Updated last month
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆4,767Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆13,271Updated 2 weeks ago
- ☆30Updated last year
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆8,580Updated last year
- These are a simple calculators that I created with Python.☆11Updated last year
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆4,061Updated last month
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆8,334Updated last year
- Taming Stable Diffusion for Lip Sync!☆4,953Updated 3 months ago
- Inference and training library for high-quality TTS models.☆5,422Updated 9 months ago
- 🌊 Swift Executor is a cutting-edge script executor for Windows, built with Roblox players in mind. Featuring smart AI assistance for eas…☆39Updated 5 months ago
- 🌊 Thunder Executor is a cutting-edge script executor for Windows, built with Roblox players in mind. Featuring smart AI assistance for e…☆38Updated 5 months ago
- 🌊 Nihon Executor is a cutting-edge script executor for Windows, built with Roblox players in mind. Featuring smart AI assistance for eas…☆37Updated 5 months ago
- Real time interactive streaming digital human☆6,543Updated last week
- MARS5 speech model (TTS) from CAMB.AI☆2,797Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆6,805Updated 9 months ago
- Text-to-Music Generation with Rectified Flow Transformers☆1,707Updated 9 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model.☆34,497Updated 5 months ago
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,354Updated 8 months ago
- Multilingual Voice Understanding Model☆6,676Updated last month
- Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…☆7,532Updated this week
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆9,409Updated 4 months ago
- LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…☆3,070Updated 4 months ago
- Foundational model for human-like, expressive TTS☆4,161Updated last year
- SRV404 Building Your Own ML Application with AWS Lambda and Amazon SageMaker☆16Updated 6 years ago
- This repo provides Generative AI and AI/ML code samples, blueprints (end-to-end solutions) and proof of concepts oriented to the LATAM ma…☆55Updated 2 weeks ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…☆8,945Updated this week