Text-to-Speech Recipe
Text-to-speech (TTS), also referred to as speech synthesis, lets users generate a speech signal from input text. SpeechBrain supports popular TTS models (e.g., Tacotron 2) and vocoder models (e.g., HiFi-GAN).
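The repository's own code is not shown here, so the following is only a minimal sketch of how a Tacotron 2 plus HiFi-GAN pipeline is typically run with SpeechBrain. It assumes the publicly hosted LJSpeech checkpoints `speechbrain/tts-tacotron2-ljspeech` and `speechbrain/tts-hifigan-ljspeech`, the SpeechBrain 1.0 import paths (older releases exposed the same classes under `speechbrain.pretrained`), and a 22050 Hz output rate. The function name `synthesize`, the output path, and the `savedir` folders are hypothetical.

```python
# Assumed checkpoints: SpeechBrain's pretrained LJSpeech models on Hugging Face.
TTS_SOURCE = "speechbrain/tts-tacotron2-ljspeech"
VOCODER_SOURCE = "speechbrain/tts-hifigan-ljspeech"


def synthesize(text, out_path="example_tts.wav", sample_rate=22050):
    """Convert text to a WAV file: Tacotron 2 predicts a mel spectrogram,
    then HiFi-GAN turns that spectrogram into a waveform."""
    # Imported here so the module loads even when the heavy
    # dependencies (torch, torchaudio, speechbrain) are not installed.
    import torchaudio
    from speechbrain.inference.TTS import Tacotron2
    from speechbrain.inference.vocoders import HIFIGAN

    # Download (on first use) and load both pretrained models.
    tacotron2 = Tacotron2.from_hparams(
        source=TTS_SOURCE, savedir="pretrained_models/tts-tacotron2-ljspeech"
    )
    hifi_gan = HIFIGAN.from_hparams(
        source=VOCODER_SOURCE, savedir="pretrained_models/tts-hifigan-ljspeech"
    )

    # Text -> mel spectrogram (alignment and length are not needed here).
    mel_output, mel_length, alignment = tacotron2.encode_text(text)

    # Mel spectrogram -> waveform with shape [batch, 1, time].
    waveforms = hifi_gan.decode_batch(mel_output)

    # Drop the channel dimension to get [batch, time] and write the file.
    torchaudio.save(out_path, waveforms.squeeze(1), sample_rate)
    return out_path
```

Calling `synthesize("Hello, world.")` downloads the two checkpoints on first use and writes `example_tts.wav` to the working directory.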
☆19Dec 16, 2024Updated last year
Alternatives and similar repositories for Speech
Users who are interested in Speech are comparing it to the libraries listed below.
- Extends Model Context Protocol (MCP) to local LLMs via Ollama, enabling Claude-like tool use (files, web, email, GitHub, AI images) while…☆26Jun 17, 2025Updated 10 months ago
- ☆10Jul 1, 2019Updated 6 years ago
- MongoDB with Pymongo Tutorial☆10Apr 19, 2024Updated 2 years ago
- Python scripts for AI voice changers☆14Apr 25, 2023Updated 3 years ago
- Image Captioning Agent using Mistral 7B☆12Dec 1, 2023Updated 2 years ago
- The repository extends sam-3d-body with the main purpose of mapping the outputs of the estimation into fbx files, allowing posing and rem…☆24Apr 27, 2026Updated last week
- Using PDFPlumber for PDF data extraction☆12May 31, 2017Updated 8 years ago
- ☆25Feb 16, 2026Updated 2 months ago
- A set of Python client examples and utils for https://github.com/carla-simulator/carla☆12Dec 10, 2019Updated 6 years ago
- ESPNet TTS with Streamlit GUI☆14Apr 30, 2023Updated 3 years ago
- Speech generator using WaveGAN☆13Feb 9, 2024Updated 2 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- ☆26Dec 11, 2025Updated 4 months ago
- Code for CVPR19 paper "Monocular Total Capture: Posing Face, Body and Hands in the Wild"☆12May 14, 2021Updated 4 years ago
- DeepND ASD & ID☆12Apr 29, 2022Updated 4 years ago
- chinese_tacotron-2☆12Feb 27, 2018Updated 8 years ago
- ☆18Apr 26, 2024Updated 2 years ago
- Code for building the official VOICEVOX website☆14Updated this week
- AI based, large scale, HD material acquisition and creation from a couple pictures.☆11Nov 2, 2021Updated 4 years ago
- transfers weights/colors/positions from one mesh to the other (with arbitrary topology)☆12Jun 12, 2019Updated 6 years ago
- This is an official repository for the Article Generation app using Llama2, Pexels, and Streamlit.☆13Aug 5, 2023Updated 2 years ago
- This repository is the official implementation of ICASSP2024 paper: Highlight removal network based on an improved dichromatic reflection…☆14Apr 18, 2024Updated 2 years ago
- This is an introduction to Retrieval-Augmented Generation (RAG) for beginners. It uses Llama 2 LLM, FAISS vector store, and LangChain as…☆17Jul 8, 2025Updated 10 months ago
- system capable of autonomously chasing another vehicle. Tested in CARLA and with RC cars.☆22Sep 10, 2020Updated 5 years ago
- Python profiling tool☆17Jan 28, 2025Updated last year
- ☆14Oct 11, 2024Updated last year
- ☆20Jan 7, 2024Updated 2 years ago
- Author's implementation of learning virtual chimeras by dynamic motion reassembly (SIGGRAPH Asia 2022 Technical Paper)☆15Feb 20, 2023Updated 3 years ago
- Speech synthesis with your own voice☆17Mar 4, 2019Updated 7 years ago
- Keras version of Realtime Multi-Person Pose Estimation project☆15Nov 29, 2018Updated 7 years ago
- ☆10Aug 14, 2020Updated 5 years ago
- TF Mesh Renderer☆15Dec 25, 2019Updated 6 years ago
- Open Source bro of my conversational agent. https://github.com/mfmezger/conversational-agent-langchain☆14Feb 5, 2024Updated 2 years ago
- ☆17Jun 3, 2024Updated last year
- ☆20Mar 6, 2021Updated 5 years ago
- [ACL-IJCNLP 2021] "EarlyBERT: Efficient BERT Training via Early-bird Lottery Tickets" by Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, …☆18Dec 30, 2021Updated 4 years ago
- Implementation of vocoders empowered with pytorch lightning☆18Jan 27, 2024Updated 2 years ago
- An application with all sorts of data about COVID-19 and helpful information for users☆20May 12, 2020Updated 5 years ago
- Code for paper "Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition"☆28Jun 21, 2023Updated 2 years ago