ImPavloh / WhiTTsper-The-LoraLinks
Demo combining Whisper for speech recognition and Google TTS for speech synthesis to interact with Alpaca-LoRA.
☆20Updated last year
Alternatives and similar repositories for WhiTTsper-The-Lora
Users that are interested in WhiTTsper-The-Lora are comparing it to the libraries listed below
Sorting:
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Updated last month
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated last year
- ☆19Updated 10 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ…☆28Updated 3 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆16Updated last year
- Audio tokenization, in the fastest way possible!☆53Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Open TTS models, built for streaming on the edge☆44Updated 10 months ago
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20Updated 7 months ago
- Universal text classifier for generative models☆24Updated last year
- A lightweight Python library for running TTS models with a unified API.☆21Updated 10 months ago
- Implementation of Google's USM speech model in Pytorch☆34Updated this week
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆60Updated last month
- Project of Singing Voice Conversion.☆15Updated 2 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 3 years ago
- An Implementation of "Orca: Progressive Learning from Complex Explanation Traces of GPT-4"☆43Updated last year
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Updated last year
- ☆14Updated 2 years ago
- Merge LLM that are split in to parts☆27Updated 5 months ago
- Finetune any model on HF in less than 30 seconds☆56Updated this week
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 3 years ago
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆22Updated 5 months ago
- Dippy Synthetic Speech Subnet☆17Updated 4 months ago
- ☆41Updated 6 months ago