knoriy / CLARA
☆62Updated 8 months ago
Alternatives and similar repositories for CLARA:
Users that are interested in CLARA are comparing it to the libraries listed below
- The demo page of UniAudio☆33Updated last year
- Open TTS models, built for streaming on the edge☆39Updated 2 weeks ago
- ☆104Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆61Updated 3 weeks ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆68Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- ☆59Updated last year
- Audio tokenization, in the fastest way possible!☆49Updated 7 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆37Updated this week
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 4 months ago
- Codebase and project page for EDMSound☆34Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 10 months ago
- Collection of scripts from mHuBERT-147.☆24Updated 4 months ago
- a Frontier Japanese Speech Generation net☆28Updated 3 weeks ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆84Updated 3 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆33Updated last month
- ☆39Updated last month
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- small audio language model for reasoning☆50Updated last week
- ☆35Updated 11 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆186Updated last week
- ☆254Updated last year
- ☆84Updated last year
- Implementation of Google's USM speech model in Pytorch☆30Updated 2 months ago
- ☆39Updated 11 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆81Updated 7 months ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆143Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆91Updated this week
- VoiceBox neural network implementation☆105Updated 8 months ago