proger / mamba-cpu
☆11 · Updated 8 months ago
Related projects:
- A fast RWKV tokenizer written in Rust · ☆34 · Updated 2 weeks ago
- PyTorch video decoding · ☆47 · Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models · ☆34 · Updated 3 weeks ago
- GitHub repo for Peifeng's internship project · ☆12 · Updated 10 months ago
- A toolkit that enhances PyTorch with specialized functions for low-bit quantized neural networks · ☆24 · Updated 2 months ago
- A Chinese large language model fine-tuned entirely on ChatGPT-generated data to chat in a Western-translationese style · ☆16 · Updated 5 months ago
- XVERSE-MoE-A36B: a multilingual large language model developed by XVERSE Technology Inc. · ☆31 · Updated last week
- A library that simplifies fine-tuning on multi-GPU setups in the Hugging Face ecosystem · ☆15 · Updated 3 months ago
- ☆40 · Updated 2 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning · ☆54 · Updated last month
- First-principle implementations of various AI algorithms using a wide range of deep learning frameworks, accompanied by relevant research… · ☆24 · Updated this week
- Trying to deconstruct RWKV in understandable terms · ☆14 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite · ☆33 · Updated 6 months ago
- Implementation of Mamba in Rust · ☆69 · Updated 6 months ago
- ☆42 · Updated 3 weeks ago
- Zeta implementation of a reusable, plug-and-play feedforward layer from the paper "Exponentially Faster Language Modeling" · ☆15 · Updated last week
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (official code) · ☆118 · Updated 2 weeks ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers · ☆57 · Updated 4 months ago
- ☆22 · Updated 3 months ago
- Implementation of the Mamba SSM with hf_integration · ☆55 · Updated 3 weeks ago
- Evaluation of the BM42 sparse indexing algorithm · ☆60 · Updated 2 months ago
- A transformer-based multimodal model for music · ☆27 · Updated last month
- ☆50 · Updated 3 months ago
- Download full or partial git-lfs repos without temporarily using 2x disk space · ☆30 · Updated 11 months ago
- Implementation of https://arxiv.org/pdf/2312.09299 · ☆19 · Updated 2 months ago
- Fast LLM training codebase with dynamic strategy selection (DeepSpeed + Megatron + FlashAttention + CUDA fused kernels + compiler) · ☆32 · Updated 8 months ago
- Here we will test various linear attention designs · ☆55 · Updated 4 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs · ☆38 · Updated 3 months ago
- Data preparation code for the CrystalCoder 7B LLM · ☆42 · Updated 4 months ago
- QuIP quantization · ☆41 · Updated 6 months ago