proger / mamba-cpu
☆12 Updated 10 months ago
Related projects
Alternatives and complementary repositories for mamba-cpu
- Implementation of a Light Recurrent Unit in Pytorch ☆46 Updated last month
- GPT-style network for phonemization with durations of text ☆62 Updated 8 months ago
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models' ☆13 Updated 3 months ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆16 Updated last week
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆29 Updated last month
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ☆50 Updated 3 years ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax ☆11 Updated 5 months ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR. ☆26 Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling" ☆15 Updated last week
- Supervoice Speaker Separation Network ☆13 Updated 5 months ago
- Audio tokenization, in the fastest way possible! ☆45 Updated 2 months ago
- ☆23 Updated last year
- GPT for FACodec ☆13 Updated 7 months ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data ☆15 Updated 10 months ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation ☆11 Updated 3 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆14 Updated 8 months ago
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R… ☆47 Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆33 Updated 8 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey". ☆93 Updated 2 months ago
- Speech enhancement in noisy and reverberant environments using deep neural networks ☆15 Updated last month
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech ☆22 Updated 3 months ago
- ☆35 Updated last month
- trying to reproduce suno v3 ☆25 Updated 7 months ago
- ☆20 Updated 3 weeks ago
- ☆9 Updated 5 months ago
- A dashboard for exploring timm learning rate schedulers ☆18 Updated last year
- Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation ☆11 Updated 3 months ago
- Official Code for ParrotTTS ☆42 Updated last month
- Codebase for ICLR '23 paper "wav2tok: Deep Sequence Tokenizer for Audio Retrieval" ☆32 Updated last year