proger / mamba-cpu
☆18Updated 2 years ago
Alternatives and similar repositories for mamba-cpu
Users that are interested in mamba-cpu are comparing it to the libraries listed below
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- Conformer block with Rotary Position Embedding, modified from lucidrains' implement☆16Updated last year
- GPT-style network for phonemization with durations of text☆68Updated last year
- A project to train an RWKV LLM for TTS generation that is compatible with other TTS engines (such as fish/cosy/chattts).☆94Updated 4 months ago
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆77Updated last month
- LongCat Audio Tokenizer and Detokenizer☆285Updated last week
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆149Updated last week
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆149Updated 3 months ago
- Audio tokenization, in the fastest way possible!☆53Updated last year
- ☆29Updated 7 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- trying to reproduce suno v3☆35Updated last year
- Official Code for ParrotTTS☆58Updated last year
- Training and inference code for a reproduction of Google's speech restoration model Miipher-2. Trained models are also released.☆30Updated 6 months ago
- ☆86Updated last year
- Official repository of Wavehax vocoder☆66Updated last month
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 8 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆44Updated 10 months ago
- List of Large Language Model Papers☆60Updated 2 years ago
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Updated last year
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110Updated 8 months ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆50Updated this week
- ☆19Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Updated 3 months ago
- Blazing fast data loading with HuggingFace Dataset and Ray Data☆16Updated 2 years ago
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Updated 10 months ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Updated 2 years ago
- ☆106Updated 4 months ago
- Implementation of Google's USM speech model in Pytorch☆34Updated 3 weeks ago
- Official code for "DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…☆225Updated 2 months ago