proger / mamba-cpu
☆18Updated last year
Alternatives and similar repositories for mamba-cpu
Users that are interested in mamba-cpu are comparing it to the libraries listed below
- python bindings for symphonia/opus - read various audio formats from python and write opus files☆72Updated 5 months ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation☆16Updated last year
- Code for the blog "Neural audio codecs: how to get audio into LLMs"☆140Updated 2 months ago
- This project trains an RWKV LLM for TTS generation that is compatible with other TTS engines (like fish/cosy/chattts).☆91Updated 2 months ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆49Updated 5 months ago
- Implementation of Google's USM speech model in Pytorch☆34Updated 2 months ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement.☆19Updated last year
- Streaming Audiotransformers for online Audio tagging☆49Updated last year
- Official Code for ParrotTTS☆58Updated last year
- Official code for "DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptat…☆206Updated last month
- Training and inference code for a reproduction of Google's speech restoration model Miipher-2. Pretrained models are also released.☆30Updated 5 months ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 7 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆106Updated 7 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆44Updated 3 months ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆84Updated 6 months ago
- GPT-style network for phonemization with durations of text☆68Updated last year
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆137Updated 2 months ago
- ☆29Updated 5 months ago
- MNN ASR demo.☆23Updated 9 months ago
- Official repository of Wavehax vocoder☆65Updated last week
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆28Updated 11 months ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆74Updated 6 months ago
- ☆111Updated 2 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆61Updated 3 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions☆52Updated 4 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- [EMNLP Main '25] LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆141Updated 7 months ago
- Apply Score diffusion to improve speech signals recorded under various adverse conditions and distortions, including noise, reverberation…☆74Updated last year
- ☆103Updated 2 months ago
- ☆44Updated 2 years ago