JL-er / DiSHA
☆12Updated 2 months ago
Alternatives and similar repositories for DiSHA:
Users that are interested in DiSHA are comparing it to the libraries listed below
- This repo is an exploratory experiment to enable frozen pretrained RWKV language models to accept speech modality input. We followed the …☆48Updated 3 months ago
- A large-scale RWKV v6, v7(World, ARWKV, PRWKV) inference. Capable of inference by combining multiple states(Pseudo MoE). Easy to deploy o…☆33Updated this week
- RWKV-SpeechChat is a real-time dialogue script based on a frozen 3B RWKV model with trained adapters and initial states. Various trained …☆26Updated 2 months ago
- ☆32Updated this week
- A fork of the PEFT library, supporting Robust Adaptation (RoSA)☆13Updated 7 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆14Updated 9 months ago
- Repository for "TESS-2: A Large-Scale, Generalist Diffusion Language Model"☆33Updated last month
- Implementation of a Light Recurrent Unit in Pytorch☆47Updated 5 months ago
- Official implementation of the TTS model Lina-Speech☆157Updated 2 months ago
- DPO, but faster 🚀☆40Updated 3 months ago
- ☆18Updated 10 months ago
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆31Updated 7 months ago
- RWKV v5,v6 LoRA Trainer on Cuda and Rocm Platform. RWKV is a RNN with transformer-level LLM performance. It can be directly trained like …☆12Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆44Updated 2 weeks ago
- ☆35Updated 11 months ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆37Updated this week
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆52Updated 4 months ago
- RWKV, in easy to read code☆71Updated this week
- https://x.com/BlinkDL_AI/status/1884768989743882276☆27Updated last month
- Here we will test various linear attention designs.☆60Updated 11 months ago
- Official release of StyleTalk dataset.☆62Updated 8 months ago
- ☆11Updated 3 months ago
- A collections of audio codecs with a standardized API☆11Updated last month
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆63Updated 3 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆15Updated 4 months ago
- Experimental playground for benchmarking language model (LM) architectures, layers, and tricks on smaller datasets. Designed for flexible…☆17Updated this week
- QuIP quantization☆52Updated last year
- A low-bitrate single-codebook 16 kHz speech codec based on focal modulation☆81Updated last month
- Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".☆24Updated 8 months ago
- Evaluating LLMs with Dynamic Data☆78Updated last month