qiuqiangkong / music_llmLinks
☆47Updated 4 months ago
Alternatives and similar repositories for music_llm
Users that are interested in music_llm are comparing it to the libraries listed below
Sorting:
- ☆98Updated last month
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆38Updated 11 months ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆18Updated last year
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆26Updated 2 weeks ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆78Updated 5 months ago
- ☆24Updated 5 months ago
- ☆20Updated last month
- Streaming Vocos☆26Updated 4 months ago
- The demo page for ALMTokenizer☆48Updated last month
- small audio language model for reasoning☆64Updated last month
- ☆43Updated 11 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆20Updated 5 months ago
- ☆23Updated 7 months ago
- A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation☆32Updated this week
- ☆31Updated 11 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆36Updated last week
- A trainer for SNAC (Multi-Scale Neural Audio Codec) has replaced the decoder with Vocos.☆52Updated 7 months ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Updated last year
- (WIP)long form speech generatoins☆31Updated 2 months ago
- Official repo of ICASSP 2024 paper - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.☆55Updated 4 months ago
- Self-supervised Generative LM-based Voice Conversion☆36Updated last month
- Official implementation for FlowSep☆50Updated 5 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆72Updated last year
- ☆47Updated 2 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆59Updated 7 months ago
- ☆28Updated 3 weeks ago