cofe-ai / flm-audioLinks
FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.
☆26Updated last week
Alternatives and similar repositories for flm-audio
Users that are interested in flm-audio are comparing it to the libraries listed below
Sorting:
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆83Updated 2 months ago
- Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆157Updated 2 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆38Updated 2 weeks ago
- LSLM implements full duplex modeling in interactive speech language models, based on research by Ma et al. (2024). This project advances …☆76Updated 3 months ago
- A Foundation Model for Industrial Signal Comprehensive Representation☆43Updated 2 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆179Updated 2 weeks ago
- ☆39Updated 2 months ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆85Updated last week
- ☆99Updated this week
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆177Updated last year
- Github repository for ACL 2025 paper: Recent Advances in Speech Language Models: A Survey.☆139Updated 3 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆81Updated last week
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆211Updated 7 months ago
- ☆78Updated 5 months ago
- A collection of optimized utilities for text-to-audio processing, enhancing both training and inference workflows. This repository contai…☆39Updated 6 months ago
- SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems☆84Updated last year
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…☆68Updated last week
- ☆14Updated last year
- LUCY: Linguistic Understanding and Control Yielding Early Stage of Her☆55Updated 5 months ago
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆95Updated 2 weeks ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆43Updated last week
- ☆41Updated 7 months ago
- An easy-to-use, fast, and easily integrable tool for evaluating audio LLM☆149Updated 2 weeks ago
- ☆37Updated 6 months ago
- ☆26Updated last month
- ☆50Updated 6 months ago
- small audio language model for reasoning☆76Updated 5 months ago
- ☆28Updated 3 months ago
- ☆61Updated last week
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"☆80Updated 4 months ago