FLM-Audio is an audio-language sub-version of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.
☆64 · Dec 9, 2025 · Updated 2 months ago
Alternatives and similar repositories for flm-audio
Users who are interested in flm-audio are comparing it to the libraries listed below.
- [arXiv 2025] ObjFiller-3D: Consistent Multi-view 3D Inpainting via Video Diffusion Models ☆36 · Aug 26, 2025 · Updated 6 months ago
- Toolbox for Evaluation of AEC/AES Systems ☆33 · Feb 18, 2026 · Updated 2 weeks ago
- TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks ☆22 · Jan 19, 2026 · Updated last month
- ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation. ☆12 · Nov 17, 2023 · Updated 2 years ago
- Compute the most likely permutation of a lattice given an LM ☆10 · Jan 3, 2013 · Updated 13 years ago
- Processing for Hearing-Assistive/Augmented-reality Devices (HADES) ☆13 · Jan 13, 2026 · Updated last month
- ☆13 · Jan 14, 2025 · Updated last year
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆28 · Sep 20, 2025 · Updated 5 months ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti… ☆105 · Updated this week
- A simple Python script to convert FOA audio to binaural. ☆15 · Nov 29, 2022 · Updated 3 years ago
- Generate an accompaniment part with chords using an evolutionary algorithm. ☆11 · May 8, 2022 · Updated 3 years ago
- ☆30 · Jan 22, 2026 · Updated last month
- ☆26 · Jan 23, 2026 · Updated last month
- Subband PCA feature calculation ☆16 · Nov 5, 2018 · Updated 7 years ago
- ☆16 · Feb 6, 2020 · Updated 6 years ago
- Lightweight Git Large File Storage fetcher written in Python ☆34 · Apr 7, 2023 · Updated 2 years ago
- Multichannel Acoustic Signal Processing library ☆37 · Jun 1, 2020 · Updated 5 years ago
- Master's thesis project in collaboration with Rasa, focusing on knowledge distillation from BERT into different very small networks and a… ☆13 · Sep 30, 2022 · Updated 3 years ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid ☆17 · Aug 31, 2023 · Updated 2 years ago
- OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model ☆112 · Jul 17, 2025 · Updated 7 months ago
- Digital Audio Effects in Python (material for MUSI6202@Georgiatech) ☆15 · Nov 30, 2014 · Updated 11 years ago
- This repository contains the training code from the paper "SpidR Learning Fast and Stable Linguistic Units for Spoken Language Models Without… ☆52 · Feb 4, 2026 · Updated last month
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching ☆45 · Feb 9, 2025 · Updated last year
- Filter Bank Implementation as Convolutional Neural Network using Python Keras ☆17 · Dec 18, 2024 · Updated last year
- ☆96 · Feb 4, 2026 · Updated last month
- Sound field estimation based on physics-constrained neural kernel ☆21 · Jun 9, 2025 · Updated 8 months ago
- ☆53 · Dec 7, 2025 · Updated 2 months ago
- MAIR is an open-access library of an extensive set of room impulse responses (RIRs) captured using a total of 40+ microphone techniques f… ☆20 · Apr 14, 2019 · Updated 6 years ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD… ☆76 · Jan 25, 2026 · Updated last month
- ☆78 · Sep 25, 2025 · Updated 5 months ago
- MSR Identity Toolkit v1.0 ☆17 · Aug 18, 2017 · Updated 8 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆35 · Sep 9, 2025 · Updated 5 months ago
- ☆18 · Jan 31, 2020 · Updated 6 years ago
- The implementation of TaylorBeamformer, which is in submission to Interspeech 2022 ☆48 · Jun 10, 2022 · Updated 3 years ago
- ☆20 · Nov 22, 2020 · Updated 5 years ago
- XMOS based MEMS microphone array to USB interface for 16 microphones ☆20 · Feb 26, 2021 · Updated 5 years ago
- PyTorch implementation of DPCRN ☆28 · Mar 31, 2024 · Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024) ☆43 · Jun 13, 2024 · Updated last year
- ☆18 · Sep 5, 2024 · Updated last year