Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasoning.
☆35Feb 28, 2026Updated 2 weeks ago
Alternatives and similar repositories for Eureka-Audio
Users who are interested in Eureka-Audio are comparing it to the repositories listed below
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- Open-source Mandarin biased word dataset☆14Sep 21, 2023Updated 2 years ago
- Acoustic echo cancellation (AEC) is a core algorithm in the pipeline of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 6 years ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆86Feb 6, 2026Updated last month
- ☆113Oct 21, 2025Updated 4 months ago
- ☆77Sep 25, 2025Updated 5 months ago
- A Chinese singing voice dataset: professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆46Nov 8, 2025Updated 4 months ago
- ☆27Jan 16, 2023Updated 3 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 8 months ago
- Semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- A real-time and multilingual speech translation model☆195Feb 13, 2026Updated last month
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"☆71Aug 11, 2025Updated 7 months ago
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆34Oct 15, 2025Updated 5 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆44Mar 3, 2025Updated last year
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 6 months ago
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 5 years ago
- This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …☆12Sep 1, 2022Updated 3 years ago
- Huawei Ascend Mate 7 kernel tree☆12Sep 20, 2016Updated 9 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- Matlab implementation of the popular room acoustic model "image method", with the addition of randomisation to remove sweeping echoes (if…☆14Aug 29, 2020Updated 5 years ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆89Dec 20, 2024Updated last year
- ☆19Jan 14, 2019Updated 7 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆26Dec 12, 2024Updated last year
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- This repository is developed in MATLAB. Speech Augmentation is based on Adaptive Filtering while Endpoint Detection is based on Voice Act…☆10Dec 7, 2020Updated 5 years ago
- ICASSP2026 HumDial Challenge☆36Dec 13, 2025Updated 3 months ago
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆64Sep 18, 2025Updated 6 months ago
- Script to demonstrate how to use a Language Model for Semantic Turn Detection. Refer to blog post for full details.☆17May 9, 2025Updated 10 months ago
- Converts WebRTC's AGC to MATLAB code for researchers to study☆36Dec 10, 2022Updated 3 years ago
- ☆29Nov 4, 2025Updated 4 months ago
- ☆31Mar 3, 2026Updated 2 weeks ago