Eureka-Audio: A 1.7B lightweight audio–language model that matches 7B–30B models on ASR, audio understanding, and paralinguistic reasoning.
☆40Apr 11, 2026Updated last month
Alternatives and similar repositories for Eureka-Audio
Users that are interested in Eureka-Audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning☆15Jun 23, 2024Updated last year
- open-source Mandarian biased word dataset☆14Sep 21, 2023Updated 2 years ago
- Acoustic echo cancelation(AEC) is a main algorithm in the pipe line of acoustic devices with KWS or ASR. FNLMS is used.☆19Apr 22, 2019Updated 7 years ago
- ☆116Oct 21, 2025Updated 6 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- MichiAI: A Low Latency, Full Duplex Speech LLM with zero coherence loss☆101Apr 24, 2026Updated 3 weeks ago
- ☆87Sep 25, 2025Updated 7 months ago
- ☆27Jan 16, 2023Updated 3 years ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆56Mar 20, 2026Updated last month
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 10 months ago
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 10 months ago
- A real-time and multilingual speech translation model☆247Feb 13, 2026Updated 3 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆45Oct 28, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆15Dec 22, 2022Updated 3 years ago
- Open repository of simulated Room Impulse Responses (RIR) accompanying the paper "Hearing Anywhere in Any Environment"☆77Aug 11, 2025Updated 9 months ago
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆47Mar 3, 2025Updated last year
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆47Sep 19, 2025Updated 8 months ago
- Audio samples for the paper 'Phase-aware music super-resolution using generative adversarial networks'☆14May 15, 2020Updated 6 years ago
- This thesis applies an autoencoder deep neural network to the multichannel speech enhancement problem. It takes the problem from dataset …☆13Sep 1, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official repository TimeAudio, a comprehensive framework that incorporates fine-grained acoustic cues into LALMs with enhanced module…☆28Nov 18, 2025Updated 6 months ago
- MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations☆39Oct 15, 2025Updated 7 months ago
- Huawei Ascend Mate 7 kernel tree☆12Sep 20, 2016Updated 9 years ago
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- Matlab implementation of the popular room acoustic model "image method", with the addition of randomisation to remove sweeping echoes (if…☆14Aug 29, 2020Updated 5 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆52May 1, 2025Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆91Dec 20, 2024Updated last year
- ☆19Jan 14, 2019Updated 7 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Dec 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- BAE-NET: A LOW COMPLEXITY AND HIGH FIDELITY BANDWIDTH-ADAPTIVE NEURAL NETWORK FOR SPEECH SUPER-RESOLUTION☆80Aug 20, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation (ACM MM 2024)☆20Mar 17, 2025Updated last year
- This repository is developed in MATLAB. Speech Augmentation is based on Adaptive Filtering while Endpoint Detection is based on Voice Act…☆10Dec 7, 2020Updated 5 years ago
- ICASSP2026 HumDial Challenge☆43Dec 13, 2025Updated 5 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆79Nov 1, 2024Updated last year
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆71Mar 31, 2026Updated last month