YoshikiMas / madeon-asrView external linksLinks
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆18Dec 1, 2024Updated last year
Alternatives and similar repositories for madeon-asr
Users that are interested in madeon-asr are comparing it to the libraries listed below
Sorting:
- ☆14Nov 26, 2024Updated last year
- ☆16Nov 9, 2023Updated 2 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- ConMamba for Automatic Speech Recognition☆102Aug 12, 2024Updated last year
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- ☆10Sep 2, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆14Feb 13, 2022Updated 4 years ago
- ☆11Mar 22, 2023Updated 2 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆39Mar 15, 2024Updated last year
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- Soniox Compare. Compare real-time voice AI side by side. No glossy charts, just results.☆18Jul 15, 2025Updated 7 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.☆19Nov 19, 2021Updated 4 years ago
- ☆17Oct 18, 2023Updated 2 years ago
- 端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等☆15Jun 4, 2021Updated 4 years ago
- ☆29Nov 4, 2025Updated 3 months ago
- ☆15Aug 25, 2022Updated 3 years ago
- ☆18Aug 23, 2024Updated last year
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆14Oct 10, 2024Updated last year
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- ☆18Mar 13, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆22Dec 21, 2024Updated last year
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Jun 18, 2022Updated 3 years ago
- Python implementation of a few speech intelligibility prediction algorithms☆15May 29, 2024Updated last year
- Microservice that generates subtitles for TUM-Live☆18Nov 23, 2025Updated 2 months ago
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43May 23, 2023Updated 2 years ago
- TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.☆26Jun 1, 2023Updated 2 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆51Oct 8, 2021Updated 4 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 9 months ago
- [ICASSP 2023] FedAudio: A Federated Learning Benchmark for Audio and Speech Tasks☆51Feb 21, 2024Updated last year
- Implementation for paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement☆22Sep 21, 2021Updated 4 years ago