☆25Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for MISP-2023-Challenge-Baseline
Users that are interested in MISP-2023-Challenge-Baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆32Jun 26, 2023Updated 2 years ago
- ☆17Jan 26, 2021Updated 5 years ago
- Training data simulation☆59May 6, 2024Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆55Dec 6, 2023Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆61Sep 19, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆60Feb 12, 2025Updated last year
- A pytorch template for beginners based on pytorch_lightning☆49Feb 1, 2024Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆83May 21, 2025Updated 10 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆39Oct 27, 2025Updated 5 months ago
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated last year
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 9 months ago
- Single-Channel Dereverberation in Matlab☆39Nov 13, 2018Updated 7 years ago
- A simple package for Guided source separation (GSS)☆133May 20, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Codebase of the submitted work in ICASSP 2023☆14Nov 30, 2022Updated 3 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Jan 26, 2024Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆112Aug 29, 2024Updated last year
- To overcome the limitation and obtain more appropriate control filters, a generative fixed-filter active noise control (GFANC) approach i…☆36Aug 28, 2025Updated 7 months ago
- CLEAR benchmark (NeurIPS 2021 Dataset & Benchmark)☆28Apr 23, 2023Updated 2 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆219Apr 16, 2023Updated 2 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 9 months ago
- ☆12May 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"☆10May 8, 2023Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- ☆24Mar 18, 2024Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆45Feb 17, 2026Updated last month
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 10 months ago
- ☆31Aug 28, 2022Updated 3 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆155Apr 29, 2025Updated 11 months ago
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.☆14Mar 14, 2023Updated 3 years ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Dec 8, 2023Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- A Python toolkit for data-driven HRTF research☆16Feb 6, 2025Updated last year