☆25Jan 2, 2024Updated 2 years ago
Alternatives and similar repositories for MISP-2023-Challenge-Baseline
Users that are interested in MISP-2023-Challenge-Baseline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Jun 26, 2023Updated 2 years ago
- ☆17Jan 26, 2021Updated 5 years ago
- Training data simulation☆59May 6, 2024Updated 2 years ago
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆56Dec 6, 2023Updated 2 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆62Sep 19, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆61Feb 12, 2025Updated last year
- A pytorch template for beginners based on pytorch_lightning☆49Feb 1, 2024Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆84May 21, 2025Updated 11 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆16Jun 16, 2024Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆40Oct 27, 2025Updated 6 months ago
- NeMo: a toolkit for conversational AI☆13May 4, 2024Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆85Jun 17, 2025Updated 10 months ago
- Single-Channel Dereverberation in Matlab☆39Nov 13, 2018Updated 7 years ago
- A simple package for Guided source separation (GSS)☆134May 20, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Codebase of the submitted work in ICASSP 2023☆14Nov 30, 2022Updated 3 years ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Jan 26, 2024Updated 2 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆114Aug 29, 2024Updated last year
- To overcome the limitation and obtain more appropriate control filters, a generative fixed-filter active noise control (GFANC) approach i…☆37Aug 28, 2025Updated 8 months ago
- CLEAR benchmark (NeurIPS 2021 Dataset & Benchmark)☆28Apr 23, 2023Updated 3 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆260Dec 12, 2025Updated 4 months ago
- Deep-Learning-Based Audio-Visual Speech Enhancement and Separation☆220Apr 16, 2023Updated 3 years ago
- Official repository for U-SAM (Interspeech 2025)☆27Jun 3, 2025Updated 11 months ago
- ☆12May 22, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unofficial Pytorch Lightning Implementation of "Towards Robust Speech Super-Resolution"☆10May 8, 2023Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- ☆24Mar 18, 2024Updated 2 years ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Oct 31, 2022Updated 3 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆59May 29, 2023Updated 2 years ago
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆46Feb 17, 2026Updated 2 months ago
- ☆31Aug 28, 2022Updated 3 years ago
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆50May 14, 2025Updated 11 months ago
- ☆17Jan 1, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆79Oct 18, 2022Updated 3 years ago
- This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.☆14Mar 14, 2023Updated 3 years ago
- Here we will track the latest Audio AI Agent, including speech, music, sound effects, etc.☆16Dec 8, 2023Updated 2 years ago
- A description of "RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization" [NeurI…☆162Apr 29, 2025Updated last year
- ☆13Oct 25, 2024Updated last year
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- ☆12Nov 1, 2024Updated last year