michaelneri / unsupervised-audio-anomaly-detection
Official repository of the work "Low-complexity Unsupervised Audio Anomaly Detection exploiting Separable Convolutions and Angular Loss" published in IEEE Sensors Letters.
☆10Nov 6, 2024Updated last year
Alternatives and similar repositories for unsupervised-audio-anomaly-detection
Users who are interested in unsupervised-audio-anomaly-detection are comparing it to the repositories listed below
- Noisy-ArcMix: Additive Noisy Angular Margin Loss Combined With Mixup for Anomalous Sound Detection☆29Aug 22, 2024Updated last year
- ☆11Aug 11, 2023Updated 2 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- text to speech☆10Mar 19, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- ☆13Nov 22, 2022Updated 3 years ago
- Onset-and-Offset-Aware Sound Event Detection☆20Feb 10, 2025Updated last year
- ☆11Nov 7, 2024Updated last year
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 5 months ago
- ☆11Jul 6, 2022Updated 3 years ago
- Implementation of CGMM-MVDR beamforming used for Clarity challenge☆13Jan 14, 2022Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- ☆15Nov 11, 2024Updated last year
- DysfluentWFST☆17Nov 13, 2025Updated 3 months ago
- Neural model for prediction of stress position in Russian words☆12Jun 22, 2025Updated 7 months ago
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆15Dec 10, 2024Updated last year
- ☆15Nov 10, 2025Updated 3 months ago
- Bilibili video audio-track downloader (supports multi-part videos). Rebuilt from https://github.com/Quandong-Zhang/bilibiliAudioDownloader.ps1 with Python☆11Jul 31, 2025Updated 6 months ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes☆11Oct 19, 2023Updated 2 years ago
- ☆14Aug 1, 2025Updated 6 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆35Aug 1, 2025Updated 6 months ago
- A piano music dataset with Audio, Symbolic and Text labels☆33Mar 6, 2025Updated 11 months ago
- VITS2 using Phoneme-Level Japanese BERT☆14Dec 17, 2023Updated 2 years ago
- ☆15Jun 22, 2025Updated 7 months ago
- ☆11Oct 14, 2023Updated 2 years ago
- Audio-Visual Speech Enhancement Challenge (AVSE) 2024☆12Feb 6, 2026Updated last week
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆19Jun 6, 2025Updated 8 months ago
- ☆15Apr 2, 2025Updated 10 months ago
- ☆13Jan 11, 2026Updated last month
- ☆15Mar 31, 2025Updated 10 months ago
- ESLTTS dataset☆16Feb 6, 2025Updated last year
- ☆19Sep 20, 2024Updated last year
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.☆14Aug 22, 2023Updated 2 years ago
- ☆32Oct 23, 2025Updated 3 months ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated last week