ta012/DTFAT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ta012/DTFAT)

ta012 / DTFAT

[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification

☆12

Alternatives and similar repositories for DTFAT

Users that are interested in DTFAT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Sara-Ahmed / ASiT
View on GitHub
ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation
☆30Mar 10, 2024Updated 2 years ago
wilkinghoff / dcase2022
View on GitHub
Submission for task 2 "Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques"…
☆16Sep 19, 2022Updated 3 years ago
kaen2891 / stethoscope-guided_supervised_contrastive_learning
View on GitHub
(ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…
☆18Dec 5, 2024Updated last year
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
View on GitHub
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19Dec 5, 2024Updated last year
vinceasvp / meta-sc
View on GitHub
☆11May 30, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
zds-potato / multilingual-phonetic-sv
View on GitHub
☆10Dec 22, 2023Updated 2 years ago
YeongHyeon / Skip-GANomaly
View on GitHub
Implementation of Skip-GANomaly with MNIST dataset
☆11Nov 28, 2019Updated 6 years ago
midas-research / speechmix
View on GitHub
☆12Oct 2, 2020Updated 5 years ago
JongSuk1 / EquiAV
View on GitHub
☆36Jan 20, 2025Updated last year
bwrc / embla-r
View on GitHub
An R-package for reading physiologic signal data stored in the Embla Data Format (EBM).
☆10May 2, 2016Updated 10 years ago
APILASTRI / DCASE_Task2_UMINHO
View on GitHub
☆25Nov 21, 2022Updated 3 years ago
shinmura0 / DCASE2020_Task2_Solution-Anomaly_detection-
View on GitHub
DCASE2020 Task2 Self-supervised learning solution
☆34Feb 10, 2021Updated 5 years ago
syedajannatulferdous121 / transformer
View on GitHub
The MATLAB code implements a Transformer model, a recent innovation in deep neural networks. It includes modules for multi-head attention…
☆11Jul 5, 2023Updated 3 years ago
JinhuaLiang / LaD-ProtoNet
View on GitHub
☆16Sep 14, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
parth2170 / DCASE2020-Task2
View on GitHub
☆14Jun 18, 2020Updated 6 years ago
gefleury / datascientest_anomalous_sounds
View on GitHub
Anomalous sound detection with machine learning and deep learning
☆14Jun 24, 2024Updated 2 years ago
yushuai / FTANet-melodic
View on GitHub
This repository is the offical implementation for the paper 《Frequency-Temporal Attention Network for Singing Melody Extraction》.
☆40Sep 16, 2022Updated 3 years ago
looking-for-my-magic-bean / DCASE2020-TASK2-semi-VAE
View on GitHub
☆10Jun 20, 2020Updated 6 years ago
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
umbertocappellazzo / PETL_AST
View on GitHub
This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" [IEEE MLSP 2024] …
☆41Jul 31, 2024Updated last year
fschmid56 / EfficientAT
View on GitHub
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training …
☆353Nov 20, 2024Updated last year
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
View on GitHub
Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15Jan 20, 2025Updated last year
SarthakYadav / audio-mamba-official
View on GitHub
Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"
☆44Aug 14, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NeuroLexDiagnostics / Voice-Analysis-Pipeline
View on GitHub
Voice Analysis Pipeline for DigiPsych Lab
☆10Sep 15, 2019Updated 6 years ago
nttcslab / eval-audio-repr
View on GitHub
EVAR ~ Evaluation package for Audio Representations
☆81Feb 19, 2026Updated 5 months ago
WangHelin1997 / MaskSpec
View on GitHub
The Pytorch implementation of paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training
☆51Dec 17, 2024Updated last year
Kebii / Freehand-Genshin-Diffusion
View on GitHub
Transferring Genshin PVs into a freehand style with Diffusion Model.
☆10Jun 5, 2024Updated 2 years ago
liuyoude / AE-ASD
View on GitHub
Autoencoder(AE) based methods for anomalous sound detection(ASD)
☆13Jan 10, 2023Updated 3 years ago
ChihchengHsieh / multimodal-abnormalities-detection
View on GitHub
☆13Mar 10, 2023Updated 3 years ago
Jeremiah-210511 / GANormaly-Image-Anormaly-Detection
View on GitHub
基于GAN的自监督学习图像异常检测
☆18Dec 13, 2021Updated 4 years ago
chychen / tf2-ganomaly
View on GitHub
Tensorflow2 implementation of the paper GANomaly: Semi-Supervised Anomaly Detection via Adversarial Training
☆21Jan 18, 2021Updated 5 years ago
Torabiy / HLS-CMDS
View on GitHub
Heart and Lung Sounds Dataset Recorded from a Clinical Manikin using Digital Stethoscope (HLS-CMDS)
☆19May 13, 2026Updated 2 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
AHruler / Exploring-AAD
View on GitHub
Exploring possible methods for Audio Anomaly Detection - on machine sounds (MIMII dataset)
☆19Sep 12, 2025Updated 10 months ago
aeesha-T / parkinsons_prediction_using_speech
View on GitHub
☆18Nov 15, 2021Updated 4 years ago
Haochen-Wang409 / DropPos
View on GitHub
[NeurIPS'23] DropPos: Pre-Training Vision Transformers by Reconstructing Dropped Positions
☆61Apr 30, 2024Updated 2 years ago
Kvothe045 / Audio-Enhancer
View on GitHub
☆13Aug 3, 2025Updated 11 months ago
Chengyuann / Awesome-Anomalous-Sound-Detection-Methods
View on GitHub
paper for Anomalous sound detection
☆45Feb 27, 2026Updated 4 months ago
wilkinghoff / DCASE2023_task2
View on GitHub
Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (h…
☆18May 22, 2023Updated 3 years ago
KrishnaDN / Attentive-Statistics-Pooling-for-Deep-Speaker-Embedding
View on GitHub
Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch
☆49Jun 4, 2020Updated 6 years ago