gyx-gloria/DMT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/gyx-gloria/DMT)

gyx-gloria / DMT

Official Implementation of DMT: Dual Mean-Teacher in PyTorch.

☆10

Alternatives and similar repositories for DMT

Users that are interested in DMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dr-costas / SEDLM
View on GitHub
Language modelling for sound event detection
☆20Jan 2, 2020Updated 6 years ago
Graph-COM / Neural_Higher-order_Pattern_Prediction
View on GitHub
☆15Mar 4, 2022Updated 4 years ago
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
xiaoneil / LPNet
View on GitHub
☆13Nov 28, 2021Updated 4 years ago
facebookresearch / ego-env
View on GitHub
Human-centric environment representations from egocentric video
☆15Feb 5, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bingo-todd / WaveLoc
View on GitHub
End-to-End binaural sound localization
☆17Feb 27, 2020Updated 6 years ago
yandex-research / adaptive-diffusion
View on GitHub
[CVPR'2024] Adaptive Teacher-Student Collaboration for Text-Conditional Diffusion Models
☆33Apr 3, 2024Updated 2 years ago
Samyu0304 / thought-propagation
View on GitHub
Code and dataset for the ICLR 2024 paper "Thought Propagation: An analogical Approach to Complex Reasoning with Large Language Models."
☆17Mar 4, 2024Updated 2 years ago
DCASE2024-Task7-Sound-Scene-Synthesis / AudioLDM-training-finetuning
View on GitHub
AudioLDM training, finetuning, evaluation and inference.
☆14Mar 27, 2024Updated 2 years ago
mashijie1028 / TrustDD
View on GitHub
(Pattern Recognition 2025) Towards Trustworthy Dataset Distillation
☆14Dec 8, 2024Updated last year
zhshj0110 / SiT-MLP
View on GitHub
[TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…
☆19Apr 10, 2024Updated 2 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
ShengKuangCN / BAST
View on GitHub
☆18May 28, 2025Updated last year
NVlabs / FRAG
View on GitHub
☆15Apr 25, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
BolinLai / CSTS
View on GitHub
[ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".
☆16Feb 24, 2025Updated last year
dzhuang / sing-box-converter
View on GitHub
subconverter for singbox
☆12Dec 15, 2025Updated 7 months ago
sadPororo / AD-YOLO
View on GitHub
AD-YOLO: You Look Only Once in Training Multiple Sound Event Localization and Detection, IEEE ICASSP 2023
☆35Dec 21, 2025Updated 7 months ago
Qiangest / DeepEar
View on GitHub
DeepEar: Sound Localization with Binaural Microphones
☆16Nov 20, 2025Updated 8 months ago
Vicinity111 / DCE-RD
View on GitHub
☆17Aug 8, 2023Updated 2 years ago
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
Samyu0304 / LiSA
View on GitHub
Code for Mind the Label Shift of Augmentation-based Graph OOD generalization (LiSA) in CVPR 2023. LiSA is a model-agnostic Graph OOD fram…
☆16Jun 24, 2023Updated 3 years ago
d62lu / 3D-UMamba
View on GitHub
3D-UMamba: 3D U-Net with state space model for semantic segmentation of multi-source LiDAR point clouds
☆24Dec 12, 2024Updated last year
bingo-todd / GCC-PHAT_DNN_Loc
View on GitHub
DNN based binaural sound localization model, using GCC-PHAT as features
☆22Jun 13, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
XinyuSun / mlc-chatbot
View on GitHub
python interface for mlc chat cli
☆14May 7, 2023Updated 3 years ago
rain305f / OSP
View on GitHub
[CVPR 2023] Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning
☆22Jun 11, 2023Updated 3 years ago
jim-schwoebel / sound_event_detection
View on GitHub
🎵 A repository for manually annotating files to create labeled acoustic datasets for machine learning.
☆47Feb 20, 2022Updated 4 years ago
jiwonix / Sound-Event-Detection-papers
View on GitHub
Sound Event Detection (SED) paper collection
☆15Jun 26, 2024Updated 2 years ago
Minglu58 / TA2V
View on GitHub
☆16Dec 1, 2025Updated 7 months ago
HuangJuanLR / sd_plugin_tuto
View on GitHub
Substance 3D Designer Plugin Tutorial 2024
☆14Jan 11, 2025Updated last year
stogiannidis / srbench
View on GitHub
Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"
☆19Feb 1, 2026Updated 5 months ago
xiaomabufei / SKDF
View on GitHub
☆14Feb 21, 2024Updated 2 years ago
ChunmingHe / Camouflageator
View on GitHub
☆30Dec 2, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
MichWozPol / LEGO_StableDiffusion
View on GitHub
The project aim was to fine-tune the stable diffusion model in order to generate images in the LEGO style based on the prompt.
☆16Jun 7, 2023Updated 3 years ago
yiskw713 / VideoCaptioning
View on GitHub
video captioning using 3DCNN and LSTM (pytorch)
☆11Sep 26, 2019Updated 6 years ago
muuda / MUSIC-algorithm-for-circular-microphone-array
View on GitHub
通过单层圆形麦克风阵列采集音频，实现MUSIC算法的声源定位。
☆23Mar 16, 2023Updated 3 years ago
lavendery / AudioComposer
View on GitHub
☆27Sep 10, 2025Updated 10 months ago
yusunnny / CST-former
View on GitHub
CST-former: Transformer with Channel-Spectro-Temporal Attention for Sound Event Localization and Detection (ICASSP 2024)
☆39May 20, 2025Updated last year
zcxu-eric / Ego4d_TalkNet_ASD
View on GitHub
☆21Feb 15, 2022Updated 4 years ago
muuda / MFF-EINV2
View on GitHub
MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection
☆23Jul 17, 2024Updated 2 years ago