shanwangshan/TAU-urban-audio-visual-scenes

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shanwangshan/TAU-urban-audio-visual-scenes)

shanwangshan / TAU-urban-audio-visual-scenes

☆12

Alternatives and similar repositories for TAU-urban-audio-visual-scenes

Users that are interested in TAU-urban-audio-visual-scenes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Blade6570 / Learningimage-to-imagetranslationusingpairedandunpairedtrainingsamples
View on GitHub
Learning image-to-image translation using paired and unpaired training samples
☆20May 25, 2021Updated 5 years ago
lukewys / dcase_2020_T6
View on GitHub
2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…
☆24Aug 3, 2023Updated 2 years ago
ffevotte / slurm.el
View on GitHub
Emacs extension to interact with the SLURM jobs scheduler
☆47Aug 6, 2021Updated 4 years ago
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
qiuchili / diasenti
View on GitHub
Conversational Multimodal Emotion Recognition
☆12Dec 7, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MNSfuxiang / MFN
View on GitHub
A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis
☆10Jan 13, 2024Updated 2 years ago
msh9184 / contrastive-equilibrium-learning
View on GitHub
☆21Apr 6, 2021Updated 5 years ago
Janie1996 / MSRFG
View on GitHub
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11Jan 17, 2023Updated 3 years ago
dragonzzl / MERC_rank3
View on GitHub
该仓库主要描述了CCAC2023多模态对话情绪识别评测第3名的实现过程
☆12Aug 11, 2024Updated last year
wsntxxn / AudioCaption
View on GitHub
Audio captioning recipe
☆53Oct 23, 2025Updated 9 months ago
facebookresearch / AVID-CMA
View on GitHub
Audio Visual Instance Discrimination with Cross-Modal Agreement
☆133Aug 13, 2021Updated 4 years ago
xiaomi1024 / code_SAMS
View on GitHub
☆13Jan 11, 2024Updated 2 years ago
audio-captioning / audio-captioning-resources
View on GitHub
A list of resources that can help in research for automated audio captioning
☆34Feb 17, 2021Updated 5 years ago
nii-yamagishilab / SpeechSPC-mini
View on GitHub
Speech Security and Privacy Compendium - Mini
☆10Jun 18, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
TJArk-Robotics / coderelease_2017
View on GitHub
TJArk CodeRelease 2017
☆11Feb 13, 2018Updated 8 years ago
XinhaoMei / DCASE2021_task6_v2
View on GitHub
Code for CVSSP submission to DCASE 2021 Task 6
☆36Nov 22, 2022Updated 3 years ago
krisbalintona / work-timer
View on GitHub
☆11Dec 4, 2025Updated 7 months ago
Shahabks / Machine-Learning-Algorithm-for-Voice-Analysis
View on GitHub
It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater
☆11Mar 8, 2019Updated 7 years ago
Richarn290 / SonarImageDatasets
View on GitHub
☆19Dec 8, 2024Updated last year
fuyahuii / ConSK-GCN
View on GitHub
The PyTorch code for paper: "CONSK-GCN: Conversational Semantic- and Knowledge-Oriented Graph Convolutional Network for Multimodal Emotio…
☆13Oct 21, 2022Updated 3 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
dglai / WSDM2022-Challenge
View on GitHub
WSDM2022 Challenge - Large scale temporal graph link prediction
☆38Jan 25, 2022Updated 4 years ago
FrankFundel / SGCond
View on GitHub
☆10Jun 28, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
LongLong-Jing / 3DRotNet
View on GitHub
Code for Self-supervised Spatiotemporal Feature Learning by Video Geometric Transformations
☆16Sep 11, 2019Updated 6 years ago
Alena-Xinran / MaTAV
View on GitHub
☆15Oct 10, 2024Updated last year
ml-master / MMFN_yyt
View on GitHub
the implement for "Multi-modal Fake News Detection on Social Media via Multi-grained Information Fusion"
☆21Jun 18, 2024Updated 2 years ago
PoloWlg / Joint-Multimodal-Transformer-6th-ABAW
View on GitHub
☆22Apr 22, 2024Updated 2 years ago
zrr1999 / emotion-recognition
View on GitHub
多模态情绪识别方法研究（Multimodal Emotion Recognition）
☆28Mar 24, 2026Updated 4 months ago
xiaoyangdu22 / QiandaoEar22
View on GitHub
☆20Mar 21, 2024Updated 2 years ago
marl / l3embedding
View on GitHub
Learn and L3 embedding from audio/video pairs
☆89Apr 24, 2022Updated 4 years ago
corticph / MSTmodel
View on GitHub
Code for https://arxiv.org/abs/1712.00254
☆18Dec 6, 2017Updated 8 years ago
WeeeicheN / MInD
View on GitHub
Code for MInD: Multimodal Information Disentanglement
☆19Jun 3, 2026Updated last month
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
daniellutscher / MixMatch-TransferLearning
View on GitHub
Combines the SSL Method MixMatch with a pre-trained model (EfficientNet) on a chest x-ray dataset.
☆11Jun 22, 2019Updated 7 years ago
Qengineering / YoloV6-ncnn-Raspberry-Pi-4
View on GitHub
YoloV6 for a bare Raspberry Pi using ncnn.
☆11Jun 12, 2024Updated 2 years ago
chimechallenge / C8DASR-Baseline-NeMo
View on GitHub
NeMo: a toolkit for conversational AI
☆13May 4, 2024Updated 2 years ago
hyperparameters / tracking_via_colorization
View on GitHub
☆18Aug 16, 2020Updated 5 years ago
wavlab-speech / shinjiwlab.github.io
View on GitHub
☆18Jul 20, 2026Updated last week
SRPOL-AUI / spectrum-correction
View on GitHub
Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"
☆13Feb 22, 2022Updated 4 years ago
ethanhe42 / dds
View on GitHub
DDS: Delta Denoising Score PyTorch implementation
☆19Sep 2, 2023Updated 2 years ago