shanwangshan / TAU-urban-audio-visual-scenes
☆12Oct 23, 2021Updated 4 years ago
Alternatives and similar repositories for TAU-urban-audio-visual-scenes
Users who are interested in TAU-urban-audio-visual-scenes are comparing it to the repositories listed below
- 2nd place solution for 2020 DCASE challenge task 6 audio captioning. http://dcase.community/challenge2020/task-automatic-audio-captioning…☆24Aug 3, 2023Updated 2 years ago
- Learning image-to-image translation using paired and unpaired training samples☆20May 25, 2021Updated 4 years ago
- Official implementation for MGN☆20Dec 22, 2022Updated 3 years ago
- ☆21Apr 6, 2021Updated 4 years ago
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- A list of resources that can help in research for automated audio captioning☆34Feb 17, 2021Updated 5 years ago
- A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis☆10Jan 13, 2024Updated 2 years ago
- The active learning algorithm, mismatch-first farthest-traversal. Implementation and visualization.☆12Dec 25, 2021Updated 4 years ago
- Jewel: Resource-Efficient Joint Packet and Flow Level Inference in Programmable Switches☆12Mar 18, 2024Updated last year
- ☆39Oct 19, 2025Updated 3 months ago
- Official implementation for “HarmonyGuard: Toward Safety and Utility in Web Agents via Adaptive Policy Enhancement and Dual-Objective Opt…☆25Jan 10, 2026Updated last month
- ☆10Jun 18, 2024Updated last year
- Accompanying code for the paper Sub-Cluster AdaCos: Learning Representations for Anomalous Sound Detection.☆10Jun 7, 2022Updated 3 years ago
- WSDM2022 Challenge - Large scale temporal graph link prediction☆38Jan 25, 2022Updated 4 years ago
- Audio captioning baseline system for DCASE 2020 challenge.☆38Aug 22, 2023Updated 2 years ago
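- This repository describes the implementation of the 3rd-place entry in the CCAC2023 multimodal conversational emotion recognition evaluation☆11Aug 11, 2024Updated last year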
- Conversational Multimodal Emotion Recognition☆11Dec 7, 2020Updated 5 years ago
- YoloV6 for a bare Raspberry Pi using ncnn.☆11Jun 12, 2024Updated last year
- MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data Integration☆12Aug 4, 2024Updated last year
- Crawl traffic data from PEMS☆10Jul 19, 2021Updated 4 years ago
- ☆11Oct 7, 2023Updated 2 years ago
- Speech Security and Privacy Compendium - Mini☆10Jun 18, 2024Updated last year
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments☆11Nov 29, 2021Updated 4 years ago
- P4Control: Line-Rate Cross-Host Attack Prevention via In-Network Information Flow Control Enabled by Programmable Switches and eBPF☆11May 20, 2024Updated last year
- A reviewed paper list about applying deep learning models for smarter transportation systems☆12Sep 15, 2020Updated 5 years ago
- An algorithm that analyses the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Mar 8, 2019Updated 6 years ago
- ☆10Jun 28, 2023Updated 2 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated 3 weeks ago
- Combines the SSL Method MixMatch with a pre-trained model (EfficientNet) on a chest x-ray dataset.☆11Jun 22, 2019Updated 6 years ago
- Audio captioning recipe☆51Oct 23, 2025Updated 3 months ago
- [JRTIP 2023] Efficient Convolutional Neural Networks on Raspberry Pi for Image Classification☆10Aug 12, 2025Updated 6 months ago
- ☆31Sep 19, 2025Updated 4 months ago
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 6 months ago
- Keras implementation of A Discriminative Feature Learning Approach for Deep Face Recognition based on MNIST☆10Mar 1, 2019Updated 6 years ago
- ☆13Apr 23, 2025Updated 9 months ago
- Emacs extension to interact with the SLURM jobs scheduler☆45Aug 6, 2021Updated 4 years ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year