The audio-visual fusion method for FFIA
☆26Aug 5, 2024Updated last year
Alternatives and similar repositories for U-FFIA
Users that are interested in U-FFIA are comparing it to the libraries listed below
Sorting:
- ☆10Apr 12, 2023Updated 2 years ago
- 一个讯飞智能语音平台 MSC 的第三方 Python SDK,支持语音唤醒、语音识别、语音合成、语音评测等功能。A third-party Python SDK for a iFLYTEK MSC. Using for ASR, TSS, KWS.☆23Jan 27, 2024Updated 2 years ago
- Aligning First, Then Fusing: A Novel Weakly-Supervised Multimodal Violence Detection Method☆22Oct 2, 2025Updated 4 months ago
- Cross-model active contrastive coding☆22Mar 17, 2021Updated 4 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆69Sep 3, 2021Updated 4 years ago
- SDN controllers synchronization approach based on Reinforcement Learning aimed at reducing the average path cost (APC)☆12Oct 23, 2021Updated 4 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆48Dec 9, 2022Updated 3 years ago
- Code and data recipes for the paper: Heterogeneous Target Speech Separation☆43Dec 6, 2022Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Nov 12, 2022Updated 3 years ago
- Official repository for the WACV 2024 paper "Multi-view Classification with Hybrid Fusion and Mutual Distillation"☆15Jan 16, 2024Updated 2 years ago
- Documentation and code for predictive maintenance data and assess scripts.☆11Jun 8, 2023Updated 2 years ago
- 智能控制结课作业实验代码实现部分,包括模糊控制器和PID控制器实现以及控制器参数优化整定,PID参数采用Nelder-Mead优化,模糊控制器参数采用遗传算法优化。☆10Dec 2, 2024Updated last year
- Anki add-on that adds Pinyin and Zhuyin readings above Chinese characters in any field.☆12Sep 23, 2025Updated 5 months ago
- ☆11Jan 13, 2023Updated 3 years ago
- Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.☆45Oct 8, 2021Updated 4 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆45Nov 29, 2024Updated last year
- ☆12Nov 25, 2023Updated 2 years ago
- Companion code for Awe the Audience: How the Narrative Trajectories Affect Audience Perception in Public Speaking☆14Jan 6, 2018Updated 8 years ago
- ☆14Sep 20, 2023Updated 2 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- ☆11Sep 1, 2024Updated last year
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Feb 13, 2026Updated 2 weeks ago
- Implementation of Stochastic Gradient Descent algorithms in Python (cite https://doi.org/10.1007/s00158-020-02599-z)☆11May 19, 2021Updated 4 years ago
- course project for ECE 9603B Data Analytics Foundations☆10Aug 30, 2019Updated 6 years ago
- The project creates the models and service API for predicting scanned document images' angles ranging between -90° to 90° from the vertic…☆10Oct 3, 2022Updated 3 years ago
- Python implementation of Crop Growth Monitoring System as implemented by the EU MARS project.☆12Feb 23, 2020Updated 6 years ago
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"☆14Feb 24, 2025Updated last year
- The repo host the code and model of MAViL.☆45Jul 24, 2023Updated 2 years ago
- Implementation of "Audio Retrieval with Natural Language Queries: A Benchmark Study".☆54Jul 16, 2025Updated 7 months ago
- This is the repository for our EMNLP 2022 paper "The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains".☆10Jun 2, 2023Updated 2 years ago
- python opencv 文档照片与证件照片的仿射变换的矫正☆11Nov 3, 2020Updated 5 years ago
- A pytorch implementation of Fine-Grained Classification via Hierarchical Bilinear Pooling with Aggregated Slack Mask (HBPASM).☆14Sep 24, 2019Updated 6 years ago
- ☆12Jul 21, 2025Updated 7 months ago
- a ros node using face_net do face_recognition☆11Jul 27, 2016Updated 9 years ago
- A Python-Based Information Theoretic Multi-Label Feature Selection☆11Oct 2, 2021Updated 4 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.☆10May 13, 2020Updated 5 years ago
- Documentation of the Two!Ears Auditory Model☆13Feb 14, 2019Updated 7 years ago