FishMaster93/U-FFIA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/FishMaster93/U-FFIA)

FishMaster93 / U-FFIA

The audio-visual fusion method for FFIA

☆34

Alternatives and similar repositories for U-FFIA

Users that are interested in U-FFIA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FishMaster93 / AFFIA3K
View on GitHub
☆10Apr 12, 2023Updated 3 years ago
praveena2j / Cross-Attentional-AV-Fusion
View on GitHub
FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition
☆34Nov 29, 2024Updated last year
etzinis / heterogeneous_separation
View on GitHub
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44Dec 6, 2022Updated 3 years ago
xjpp2016 / MAVD
View on GitHub
Aligning First, Then Fusing: A Novel Weakly-Supervised Multimodal Violence Detection Method
☆22Oct 2, 2025Updated 9 months ago
microsoft / WavText5K
View on GitHub
Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"
☆50Nov 10, 2022Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
haoheliu / diffres-python
View on GitHub
Learning differentiable temporal resolution on time-series data.
☆36Nov 12, 2022Updated 3 years ago
jm12138 / iFLYTEK-MSC-Python-SDK
View on GitHub
一个讯飞智能语音平台 MSC 的第三方 Python SDK，支持语音唤醒、语音识别、语音合成、语音评测等功能。A third-party Python SDK for a iFLYTEK MSC. Using for ASR, TSS, KWS.
☆23Jan 27, 2024Updated 2 years ago
zakaria76al / USC
View on GitHub
The official implementation of the paper "A spatio-temporal deep learning approach for underwater acoustic signals classification". In th…
☆34Apr 6, 2023Updated 3 years ago
praveena2j / Joint-Cross-Attention-for-Audio-Visual-Fusion
View on GitHub
IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"
☆47Nov 29, 2024Updated last year
mathilde173 / MAFnet
View on GitHub
☆23Aug 11, 2020Updated 5 years ago
facebookresearch / MAViL
View on GitHub
The repo host the code and model of MAViL.
☆45Jul 24, 2023Updated 2 years ago
liuxubo717 / cl4ac
View on GitHub
Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.
☆45Oct 8, 2021Updated 4 years ago
liuxubo717 / sound_generation
View on GitHub
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69Sep 3, 2021Updated 4 years ago
CXH-Research / GuidedHybSensUIR
View on GitHub
[TCSVT] Underwater Image Restoration Through a Prior Guided Hybrid Sense Approach and Extensive Benchmark Analysis
☆28Feb 24, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yunyikristy / CM-ACC
View on GitHub
Cross-model active contrastive coding
☆22Mar 17, 2021Updated 5 years ago
myclark / TDC-GP22
View on GitHub
An Arduino library for interfacing with an ACAM TDC-GP22 over SPI (For Arduino Due)
☆17Jun 15, 2020Updated 6 years ago
CU-UQ / SGD
View on GitHub
Implementation of Stochastic Gradient Descent algorithms in Python (cite https://doi.org/10.1007/s00158-020-02599-z)
☆12May 19, 2021Updated 5 years ago
Akimoto-Cris / RD_PRUNE
View on GitHub
[ICCV 2023] Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
☆25Oct 31, 2023Updated 2 years ago
CPJKU / EfficientLEAF
View on GitHub
Official implementation of EfficientLEAF, a learnable audio frontend.
☆50Dec 9, 2022Updated 3 years ago
ws-choi / AMSS-Net
View on GitHub
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21Jul 4, 2021Updated 5 years ago
ychtanaka / marine-snow
View on GitHub
Marine Snow Removal Benchmarking Dataset
☆17Aug 29, 2023Updated 2 years ago
changjinhan / ADD-arxiv-daily
View on GitHub
🕵️‍♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)
☆17Feb 13, 2026Updated 5 months ago
dicecco1 / fpga_cpfp
View on GitHub
HLS Custom-Precision Floating-Point Library
☆13Nov 6, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MingTian99 / RSDformer
View on GitHub
Learning An Effective Transformer for Remote Sensing Satellite Image Dehazing
☆12Sep 25, 2023Updated 2 years ago
swagshaw / WildDESED
View on GitHub
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18Nov 19, 2024Updated last year
mlzxy / VCNN
View on GitHub
☆10Sep 3, 2016Updated 9 years ago
bit-ml / DeCLIP
View on GitHub
☆20Jul 3, 2025Updated last year
liuxubo717 / LASS
View on GitHub
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146Oct 11, 2023Updated 2 years ago
yinkalario / General-Purpose-Sound-Recognition-Demo
View on GitHub
General purpose sound recognition demo
☆161Oct 3, 2023Updated 2 years ago
House-yuyu / UniUIR
View on GitHub
[TIP 2025] UniUIR: Considering Underwater Image Restoration as an All-in-One Learner
☆17Jun 2, 2026Updated last month
bo-yang / stip_fisher
View on GitHub
Action recognition with STIP features and my own Fisher vector implementation
☆14Mar 29, 2017Updated 9 years ago
liuxubo717 / SimPFs
View on GitHub
Code for "Simple Pooling Front-ends for Efficient Audio Calssification", ICASSP 2023
☆57Mar 3, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Ylexx / HBPASM
View on GitHub
A pytorch implementation of Fine-Grained Classification via Hierarchical Bilinear Pooling with Aggregated Slack Mask (HBPASM).
☆14Sep 24, 2019Updated 6 years ago
akshaypunwatkar / Sound_classification_urbansound8k
View on GitHub
Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.
☆13Apr 15, 2020Updated 6 years ago
Jorwnpay / NK-Sonar-Image-Dataset
View on GitHub
A newly created forward looking sonar image recognition benchmark, named NanKai Sonar Image Dataset (NKSID). This dataset contains 2617 i…
☆70Apr 11, 2024Updated 2 years ago
ictnlp / LSG
View on GitHub
The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”
☆15Jan 3, 2025Updated last year
ajwdewit / pyCGMS
View on GitHub
Python implementation of Crop Growth Monitoring System as implemented by the EU MARS project.
☆12Feb 23, 2020Updated 6 years ago
Jinbo-Hu / L3DAS22-TASK2
View on GitHub
A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection
☆23Nov 14, 2024Updated last year
GenjiB / ECLIPSE
View on GitHub
☆33Mar 10, 2023Updated 3 years ago