FishMaster93/AFFIA3K

☆10

Alternatives and similar repositories for AFFIA3K

Users that are interested in AFFIA3K are comparing it to the libraries listed below.

etzinis / heterogeneous_separation
Code and data recipes for the paper: Heterogeneous Target Speech Separation
☆44 · Dec 6, 2022 · Updated 3 years ago
microsoft / WavText5K
Web-crawl for "Audio Retrieval with WavText5K and CLAP Training"
☆50 · Nov 10, 2022 · Updated 3 years ago
haoheliu / diffres-python
Learning differentiable temporal resolution on time-series data.
☆36 · Nov 12, 2022 · Updated 3 years ago
hqsiswiliam / persona-adaptive-attention
☆26 · Oct 13, 2023 · Updated 2 years ago
kyuyeonpooh / objects-that-sound
An unofficial implementation of the paper "Objects that Sound" (ECCV 2018).
☆31 · Jan 29, 2024 · Updated 2 years ago
swagshaw / ASC-CL
Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14 · Jul 19, 2022 · Updated 4 years ago
FishMaster93 / U-FFIA
The audio-visual fusion method for FFIA
☆34 · Aug 5, 2024 · Updated last year
liuxubo717 / LASS-demopage
☆19 · Sep 2, 2022 · Updated 3 years ago
liuxubo717 / sound_generation
Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021
☆69 · Sep 3, 2021 · Updated 4 years ago
liuxubo717 / SimPFs
Code for "Simple Pooling Front-ends for Efficient Audio Classification", ICASSP 2023
☆57 · Mar 3, 2023 · Updated 3 years ago
haoheliu / torchsubband
PyTorch implementation of subband decomposition
☆93 · Jul 26, 2022 · Updated 3 years ago
Labbeti / aac-datasets
Audio Captioning datasets for PyTorch.
☆129 · Mar 25, 2026 · Updated 3 months ago
liuxubo717 / V-ACT
Visually-Aware Audio Captioning
☆43 · Mar 3, 2023 · Updated 3 years ago
XinhaoMei / audio-text_retrieval
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
☆51 · May 17, 2022 · Updated 4 years ago
akoepke / audio-retrieval-benchmark
Code for "Audio Retrieval with Natural Language Queries: A Benchmark Study", Transactions on Multimedia 2022
☆54 · Jul 16, 2025 · Updated last year
liuxubo717 / cl4ac
Code for "CL4AC: A Contrastive Loss for Audio Captioning", DCASE Workshop 2021.
☆45 · Oct 8, 2021 · Updated 4 years ago
liuxubo717 / LASS
This repo hosts the code and model of "Separate What You Describe: Language-Queried Audio Source Separation", Interspeech 2022
☆146 · Oct 11, 2023 · Updated 2 years ago
keetsky / Net_ghostVLAD-pytorch
☆21 · Jul 11, 2019 · Updated 7 years ago
yinkalario / General-Purpose-Sound-Recognition-Demo
General purpose sound recognition demo
☆161 · Oct 3, 2023 · Updated 2 years ago
cfoster0 / CLAP
Contrastive Language-Audio Pretraining
☆88 · Mar 6, 2022 · Updated 4 years ago
wsntxxn / TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49 · Oct 23, 2025 · Updated 9 months ago
google-research / diffstride
TF/Keras code for DiffStride, a pooling layer with learnable strides.
☆124 · Feb 7, 2022 · Updated 4 years ago
hche11 / VGGSound
VGGSound: A Large-scale Audio-Visual Dataset
☆359 · Sep 13, 2021 · Updated 4 years ago
haoheliu / audioldm_eval
This toolbox aims to unify audio generation model evaluation for easier comparison.
☆390 · Sep 29, 2024 · Updated last year
pseeth / torch-stft
An STFT/iSTFT for PyTorch.
☆372 · Oct 31, 2023 · Updated 2 years ago
stoneMo / CIGN
Official implementation for CIGN
☆17 · Sep 11, 2023 · Updated 2 years ago
myclark / TDC-GP22
An Arduino library for interfacing with an ACAM TDC-GP22 over SPI (For Arduino Due)
☆17 · Jun 15, 2020 · Updated 6 years ago
sealzjh / face_recognize
☆15 · Oct 3, 2023 · Updated 2 years ago
zailongchen / Audio-Visual-Question-Answering-AVQA
This task is based on the MUSIC-AVQA dataset. We focus on optimizing the accuracy of the AVQA task, which aims to answer questions regarding di…
☆13 · Feb 11, 2023 · Updated 3 years ago
DZPeru / fish-datasets
Datasets of fish for deep learning.
☆20 · Aug 15, 2024 · Updated last year
qiuqiangkong / torchlibrosa
☆512 · Jun 25, 2024 · Updated 2 years ago
articalman / Management-System-for-students-grade
Student grade management system, written as a sophomore-year data structures course project and implemented with a singly linked list. The source encoding is GB2312. Features implemented so far: initialization, insertion, modification, deletion, sorting, listing students who need make-up exams, listing outstanding students, output, and destruction. File support and a flowchart may be added later.
☆15 · Dec 5, 2020 · Updated 5 years ago
ws-choi / AMSS-Net
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21 · Jul 4, 2021 · Updated 5 years ago
zailongchen / R2Gen-EVA
Optimizing Efficiency and Visual-Textual Alignment for LLM-Based Radiology Report Generation
☆19 · Mar 5, 2025 · Updated last year
zailongchen / R2GenAlign
Analyzing and Enhancing Visual Learning in LLM-based Radiology Report Generation
☆17 · Feb 23, 2026 · Updated 5 months ago
swagshaw / WildDESED
WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection
☆18 · Nov 19, 2024 · Updated last year
JiangXiangBo / yolo3_fish_detection
YOLOv3-based object detection for fish and human faces
☆17 · Sep 10, 2019 · Updated 6 years ago
facebookresearch / AudioMAE
This repo hosts the code and models of "Masked Autoencoders that Listen".
☆673 · Apr 5, 2024 · Updated 2 years ago
ajwdewit / pyCGMS
Python implementation of Crop Growth Monitoring System as implemented by the EU MARS project.
☆12 · Feb 23, 2020 · Updated 6 years ago