spkgyk/TDFNet

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/spkgyk/TDFNet)

spkgyk / TDFNet

Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023

☆14

Alternatives and similar repositories for TDFNet

Users that are interested in TDFNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
JusperLee / CTCNet
View on GitHub
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits
☆82Apr 28, 2024Updated 2 years ago
ahmadikalkhorani / AVCrossNet
View on GitHub
☆16Jul 4, 2024Updated 2 years ago
Audio-WestlakeU / pytorch_lightning_template_for_beginners
View on GitHub
A pytorch template for beginners based on pytorch_lightning
☆50Feb 1, 2024Updated 2 years ago
arxrean / LipRead-seq2seq
View on GitHub
An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.
☆10May 13, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
xcmyz / ConvTasNet4BasisMelGAN
View on GitHub
This repo contains conv-tasnet for basis-melgan. If you want to get code of basis-melgan, please refer to FastVocoder.
☆21Jul 21, 2021Updated 5 years ago
TimesXY / TDF_Net
View on GitHub
☆15Feb 28, 2025Updated last year
UARK-AICV / UARK-AICV.github.io
View on GitHub
[Lab] lab website
☆12May 29, 2026Updated last month
santi-pdp / ahoproc_tools
View on GitHub
Tools for Ahocoder data processing and evaluation metrics
☆15Apr 22, 2024Updated 2 years ago
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
JishengBai / ICME2024ASC
View on GitHub
baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18Mar 16, 2024Updated 2 years ago
moonburntcat / DMYOLO
View on GitHub
The code of《Enhanced YOLOv10 for Real-time Fish Disease Detection in Aquaculture Farms》
☆16Mar 31, 2026Updated 3 months ago
WenlongJiao / SymUNet
View on GitHub
☆16Dec 23, 2025Updated 6 months ago
Minhchuyentoancbn / SMoPE
View on GitHub
Official implementation of "One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning " (ICLR 2026)
☆17Mar 29, 2026Updated 3 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mdswyz / DiCMoR
View on GitHub
An official implementation of "Distribution-Consistent Modal Recovering for Incomplete Multimodal Learning" in PyTorch. (ICCV 2023)
☆37Sep 28, 2023Updated 2 years ago
ku-vai / TPoS
View on GitHub
This repository is for The Power of Sound(TPoS): Audio Reactive Video Generation with Stable Diffusion (ICCV2023)
☆25Dec 7, 2023Updated 2 years ago
CFSRgroup / Paozival
View on GitHub
☆13Jan 25, 2024Updated 2 years ago
Adventureeee / multi-modal-sentiment
View on GitHub
multi-modal sentiment
☆16Nov 19, 2024Updated last year
wangbxj1234 / AdaCT
View on GitHub
AdaCE implementation
☆12May 12, 2026Updated 2 months ago
avsthiago / deepbee-source
View on GitHub
DeepBee is a project that aims to assist in the assessment of honey bee colonies using image processing and machine learning.
☆25Nov 4, 2024Updated last year
sqbqamar / SAMA-UNet
View on GitHub
Official Implementation of "SAMA-UNet: Enhancing Medical Image Segmentation with Self-Adaptive Mamba-Like Attention and Causal-Resonance …
☆16Oct 9, 2025Updated 9 months ago
ZhuoYulang / CIF-MMIN
View on GitHub
☆41Apr 16, 2024Updated 2 years ago
YuxiaoLuo0013 / TFDNet
View on GitHub
TFDNet: Time-Frequency Enhanced Decomposed Network for Long-term Time Series Forecasting
☆26May 11, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
wangwei2009 / DistantSpeech
View on GitHub
DistantSpeech
☆22Oct 9, 2023Updated 2 years ago
XiangzhuKong / CA-Dense-UNet
View on GitHub
An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement
☆13Jul 17, 2023Updated 3 years ago
WinterPan2017 / COVID19-Detection-System
View on GitHub
新冠肺炎辅助检测系统
☆15Jun 16, 2021Updated 5 years ago
bobwangPKU / EEG-Stimulus-Match-Mismatch
View on GitHub
Code to implement the model of No.2 in Task 1 of the Auditory EEG Challenge (ICASSP 2024)
☆12Jan 29, 2024Updated 2 years ago
Elune001 / MVP-FAS
View on GitHub
[ICCV 2025] Multi-View Slot Attention using Paraphrased Texts for Face Anti-Spoofing
☆15Nov 11, 2025Updated 8 months ago
huidudaozou / Time-Series
View on GitHub
《应用时间序列分析》易丹辉、王燕著；案例Python实现
☆16Nov 13, 2019Updated 6 years ago
XiaoyuBIE1994 / SDCodec
View on GitHub
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48May 16, 2025Updated last year
Retinal-Research / STA-UNet
View on GitHub
Code for paper: STA-Unet: Rethink the semantic redundant for Medical Imaging Segmentation
☆29Apr 30, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
cwhy / rwkv-decon
View on GitHub
Trying to deconstruct RWKV in understandable terms
☆14May 6, 2023Updated 3 years ago
artdillon / AI_CDM_for_ICH
View on GitHub
2023年中国研究生数学建模竞赛E题
☆14Sep 22, 2023Updated 2 years ago
Zishan-Shao / FlashSVD
View on GitHub
[AAAI 2026] Official implementation of "FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models". If you find this reposi…
☆17May 1, 2026Updated 2 months ago
Beilong-Tang / lauraTSE_code
View on GitHub
Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
☆37Nov 9, 2025Updated 8 months ago
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 8 months ago
JinjiangLiu / ICCRN
View on GitHub
☆18Mar 10, 2023Updated 3 years ago
Rorogogogo / claude-cracks-the-whip
View on GitHub
Claude Code is the boss. Other AI agents are the workforce. Whip goes crack. A skill for dispatching tasks to AI coding agents (Codex, Ge…
☆16Mar 22, 2026Updated 4 months ago