zeyun-zhong/AFFT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zeyun-zhong/AFFT)

zeyun-zhong / AFFT

Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.

☆32

Alternatives and similar repositories for AFFT

Users that are interested in AFFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fpv-iplab / rulstm
View on GitHub
Code for the Paper: Antonino Furnari and Giovanni Maria Farinella. What Would You Expect? Anticipating Egocentric Actions with Rolling-Un…
☆137Aug 23, 2023Updated 2 years ago
AllenXuuu / DCR
View on GitHub
Official implementation of our CVPR'22 paper.
☆13Nov 18, 2022Updated 3 years ago
facebookresearch / MeMViT
View on GitHub
Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022
☆155Nov 30, 2022Updated 3 years ago
SwordfallYeung / LogMonitor
View on GitHub
利用kafka+storm+mysql/redis构建日志监控系统
☆13May 6, 2018Updated 8 years ago
thaolmk54 / LOGNet-VQA
View on GitHub
Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)
☆13Jul 25, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mks0601 / IntegralAction_RELEASE
View on GitHub
Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW…
☆36Jul 10, 2024Updated 2 years ago
channelCS / digit-identify
View on GitHub
Digit classification with Convolutional Neural Networks using Keras
☆20May 12, 2018Updated 8 years ago
doerlbh / HumanLSTM
View on GitHub
Code for our PLOS ONE paper: "Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks"
☆13Jun 3, 2022Updated 4 years ago
epic-kitchens / epic-kitchens-100-annotations
View on GitHub
Annotations for the public release of the EPIC-KITCHENS-100 dataset
☆173Aug 1, 2022Updated 3 years ago
rishikksh20 / CoaT-pytorch
View on GitHub
CoaT: Co-Scale Conv-Attentional Image Transformers
☆15Apr 20, 2021Updated 5 years ago
Event-AHU / EFV_event_classification
View on GitHub
[PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification
☆12Dec 20, 2025Updated 7 months ago
chenzpstar / Multi-Modal-Image-Fusion
View on GitHub
Training for multi-modal image fusion with PyTorch.
☆37Nov 30, 2023Updated 2 years ago
mah533 / Synthetic-ECG-Signal-Generation-using-Probabilistic-Diffusion-Models
View on GitHub
We used Improved DDPM (developed by OpenAI) to generate synthetic ECG signals and compared it with WGAN-GP.
☆26Apr 22, 2023Updated 3 years ago
rust-community / content-o-tron
View on GitHub
A process for helping the Rust community and new comers to share their story of using Rust
☆17Oct 7, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
haifangong / CMSA-MTPT-4-MedicalVQA
View on GitHub
[ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention
☆34Dec 15, 2022Updated 3 years ago
haifangong / VQAMix
View on GitHub
[IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering
☆16Oct 9, 2022Updated 3 years ago
MIS-DevWorks / FBR
View on GitHub
This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…
☆11Oct 9, 2024Updated last year
EGO4D / forecasting
View on GitHub
☆82Jan 5, 2024Updated 2 years ago
Awenbocc / CPCR
View on GitHub
☆15Mar 11, 2023Updated 3 years ago
epic-kitchens / epic-kitchens-download-scripts
View on GitHub
Download scripts for EPIC-KITCHENS
☆173Jun 10, 2026Updated last month
facebookresearch / ego-topo
View on GitHub
Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)
☆31Aug 3, 2022Updated 3 years ago
GeWu-Lab / MMPareto_ICML2024
View on GitHub
The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024
☆55Jun 28, 2024Updated 2 years ago
Jie-su / BPD
View on GitHub
☆11Sep 20, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
shubha07m / On-device-computer-vision-experiments-with-IoT
View on GitHub
Various object detection testing using YOLO and other algorithms, Raspberry pi based integration experiments.
☆13Dec 9, 2024Updated last year
csimo005 / SUMMIT
View on GitHub
☆11Oct 4, 2023Updated 2 years ago
AbdullaDesmal / TBIM
View on GitHub
This repository shows the implementation of the Trained Born Iterative Method (TBIM) applied for electromagnetic imaging.
☆12Nov 9, 2022Updated 3 years ago
mmact19 / challenge
View on GitHub
MMAct Challenge
☆13Jun 20, 2021Updated 5 years ago
zhaoyue-zephyrus / TeSTra
View on GitHub
Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"
☆119Aug 23, 2025Updated 10 months ago
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
younesbelkada / BraTS_2021
View on GitHub
☆13Aug 7, 2021Updated 4 years ago
GeWu-Lab / TSPM
View on GitHub
Official repository for "Boosting Audio Visual Question Answering via Key Semantic-Aware Cues" in ACM MM 2024.
☆17Oct 25, 2024Updated last year
fyyCS / LSLD
View on GitHub
☆14Nov 13, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HCPLab-SYSU / STKET
View on GitHub
Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)
☆19Mar 13, 2024Updated 2 years ago
stypoumic / BECLR
View on GitHub
Official repository for the paper BECLR: Batch Enhanced Contrastive Unsupervised Few-Shot Learning
☆16May 25, 2026Updated last month
MinjieWan / WTAPNet
View on GitHub
☆12Nov 11, 2024Updated last year
alexhe101 / WINet
View on GitHub
Official implementation of "Pan-Sharpening With Wavelet-Enhanced High-Frequency Information"
☆12Mar 28, 2024Updated 2 years ago
781458112 / DWWA
View on GitHub
☆11Oct 18, 2022Updated 3 years ago
noagarcia / knowit-rock
View on GitHub
ROCK model for Knowledge-Based VQA in Videos
☆31Oct 19, 2020Updated 5 years ago
GeWu-Lab / MS-Bot
View on GitHub
The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆22Jun 25, 2025Updated last year