Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.
☆32Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for AFFT
Users who are interested in AFFT are comparing it to the libraries listed below.
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆47Dec 22, 2022Updated 3 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆154Feb 11, 2022Updated 4 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆50Oct 7, 2023Updated 2 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆153Nov 30, 2022Updated 3 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- This project uses RGB and Depth images as input into two different convolutional network of same architecture (namely VGGNet, RESNet, Ale…☆13Jul 10, 2019Updated 6 years ago
- Implementation for the paper "Dynamic Language Binding in Relational Visual Reasoning" (Le et al., IJCAI 2020)☆13Jul 25, 2024Updated last year
- Official code implementation of the paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆30Sep 23, 2024Updated last year
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆171Aug 1, 2022Updated 3 years ago
- Code for our PLOS ONE paper: "Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks"☆13Jun 3, 2022Updated 3 years ago
- Log monitoring system built with kafka+storm+mysql/redis☆13May 6, 2018Updated 7 years ago
- ☆78Aug 16, 2021Updated 4 years ago
- ☆13Aug 24, 2018Updated 7 years ago
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated 3 months ago
- ☆35Mar 22, 2022Updated 4 years ago
- Training for multi-modal image fusion with PyTorch.☆38Nov 30, 2023Updated 2 years ago
- We used Improved DDPM (developed by OpenAI) to generate synthetic ECG signals and compared it with WGAN-GP.☆25Apr 22, 2023Updated 2 years ago
- [ICMR'21, Best Poster Paper Award] Medical Visual Question Answering with Multi-task Pre-training and Cross-modal Self-attention☆35Dec 15, 2022Updated 3 years ago
- GaitParsing: Human Semantic Parsing for Gait Recognition (IEEE TMM)☆13May 20, 2024Updated last year
- ☆80Jan 5, 2024Updated 2 years ago
- 2021 Tencent Advertising Algorithm Competition, Track 2, 6th place in the finals☆42Oct 7, 2022Updated 3 years ago
- Download scripts for EPIC-KITCHENS☆165Jul 8, 2025Updated 9 months ago
- Code for CVPR'21 paper "Learning Asynchronous and Sparse Human-Object Interaction in Videos".☆25Aug 6, 2021Updated 4 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020)☆31Aug 3, 2022Updated 3 years ago
- The repo for "MMPareto: Boosting Multimodal Learning with Innocent Unimodal Assistance", ICML 2024☆54Jun 28, 2024Updated last year
- [IEEE TMI'22] VQAMix: Conditional Triplet Mixup for Medical Visual Question Answering☆16Oct 9, 2022Updated 3 years ago
- ☆11Oct 4, 2023Updated 2 years ago
- ☆11Apr 7, 2024Updated 2 years ago
- ☆13Mar 8, 2024Updated 2 years ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆117Aug 23, 2025Updated 7 months ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- [ECCV 2024 (Oral)] Towards Scene Graph Anticipation☆19Mar 10, 2026Updated last month
- ☆14Nov 13, 2023Updated 2 years ago
- PMR: Prototypical Modal Rebalance for Multimodal Learning☆46Mar 10, 2023Updated 3 years ago
- Spatial-Temporal Knowledge-Embedded Transformer for Video Scene Graph Generation (TIP 2024, ACM MM 2023)☆20Mar 13, 2024Updated 2 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- A framework that detects people via object detection and performs pose estimation on the detection results☆11Oct 30, 2023Updated 2 years ago
- ☆11Oct 18, 2022Updated 3 years ago