zeyun-zhong / AFFTView external linksLinks
Code for the paper: Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation.
☆32Aug 15, 2023Updated 2 years ago
Alternatives and similar repositories for AFFT
Users that are interested in AFFT are comparing it to the libraries listed below
Sorting:
- ☆12Apr 6, 2023Updated 2 years ago
- ☆19Sep 10, 2021Updated 4 years ago
- Official implementation of our CVPR'22 paper.☆13Nov 18, 2022Updated 3 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆49Oct 7, 2023Updated 2 years ago
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆153Nov 30, 2022Updated 3 years ago
- This project uses RGB and Depth images as input into two different convolutional network of same architecture (namely VGGNet, RESNet, Ale…☆13Jul 10, 2019Updated 6 years ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆15Jul 4, 2022Updated 3 years ago
- We used Improved DDPM (developed by OpenAI) to generate synthetic ECG signals and compared it with WGAN-GP.☆24Apr 22, 2023Updated 2 years ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆164Aug 1, 2022Updated 3 years ago
- Official PyTorch implementation of "IntegralAction: Pose-driven Feature Integration for Robust Human Action Recognition in Videos", CVPRW…☆36Jul 10, 2024Updated last year
- ☆78Jan 5, 2024Updated 2 years ago
- Repository dedicated to developing a robust and modular framework for Multi-Agent Reinforcement Learning (MARL) algorithms.☆13Mar 3, 2024Updated last year
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- ☆10Jun 26, 2024Updated last year
- Download scripts for EPIC-KITCHENS☆161Jul 8, 2025Updated 7 months ago
- ☆11Oct 4, 2018Updated 7 years ago
- This Repository Contains my Microwave Imaging Studies☆11Mar 1, 2016Updated 9 years ago
- ☆14Jan 9, 2025Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- ☆12Nov 11, 2024Updated last year
- This repository serves as a central hub for discovering tools and services focused on automated prompt engineering. Whether you're lookin…☆14Oct 11, 2024Updated last year
- ☆10Oct 4, 2023Updated 2 years ago
- Multimodal Medical Image Fusion based on Multi-channel Aggregated Network☆11Jan 26, 2025Updated last year
- This is the official repo for the ICML 2025 paper "Tuning-Free Alignment of Diffusion Models with Direct Noise Optimization" Tang et al☆18Jun 8, 2025Updated 8 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- [PRCV-2023, IEEE TMM-2025] Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification☆12Dec 20, 2025Updated last month
- This is the repository to the article "NEWBEE: A Multi-Modal Gait Database of Natural Everyday-Walk in an Urban Environment", 2022☆11Aug 2, 2022Updated 3 years ago
- Plug in and Play Prompt Technique to Boost Model reasoning by 40%☆10May 30, 2023Updated 2 years ago
- ☆11Feb 9, 2026Updated last week
- Official implementation of "Pan-Sharpening With Wavelet-Enhanced High-Frequency Information"☆13Mar 28, 2024Updated last year
- Pytorch implementation of The ICML 2020 paper "On Learning Sets of Symmetric Elements" by Haggai Maron, Or Litany, Gal Chechik, Ethan Fet…☆10Apr 22, 2021Updated 4 years ago
- ☆13Aug 7, 2021Updated 4 years ago
- My implementation of the model KosmosG from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"☆14Nov 11, 2024Updated last year
- ☆11Oct 18, 2022Updated 3 years ago
- Recent papers and projects in multitask learning and their applications☆11Aug 11, 2025Updated 6 months ago
- Fine-tune copilot based on your codebase☆12Mar 26, 2024Updated last year
- UnWave-Net: Unrolled Wavelet Network for Compton Tomography Image Reconstruction☆10Jul 6, 2025Updated 7 months ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- FLEX: A Largescale Multimodal, Multiview Dataset for Learning Structured Representations for Fitness Action Quality Assessment☆25Feb 10, 2026Updated last week