Official repository of ECCV 2024 paper - "HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization"
☆20 · Aug 23, 2024 · Updated last year
Alternatives and similar repositories for ECCV24-HAT
Users that are interested in ECCV24-HAT are comparing it to the libraries listed below.
- This repo takes the initial step towards leveraging text learning for online action detection without explicit human supervision. ☆15 · Dec 13, 2024 · Updated last year
- Awesome Online Action Detection ☆72 · Jan 25, 2025 · Updated last year
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos ☆35 · Sep 9, 2024 · Updated last year
- ☆41 · May 7, 2024 · Updated 2 years ago
- ☆22 · Mar 7, 2025 · Updated last year
- ☆20 · Jul 26, 2023 · Updated 2 years ago
- This is a project on visual spatial reasoning tasks-SIBench ☆26 · Jan 12, 2026 · Updated 4 months ago
- ☆12 · Aug 7, 2024 · Updated last year
- Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024 ☆73 · Sep 11, 2024 · Updated last year
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, … ☆45 · Jan 27, 2026 · Updated 4 months ago
- ☆16 · Feb 3, 2025 · Updated last year
- Temporal Action Localization Visualization Tool (TALVT) is a Javascript based simple visualization tool to visualize the outcomes of the … ☆29 · Sep 29, 2020 · Updated 5 years ago
- A reading list of papers about Visual Grounding. ☆31 · Aug 24, 2022 · Updated 3 years ago
- Third place of 2021 IEEE GRSS Data Fusion Contest: Track MSD ☆10 · Mar 31, 2021 · Updated 5 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding ☆50 · Oct 7, 2023 · Updated 2 years ago
- ☆14 · Jan 5, 2022 · Updated 4 years ago
- Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation ☆589 · May 29, 2026 · Updated last week
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models. ☆15 · May 26, 2025 · Updated last year
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions' ☆10 · Mar 11, 2024 · Updated 2 years ago
- [ECCV 2024 oral] C2C: Component-to-Composition Learning for Zero-Shot Compositional Action Recognition ☆40 · Dec 7, 2024 · Updated last year
- ☆13 · Mar 22, 2018 · Updated 8 years ago
- Code for the paper "Representing Spatial Trajectories as Distributions" ☆13 · Jan 17, 2023 · Updated 3 years ago
- The official implementation of CVPR 2021 Paper: Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation. ☆12 · Oct 15, 2021 · Updated 4 years ago
- Official implementation of "In-style: Bridging Text and Uncurated Videos with Style Transfer for Cross-modal Retrieval." ICCV 2023 ☆11 · Oct 5, 2023 · Updated 2 years ago
- The implementation of Learning Instance and Task-Aware Dynamic Kernels for Few Shot Learning ☆13 · Apr 14, 2024 · Updated 2 years ago
- TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models ☆40 · Nov 10, 2024 · Updated last year
- Code for paper titled, "Learning to Predict Task Progress by Self-Supervised Video Alignment" by Gerard Donahue and Ehsan Elhamifar, publ… ☆16 · Jul 26, 2024 · Updated last year
- [arXiv 2023] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection" ☆16 · Aug 30, 2023 · Updated 2 years ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me… ☆13 · May 25, 2023 · Updated 3 years ago
- ☆15 · Jun 11, 2021 · Updated 4 years ago
- ☆13 · Nov 28, 2021 · Updated 4 years ago
- Embodied Instruction Following in Unknown Environments ☆17 · Dec 8, 2025 · Updated 6 months ago
- Large-Scale Pre-Training for Dual-Accelerometer Human Activity Recognition ☆19 · Dec 16, 2024 · Updated last year
- Teeth Mold Point Cloud Completion Via Data Augmentation and Hybrid RL-GAN (Paper Code) ☆13 · May 23, 2023 · Updated 3 years ago
- Code accompanying paper "Fine-Grained Visual Entailment" [ECCV 2022]. ☆11 · Oct 31, 2022 · Updated 3 years ago
- Implementation of "Temporal Recurrent Networks for Online Action Detection" ☆23 · May 6, 2019 · Updated 7 years ago
- ☆12 · Sep 29, 2019 · Updated 6 years ago
- [ICCV 2025] Repository for A Quality-Guided Mixture of Score-fusion Experts Framework for Human Recognition ☆18 · Sep 29, 2025 · Updated 8 months ago
- [TCSVT23] Official code for "SPT: Spatial Pyramid Transformer for Image Captioning". ☆10 · Aug 14, 2024 · Updated last year