2tianyao1/ActionLLM

Multimodal Large Models Are Effective Action Anticipators (IEEE TMM) 🌳

☆27

Alternatives and similar repositories for ActionLLM

Users who are interested in ActionLLM are comparing it to the libraries listed below.

zllxot / CORE
ICCV2023 - CORE: Cooperative Reconstruction for Multi-Agent Perception
☆46 · Nov 25, 2023 · Updated 2 years ago
robert80203 / EgoPER_official
The official implementation of Error Detection in Egocentric Procedural Task Videos
☆33 · Sep 20, 2025 · Updated 10 months ago
assembly-101 / assembly101-mistake-detection
Annotations for the Mistake Detection benchmark of Assembly101
☆12 · Aug 3, 2023 · Updated 2 years ago
EthanSeok / YOLO_v8_with_SAM
Pest detection model combining YOLO v8 with SAM (Segment Anything Model)
☆13 · Apr 28, 2024 · Updated 2 years ago
CAMMA-public / rendezvous-in-time
rendezvous-in-time
☆14 · Sep 17, 2025 · Updated 10 months ago
lingorX / Mem3D
☆90 · Sep 22, 2022 · Updated 3 years ago
f-ilic / SelectivePrivacyPreservation
[CVPR 2024] Selective, Interpretable and Motion Consistent Privacy Attribute Obfuscation for Action Recognition
☆12 · Mar 20, 2024 · Updated 2 years ago
olga-zats / goal_consistency
[ICIP2023] Code for the paper 'Action Anticipation with Goal Consistency'
☆12 · Apr 5, 2024 · Updated 2 years ago
zxzhaoeric / Semi-InstruSeg
☆16 · Oct 9, 2020 · Updated 5 years ago
YuemingJin / Trans-SVNet_Journal
[IJCARS'22] Trans-SVNet: hybrid embedding aggregation Transformer for surgical workflow analysis, 1st Prize of Best Paper Award of IJCARS-…
☆16 · Dec 20, 2022 · Updated 3 years ago
hcmr-lab / Seg2Track-SAM2
☆17 · Jan 6, 2026 · Updated 6 months ago
Flaick / Surgical-Workflow-Anticipation
[MedIA'22] Anticipation for surgical workflow through instrument interaction and recognized signals
☆17 · Feb 11, 2022 · Updated 4 years ago
omron-sinicx / com_kitchens
COM Kitchens: An Unedited Overhead-view Video Dataset as a Vision-Language Benchmark
☆15 · Aug 22, 2024 · Updated last year
VividLe / ExtractVideoFeature
Extract video features. Currently, the models include I3D; the list will be continuously updated.
☆12 · Jun 4, 2020 · Updated 6 years ago
fpv-iplab / MECCANO
The MECCANO Dataset: official repository in which we provide code and models.
☆32 · Jul 31, 2023 · Updated 2 years ago
idiap / sharingan
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
☆32 · Nov 11, 2024 · Updated last year
Sid2697 / EgoProceL-egocentric-procedure-learning
Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"
☆35 · Feb 5, 2024 · Updated 2 years ago
willprice / flowty
The swiss army knife for extracting optical flow
☆16 · May 13, 2020 · Updated 6 years ago
md-mohaiminul / BIMBA
☆29 · Jul 25, 2025 · Updated 11 months ago
WenliangGuo / SCHEMA
[ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos
☆20 · Aug 21, 2025 · Updated 11 months ago
YingqianWang / DistgLF
[TPAMI 2022] Disentangling Light Fields for Super-Resolution and Disparity Estimation
☆13 · Apr 2, 2024 · Updated 2 years ago
hamarh / HMNet_pth
PyTorch implementation of Hierarchical Neural Memory Network
☆49 · Feb 28, 2024 · Updated 2 years ago
openmedlab / MedLSAM
MedLSAM: Localize and Segment Anything Model for 3D Medical Images
☆522 · Apr 30, 2024 · Updated 2 years ago
Michedev / diffusion-uncertainty
Official implementation of "Diffusion Model Guided Sampling with Pixel-Wise Aleatoric Uncertainty Estimation"
☆17 · Apr 1, 2025 · Updated last year
jutanke / social_diffusion
Re-implementation for ICCV23 "Social Diffusion: Long-term Multiple Human Motion Anticipation"
☆24 · Oct 3, 2023 · Updated 2 years ago
YujiaBao / pytorch-pretrained-BERT
📖The Big-&-Extending-Repository-of-Transformers: Pretrained PyTorch models for Google's BERT, OpenAI GPT & GPT-2, Google/CMU Transformer…
☆15 · Sep 13, 2022 · Updated 3 years ago
effepivi / MicroTomoRegistration
Registration of 3D triangular meshes onto a 2D image can be performed using optimisation and fast X-ray simulation on GPU. Automatic esti…
☆11 · Aug 28, 2019 · Updated 6 years ago
LifengFan / Human-Gaze-Communication
☆35 · Aug 26, 2024 · Updated last year
pittisl / mPnP-LLM
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
☆13 · Jan 19, 2024 · Updated 2 years ago
doc-doc / EgoBlind
EgoBlind: Towards Egocentric Visual Assistance for the Blind (NeurIPS'25, D&B Track)
☆23 · Apr 20, 2026 · Updated 3 months ago
bert-nmt / ctx-bert-nmt
Extend bert-nmt to context-aware translation.
☆11 · May 24, 2021 · Updated 5 years ago
BCV-Uniandes / TAPIR
☆38 · Apr 5, 2025 · Updated last year
huckiyang / EyeNet
ICML 18 workshop - A Novel Hybrid Machine Learning Model for Auto-Classification of Retinal Diseases
☆15 · Jul 18, 2018 · Updated 8 years ago
esa / neuralg
Neural network approximators of linear algebra operations on GPU with PyTorch
☆17 · May 30, 2022 · Updated 4 years ago
alexandrosstergiou / progressive-action-prediction
[CVPR 2023] Code for action prediction from videos
☆26 · Mar 8, 2024 · Updated 2 years ago
IVUL-KAUST / GroupReading
IVUL group reading
☆18 · Apr 9, 2019 · Updated 7 years ago
Finspire13 / Towards-Unified-Surgical-Skill-Assessment
Codes for "Towards Unified Surgical Skill Assessment" (CVPR 2021)
☆35 · Nov 11, 2023 · Updated 2 years ago
CAMMA-public / rendezvous
A transformer-inspired neural network for surgical action triplet recognition from laparoscopic videos.
☆39 · Sep 17, 2025 · Updated 10 months ago
svip-lab / WeakSVR
(CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos…
☆32 · Apr 2, 2024 · Updated 2 years ago