This repository contains video datasets that can be used for training coarse to fine-grained (phase, step and action) temporal classification tasks.
☆16Oct 26, 2021Updated 4 years ago
Alternatives and similar repositories for video-action-recognition-datasets
Users that are interested in video-action-recognition-datasets are comparing it to the libraries listed below
Sorting:
- [TMI'2021] Temporal Memory Relation Network for Workflow Recognition from Surgical Video☆66Feb 15, 2022Updated 4 years ago
- IEEE TMI 2022: Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos☆15Jun 27, 2022Updated 3 years ago
- ☆37Apr 5, 2025Updated 11 months ago
- CholecTriplet 2022 challenge on surgical action triplet detection☆12Sep 17, 2025Updated 5 months ago
- Laparoscopic video dataset for surgical action triplet recognition☆43Sep 17, 2025Updated 5 months ago
- [TMI'22]Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation☆23Dec 20, 2022Updated 3 years ago
- ☆29Feb 7, 2024Updated 2 years ago
- ☆66Feb 1, 2024Updated 2 years ago
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆33Mar 7, 2024Updated 2 years ago
- 【CVPR2026】Official repository for the paper "LEMON: A Large Endoscopic MONocular Dataset and Foundation Model for Perception in Surgical …☆81Feb 24, 2026Updated last week
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment…☆35Sep 17, 2025Updated 5 months ago
- This project is an AI Recruitment System designed to accelerate the hiring process for HR and technical recruiters.☆14Jan 3, 2025Updated last year
- Reading list and publicly available datasets for surgical vision☆40Dec 2, 2021Updated 4 years ago
- Self hosted AI workflow for scraping Instagram Reels (audio and description). Extracting, summarising and categorising, then storing all …☆28Jan 10, 2026Updated last month
- DisTime: Distribution-based Time Representation for Video Large Language Models.☆19Jul 10, 2025Updated 7 months ago
- [AAAI2024] An official pytorch implement of the paper: Vision-Language Pre-training with Object Contrastive Learning for 3D Scene Underst…☆13Dec 8, 2024Updated last year
- Cheatsheet for slurm command lines☆10Apr 9, 2023Updated 2 years ago
- Video Visual Relation Detection (VidVRD) tracklets generation. also for ACM MM Visual Relation Understanding Grand Challenge☆39Dec 5, 2022Updated 3 years ago
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & it's caption derrived f…☆16Apr 22, 2021Updated 4 years ago
- An AI-powered tool that translates plain English commands into multi-step API workflows, automating the entire testing process.☆17Jul 27, 2025Updated 7 months ago
- 基于触发词的燃气事件抽取,包括:时间、地点、原因、后果、组织等实体信息☆10Apr 13, 2021Updated 4 years ago
- Dataset: UET Driver Activity Recognition☆10Apr 19, 2022Updated 3 years ago
- Official PyTorch implementation of: "Cannot See the Forest for the Trees: Aggregating Multiple Viewpoints to Better Classify Objects in V…☆14Aug 29, 2022Updated 3 years ago
- Code for our project CROWN (Conversational Passage Ranking by Reasoning over Word Networks)☆10Jan 11, 2024Updated 2 years ago
- Agentic translation using reflection workflow, refactored and sugared.☆11Sep 25, 2024Updated last year
- [ICLR'25] Do Egocentric Video-Language Models Truly Understand Hand-Object Interactions?☆12Apr 11, 2025Updated 10 months ago
- Code for the paper "Multi-Task Learning of Object States and State-Modifying Actions from Web Videos" published in TPAMI☆11Mar 3, 2024Updated 2 years ago
- ☆17Feb 8, 2026Updated 3 weeks ago
- [COLING22] Text-to-Text Extraction and Verbalization of Biomedical Event Graphs☆10Nov 5, 2022Updated 3 years ago
- [CVPR 2022] Sequential Voting with Relational Box Fields for Active Object Detection☆10Jun 19, 2022Updated 3 years ago
- Open-Retrieval Conversational Machine Reading: A new setting & OR-ShARC dataset☆13Nov 19, 2022Updated 3 years ago
- ☆14Mar 11, 2025Updated 11 months ago
- ☆12May 7, 2018Updated 7 years ago
- 舆情项目处理层 分词 情感分析☆10Mar 22, 2016Updated 9 years ago
- 公安网备 敏感词过滤词☆14Oct 7, 2018Updated 7 years ago
- ☆12Oct 21, 2019Updated 6 years ago
- In this codebase we establish a benchmark for egocentric user adaptation based on Ego4d.First, we start from a population model which ha…☆15Jan 16, 2025Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago