This repository contains video datasets for training models on coarse- to fine-grained temporal classification tasks (phase, step, and action recognition).
☆16 · Oct 26, 2021 · Updated 4 years ago
Alternatives and similar repositories for video-action-recognition-datasets
Users that are interested in video-action-recognition-datasets are comparing it to the libraries listed below.
- IEEE TMI 2022: Exploring Segment-level Semantics for Online Phase Recognition from Surgical Videos ☆15 · Jun 27, 2022 · Updated 3 years ago
- ☆18 · Sep 19, 2025 · Updated 6 months ago
- [TMI'2021] Temporal Memory Relation Network for Workflow Recognition from Surgical Video ☆67 · Feb 15, 2022 · Updated 4 years ago
- ☆38 · Apr 5, 2025 · Updated last year
- Towards context-aware head-mounted display-based augmented reality for surgical guidance. ☆25 · Jun 17, 2022 · Updated 3 years ago
- CholecTriplet 2022 challenge on surgical action triplet detection ☆12 · Sep 17, 2025 · Updated 6 months ago
- Laparoscopic video dataset for surgical action triplet recognition ☆43 · Sep 17, 2025 · Updated 6 months ago
- [TMI'22] Exploring Intra- and Inter-Video Relation for Surgical Semantic Scene Segmentation ☆23 · Dec 20, 2022 · Updated 3 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016 ☆10 · Nov 25, 2016 · Updated 9 years ago
- ☆68 · Feb 1, 2024 · Updated 2 years ago
- Reading list and publicly available datasets for surgical vision ☆40 · Dec 2, 2021 · Updated 4 years ago
- Official repository of the GraSP dataset and implementation of TAPIS ☆51 · Dec 31, 2024 · Updated last year
- [CVPR2022] Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos ☆102 · Oct 30, 2022 · Updated 3 years ago
- Official repository for "Dissecting Self-Supervised Learning Methods for Surgical Computer Vision" ☆45 · May 23, 2025 · Updated 10 months ago
- Papers of ComputerVision x Surgery ☆112 · Jan 7, 2024 · Updated 2 years ago
- ☆13 · May 6, 2024 · Updated last year
- This repository contains code for our paper titled "A semi-supervised teacher-student framework for surgical tool detection and localizat… ☆10 · Nov 16, 2023 · Updated 2 years ago
- The Code for M2CAI19 Paper: Hard Frame Detection and Online Mapping for Surgical Phase Recognition ☆14 · Oct 31, 2019 · Updated 6 years ago
- Training a vision-transformer-based model to detect violence in real-life videos ☆10 · Dec 7, 2023 · Updated 2 years ago
- How to build a text-to-image app using Stable Diffusion via Hugging Face ☆10 · May 28, 2023 · Updated 2 years ago
- CVPR'19 experiments with (on-manifold) adversarial examples. ☆43 · Feb 27, 2020 · Updated 6 years ago
- This repo contains an implementation code for the weakly supervised surgical tool tracker. In this research, the temporal dependency in s… ☆43 · Sep 1, 2020 · Updated 5 years ago
- Detects cat 😺 faces in images or video using OpenCV ☆19 · Sep 8, 2020 · Updated 5 years ago
- CV codes from CIAM Group at SUSTech, Shenzhen, China ☆12 · Aug 26, 2024 · Updated last year
- ☆18 · Sep 30, 2024 · Updated last year
- This repository contains the code associated with our 2023 TMI paper "Latent Graph Representations for Critical View of Safety Assessment… ☆36 · Sep 17, 2025 · Updated 6 months ago
- [ECCV 2024] Official Implementation of "OphNet: A Large-Scale Video Benchmark for Ophthalmic Surgical Workflow Understanding" ☆62 · Jul 5, 2025 · Updated 9 months ago
- OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams ☆81 · Mar 15, 2026 · Updated last month
- I have created a dataset of Image-Text-Pairs by using the cosine similarity of the CLIP embeddings of the image & its caption derived f… ☆16 · Apr 22, 2021 · Updated 4 years ago
- ☆17 · Sep 17, 2025 · Updated 6 months ago
- [ECCV 2024] Official Implementation of "Appearance-Based Refinement for Object-Centric Motion Segmentation" Junyu Xie, Weidi Xie, Andrew … ☆13 · Oct 23, 2024 · Updated last year
- Perform RAG (Retrieval-Augmented Generation) from your PDFs using this Colab notebook! Powered by Llama 2 ☆16 · Mar 24, 2024 · Updated 2 years ago
- An authentication proxy for Google Cloud managed databases ☆26 · Mar 28, 2023 · Updated 3 years ago
- Long Surgical Phase Recognition ☆24 · Nov 7, 2024 · Updated last year
- Contains interesting projects like Cat face detection, cat face recognition, code generation, Building chatbot, finding similar documents… ☆24 · Jun 9, 2024 · Updated last year
- The official codebase of FineAction dataset. We will update the data and code of our FineAction. ☆22 · Apr 10, 2025 · Updated last year
- [BMVC 2021] OMAD: Object Model with Articulated Deformations for Pose Estimation and Retrieval ☆12 · Dec 17, 2021 · Updated 4 years ago
- ☆12 · Sep 19, 2022 · Updated 3 years ago
- TMI 2023: Less is More: Surgical Phase Recognition from Timestamp Supervision ☆21 · Feb 9, 2023 · Updated 3 years ago