Shunli-Wang / CPR-CoachLinks
Coach-Project
☆13Updated 5 months ago
Alternatives and similar repositories for CPR-Coach
Users that are interested in CPR-Coach are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆57Updated last year
- Official repository for "Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition" [ICCV 2023]☆101Updated last year
- ☆19Updated 2 months ago
- Odd-One-Out: Anomaly Detection by Comparing with Neighbors (CVPR25)☆45Updated 7 months ago
- ☆34Updated last year
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆58Updated 4 months ago
- [IJCV 2024] MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation☆124Updated 9 months ago
- [NeurIPS2022] This is the official implementation of the paper "Expediting Large-Scale Vision Transformer for Dense Prediction without Fi…☆85Updated last year
- [CVPR2023] Code for "Streaming Video Model"☆78Updated 2 years ago
- The official repository of the RePoGen paper☆48Updated last month
- Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).☆83Updated last year
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆14Updated 5 months ago
- ☆19Updated 9 months ago
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆31Updated last year
- Training with Product Digital Twins for AutoRetail Checkout☆18Updated last year
- [CVPR2022] Animal Kingdom: A Large and Diverse Dataset for Animal Behavior Understanding☆144Updated 7 months ago
- Frame Flexible Network (CVPR2023)☆56Updated 2 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆25Updated 5 months ago
- [NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception☆43Updated last year
- [NeurIPS 2023] Official implementation of the paper "CAST: Cross-Attention in Space and Time for Video Action Recognition"☆52Updated last year
- Edge Weight Prediction For Category-Agnostic Pose Estimation☆42Updated last month
- Graph learning framework for long-term video understanding☆65Updated last week
- [IJCAI 2022] Official PyTorch implementation of AggPose: Deep Aggregation Vision Transformer for Infant Pose Estimation☆28Updated 3 years ago
- (ICLR 2024, CVPR 2024) SparseFormer☆74Updated 8 months ago
- Masked Vision-Language Transformer in Fashion☆34Updated last year
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆46Updated 3 years ago
- ☆51Updated last year
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆20Updated last year
- ☆69Updated last year
- [ICCV 2023] Controllable Person Image Synthesis with Pose‑Constrained Latent Diffusion☆41Updated last year