SkiddieAhn / Paper-AnyAnomaly
PyTorch Implementation of the Paper 'AnyAnomaly': Official Version
☆24Updated 3 weeks ago
Alternatives and similar repositories for Paper-AnyAnomaly:
Users that are interested in Paper-AnyAnomaly are comparing it to the libraries listed below
- [NeurIPS 2023] HASSOD: Hierarchical Adaptive Self-Supervised Object Detection☆55Updated last year
- ☆23Updated 5 months ago
- Multi-vision Sensor Perception and Reasoning (MS-PR) benchmark, assessing VLMs on their capacity for sensor-specific reasoning.☆13Updated last month
- Official Pytorch Implementation of Self-emerging Token Labeling☆32Updated last year
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆40Updated last week
- ☆33Updated 4 months ago
- Real-time, YOLO-like object detection using the Florence-2-base-ft model with a user-friendly GUI.☆20Updated last week
- Code of paper "A new baseline for edge detection: Make Encoder-Decoder great again"☆38Updated last month
- EMOv2: Pushing 5M Vision Model Frontier☆45Updated 3 months ago
- [ICCV23] Official Implementation of DARTH: Holistic Test-time Adaptation for Multiple Object Tracking☆19Updated last year
- 🏄 [ICLR 2025] OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer☆42Updated last week
- EdgeSAM model for use with Autodistill.☆26Updated 9 months ago
- FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆14Updated last month
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement☆50Updated last week
- ☆33Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆41Updated 6 months ago
- 6D Rotation Representation for Unconstrained Head Pose Estimation☆13Updated 11 months ago
- [WACV 2025] Efficient Video Object Segmentation via Modulated Cross-Attention Memory☆54Updated last month
- Official PyTorch implementation of "No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding"☆32Updated 10 months ago
- ☆20Updated 2 weeks ago
- LiVOS: Light Video Object Segmentation with Gated Linear Matching (CVPR 2025)☆28Updated 3 weeks ago
- Referring any person or objects given a natural language description. Code base for RexSeek and HumanRef Benchmark☆89Updated 2 weeks ago
- ☆32Updated 8 months ago
- ☆33Updated last week
- SAM-CLIP module for use with Autodistill.☆15Updated last year
- This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories☆27Updated 2 weeks ago
- HybridTrack: A Hybrid Approach for Robust Multi-Object Tracking☆12Updated 4 months ago
- ☆26Updated 2 months ago
- Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model☆49Updated 2 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆58Updated last month