This repo contains the code for our TMLR paper: A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
☆27 · Mar 20, 2025 · Updated last year
Alternatives and similar repositories for Axial-VS
Users that are interested in Axial-VS are comparing it to the libraries listed below.
- [Pattern Recognition 2024] Semantic-Aware Frame-Event Fusion based Pattern Recognition via Large Vision-Language Models, Dong Li, Jiandon… ☆18 · Jan 18, 2025 · Updated last year
- Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Des… ☆55 · Aug 27, 2025 · Updated 8 months ago
- ViCaS: A Dataset for Combining Holistic and Pixel-level Video Understanding using Captions with Grounded Segmentation (CVPR'25) ☆21 · Apr 2, 2025 · Updated last year
- This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model ☆101 · Jul 15, 2024 · Updated last year
- [CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era" ☆211 · Jun 9, 2024 · Updated last year
- ☆28 · Apr 4, 2025 · Updated last year
- a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer. ☆80 · Jul 28, 2023 · Updated 2 years ago
- [MICCAI 2024] Embracing Massive Medical Data ☆20 · Jul 5, 2024 · Updated last year
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022 ☆14 · Nov 9, 2022 · Updated 3 years ago
- ☆13 · Nov 29, 2023 · Updated 2 years ago
- [CVPR 2025] Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation ☆26 · Nov 17, 2025 · Updated 5 months ago
- ☆203 · Apr 21, 2026 · Updated last week
- [ECAI 2023] MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient ☆32 · Dec 8, 2023 · Updated 2 years ago
- The official implementation for the paper [ODTrack: Online Dense Temporal Token Learning for Visual Tracking]. ☆180 · Oct 7, 2024 · Updated last year
- [ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Lea… ☆99 · May 3, 2024 · Updated last year
- [NeurIPS 2025] Completeness-Aware Reconstruction Enhancement ☆36 · Oct 18, 2025 · Updated 6 months ago
- [CVPR 2024] Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations ☆24 · Jan 20, 2025 · Updated last year
- [NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convoluti… ☆341 · Feb 5, 2024 · Updated 2 years ago
- [NeurIPS VLM workshop 2024] In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Underst… ☆23 · Mar 16, 2025 · Updated last year
- This repo provides tutorials and a library to help CV researchers to generate data using blender. ☆15 · Feb 2, 2020 · Updated 6 years ago
- Codebase for the paper-Elucidating the design space of language models for image generation ☆46 · Nov 17, 2024 · Updated last year
- The official PyTorch implementation of the CVPR 2023 paper "Contrastive Grouping with Transformer for Referring Image Segmentation". ☆51 · Apr 17, 2024 · Updated 2 years ago
- [AAAI 2025] Official data and code for "TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances" ☆15 · Sep 11, 2025 · Updated 7 months ago
- ☆12 · Aug 16, 2023 · Updated 2 years ago
- MAexp is a generic platform for RL-based multi-agent exploration ☆112 · Aug 25, 2025 · Updated 8 months ago
- [ICCV 2025] Dataset of 10,135 abdominal CT scans with 15,130 tumors annotated across six organs and 5,893 controls. The AI ranks first in… ☆56 · Nov 3, 2025 · Updated 5 months ago
- Code For Our Work: DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries [ECCV-2024] ☆14 · Jul 11, 2024 · Updated last year
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- ☆17 · Jun 20, 2023 · Updated 2 years ago
- [CVPR'25 Highlight] A VQA benchmark for 6D spatial reasoning. ☆20 · Updated this week
- ☆15 · Dec 16, 2023 · Updated 2 years ago
- VideoAuteur: Towards Long Narrative Video Generation ☆43 · Oct 22, 2025 · Updated 6 months ago
- [IEEE TMI] Tumor synthesis leveraging medical reports. ☆50 · Jan 26, 2026 · Updated 3 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing ☆27 · Jan 27, 2026 · Updated 3 months ago
- ROS package for SOTA Computer Vision Models including SAM, Cutie, GroundingDINO, YOLO-World, VLPart, DEVA and MaskDINO. ☆51 · Aug 4, 2024 · Updated last year
- 2D road segmentation using lidar data during training ☆43 · Dec 21, 2023 · Updated 2 years ago
- Does patch ordering affect context-limited vision transformers? ☆17 · Oct 10, 2025 · Updated 6 months ago
- [ECCV2024] PartCraft: Crafting Creative Objects by Parts ☆101 · Jan 22, 2025 · Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences ☆44 · Mar 11, 2025 · Updated last year