Yxxxb / LAVT-RSView external linksLinks
[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆24Jan 21, 2025Updated last year
Alternatives and similar repositories for LAVT-RS
Users that are interested in LAVT-RS are comparing it to the libraries listed below
Sorting:
- ☆22May 9, 2024Updated last year
- ☆17Feb 26, 2024Updated last year
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024☆45Jun 4, 2024Updated last year
- [CVPR 2024] The repository contains the official implementation of "Open-Vocabulary Segmentation with Semantic-Assisted Calibration"☆76Sep 23, 2024Updated last year
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆81Oct 15, 2025Updated 4 months ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated 10 months ago
- [ICCV2025] AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆97Jun 26, 2025Updated 7 months ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆46Apr 9, 2024Updated last year
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning☆157Jun 2, 2025Updated 8 months ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆58Dec 22, 2025Updated last month
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆95Jan 3, 2025Updated last year
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆169Dec 15, 2025Updated 2 months ago
- Yet another RL Baseline repo.☆12May 28, 2024Updated last year
- ☆51Aug 22, 2025Updated 5 months ago
- Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆101Apr 3, 2025Updated 10 months ago
- ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation☆122May 8, 2025Updated 9 months ago
- [IEEE TCSVT 2023] The implementation of our paper Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation.☆25Dec 21, 2023Updated 2 years ago
- This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"☆269Oct 15, 2025Updated 4 months ago
- Pytorch implementation of "Move to See Better: Self-Improving Embodied Object Detection" (https://arxiv.org/abs/2012.00057)☆24Apr 5, 2021Updated 4 years ago
- [CVPR'24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadin…☆231Sep 30, 2024Updated last year
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation☆173Sep 30, 2024Updated last year
- [ECCV 2024] PyTorch implementation of Rethinking Features-Fused-Pyramid-Neck for Object Detection☆49Sep 7, 2025Updated 5 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated last year
- 一个操作系统☆31Oct 27, 2021Updated 4 years ago
- [ECCV2022] Global Spectral Filter Memory Network for Video Object Segmentation☆42Jul 13, 2022Updated 3 years ago
- Official code for "ContrastMask: Contrastive Learning to Segment Every Thing" (CVPR2022)☆35May 1, 2022Updated 3 years ago
- ☆22Dec 11, 2025Updated 2 months ago
- A list of awesome human motion generation papers. Continuing to be updated!!!☆38Sep 28, 2025Updated 4 months ago
- code for TIDEE: Novel Room Reorganization using Visuo-Semantic Common Sense Priors☆40Nov 21, 2023Updated 2 years ago
- Computer Vision Written Note in Chinese| 同济大学计算机视觉课程手写笔记☆25Jul 11, 2021Updated 4 years ago
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆59Jul 1, 2025Updated 7 months ago
- 同济软院编译原理课程Repository @2014☆62Jan 7, 2015Updated 11 years ago
- [ECCV 2024] Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators☆47Sep 11, 2024Updated last year
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆10Sep 22, 2024Updated last year
- ☆12Jan 18, 2024Updated 2 years ago
- The detector of wire's breakage in power system. (Python, Opencv...)☆12Oct 29, 2018Updated 7 years ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year
- OpenTMA: support text-motion alignment for HumanML3D, Motion-X, and UniMoCap☆46May 22, 2024Updated last year
- Plan, Posture and Go: Towards Open-World Text-to-Motion Generation☆42Nov 19, 2024Updated last year