[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆24Jan 21, 2025Updated last year
Alternatives and similar repositories for LAVT-RS
Users that are interested in LAVT-RS are comparing it to the libraries listed below
Sorting:
- ☆22May 9, 2024Updated last year
- [CVPR 2024] Narrative Action Evaluation with Prompt-Guided Multimodal Interaction☆42May 16, 2024Updated last year
- ☆17Feb 26, 2024Updated 2 years ago
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024☆45Jun 4, 2024Updated last year
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated 11 months ago
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆87Oct 15, 2025Updated 4 months ago
- [ICCV2025] AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆98Jun 26, 2025Updated 8 months ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆46Apr 9, 2024Updated last year
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning☆158Jun 2, 2025Updated 9 months ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 3 months ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆65Dec 22, 2025Updated 2 months ago
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆203Jun 18, 2025Updated 8 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆98Jan 3, 2025Updated last year
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"☆25Jul 25, 2024Updated last year
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction☆171Dec 15, 2025Updated 2 months ago
- Yet another RL Baseline repo.☆12May 28, 2024Updated last year
- ☆51Aug 22, 2025Updated 6 months ago
- Chat about anything on any video!☆39Sep 5, 2023Updated 2 years ago
- This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"☆142Jan 13, 2025Updated last year
- [IEEE TCSVT 2023] The implementation of our paper Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation.☆25Dec 21, 2023Updated 2 years ago
- [ ECCV 2024 ] MotionLCM: This repo is the official implementation of "MotionLCM: Real-time Controllable Motion Generation via Latent Cons…☆444Feb 24, 2025Updated last year
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation☆173Sep 30, 2024Updated last year
- (TIP 2024) Towards Robust Referring Image Segmentation☆36Mar 2, 2024Updated 2 years ago
- Official code for "ContrastMask: Contrastive Learning to Segment Every Thing" (CVPR2022)☆35May 1, 2022Updated 3 years ago
- code for TIDEE: Novel Room Reorganization using Visuo-Semantic Common Sense Priors☆40Nov 21, 2023Updated 2 years ago
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆61Jul 1, 2025Updated 8 months ago
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆38Jun 4, 2024Updated last year
- 同济软院编译原理课程Repository @2014☆62Jan 7, 2015Updated 11 years ago
- SpeedVision is an AI-powered tool that detects and calculates vehicle speed from video footage using YOLO-based object detection and fram…☆10Sep 22, 2024Updated last year
- ☆23Dec 11, 2025Updated 2 months ago
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆16Sep 7, 2024Updated last year
- ☆30Updated this week
- The detector of wire's breakage in power system. (Python, Opencv...)☆12Oct 29, 2018Updated 7 years ago
- Plan, Posture and Go: Towards Open-World Text-to-Motion Generation☆42Nov 19, 2024Updated last year
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated 11 months ago
- ☆20Jun 12, 2025Updated 8 months ago
- ☆12Mar 21, 2025Updated 11 months ago
- Code for CoRL 2022 paper: https://arxiv.org/abs/2211.09006 (ToolFlowNet, for simulation envs)☆11Mar 16, 2023Updated 2 years ago
- Image-processing filters implemented on GPU with OpenCL☆12Jun 7, 2022Updated 3 years ago