[CVPR'2022, TPAMI'2024] LAVT: Language-Aware Vision Transformer for Referring Segmentation
☆25Jan 21, 2025Updated last year
Alternatives and similar repositories for LAVT-RS
Users that are interested in LAVT-RS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery", CVPR 2024☆45Jun 4, 2024Updated last year
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆93Oct 15, 2025Updated 7 months ago
- The official implementation of our paper ''IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Prima…☆20Apr 6, 2025Updated last year
- [CVPR'2025] VoCo-LLaMA: This repo is the official implementation of "VoCo-LLaMA: Towards Vision Compression with Large Language Models".☆205Jun 18, 2025Updated 11 months ago
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery"☆25Jul 25, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning☆164Jun 2, 2025Updated 11 months ago
- [CVPR 2023] LOGO: A Long-Form Video Dataset for Group Action Quality Assessment☆46Apr 9, 2024Updated 2 years ago
- [TIP 2025] Self-Calibrated CLIP for Training-Free Open-Vocabulary Segmentation☆71Mar 27, 2026Updated 2 months ago
- The repository of VG-Refiner paper☆19Dec 9, 2025Updated 5 months ago
- [NeurIPS'24]Efficient and accurate memory saving method towards W4A4 large multi-modal models.☆101Jan 3, 2025Updated last year
- [CVPR 2024 oral]This repository contains the official implementation of "FlowIE: Efficient Image Enhancement via Rectified Flow"☆149Jan 13, 2025Updated last year
- Chat about anything on any video!☆39Sep 5, 2023Updated 2 years ago
- Yet another RL Baseline repo.☆12May 28, 2024Updated 2 years ago
- This is the official implementation of ICCV 2025 "Flash-VStream: Efficient Real-Time Understanding for Long Video Streams"☆278Oct 15, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ ECCV 2024 ] MotionLCM: This repo is the official implementation of "MotionLCM: Real-time Controllable Motion Generation via Latent Cons…☆456Feb 24, 2025Updated last year
- [NeurIPS 2024] Geometry-Aware Large Reconstruction Model for Efficient and High-Quality 3D Generation☆174Sep 30, 2024Updated last year
- Pytorch implementation of "Move to See Better: Self-Improving Embodied Object Detection" (https://arxiv.org/abs/2012.00057)☆24Apr 5, 2021Updated 5 years ago
- A video question answering dataset that focuses on the dynamics properties of objects (velocity, acceleration) and their collisions withi…☆20Apr 23, 2025Updated last year
- [IEEE TCSVT 2023] The implementation of our paper Semi-Supervised Subspace Clustering via Tensor Low-Rank Representation.☆25Dec 21, 2023Updated 2 years ago
- [CVPR'24] The repository provides code for running inference and training for "Segment and Caption Anything" (SCA) , links for downloadin…☆231Sep 30, 2024Updated last year
- RS-Paper-Hub: A curated collection of remote sensing papers from arXiv. 遥感论文社:打造遥感领域的专属论文集(如卫星、无人机、地面基站)(http://rspaper.top/)☆41Updated this week
- Pytorch implementation for the paper: "RVCDet: Rethinking Voxelization and Classification for 3D Object Detection" [ICONIP-2022]☆13Apr 15, 2024Updated 2 years ago
- ☆12May 5, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official implementation of CVPR 2025 paper "MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes"☆31Feb 24, 2025Updated last year
- ☆19Apr 11, 2026Updated last month
- Rewrite the cmakefile to install and run it on Ubuntu☆15Sep 11, 2024Updated last year
- [SIGGRAPH 2025] Official code of the paper "FlexiAct: Towards Flexible Action Control in Heterogeneous Scenarios"☆343Oct 30, 2025Updated 6 months ago
- ☆40May 5, 2026Updated 3 weeks ago
- A list of awesome human motion generation papers. Continuing to be updated!!!☆39Sep 28, 2025Updated 8 months ago
- (TIP 2024) Towards Robust Referring Image Segmentation☆37Mar 2, 2024Updated 2 years ago
- RL based agents for real robots☆10Oct 31, 2022Updated 3 years ago
- [CVPR 2024 Highlight] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering☆10Jul 29, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18May 7, 2022Updated 4 years ago
- ☆22Mar 17, 2026Updated 2 months ago
- This Keras code is for the paper A. Jamali, Ali and Roy, Swalpa Kumar and Hong, Danfeng and Atkinson, Peter M and Ghamisi, Pedram, "[Spat…☆11Jan 22, 2024Updated 2 years ago
- ☆12Aug 2, 2022Updated 3 years ago
- Plan, Posture and Go: Towards Open-World Text-to-Motion Generation☆42Nov 19, 2024Updated last year
- [NeurIPS2025] The official implementation of MindOmni: Unleashing Reasoning Generation in Vision Language Models with RGPO☆140Oct 15, 2025Updated 7 months ago
- Iterative training on pseudo-labeled data experiment on the MNIST-dataset☆11Sep 3, 2024Updated last year