neu-vi/FleVRS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/neu-vi/FleVRS)

neu-vi / FleVRS

FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024

☆22

Alternatives and similar repositories for FleVRS

Users that are interested in FleVRS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

neu-vi / struct2d
View on GitHub
Code release for 'Struct2D: A Perception-Guided Framework for Spatial Reasoning in MLLMs' (NeurIPS 2025)
☆31Oct 28, 2025Updated 9 months ago
Espere-1119-Song / Video-MMLU
View on GitHub
A Massive Multi-Discipline Lecture Understanding Benchmark
☆34Apr 20, 2026Updated 3 months ago
WildVision-AI / LMM-Engines
View on GitHub
☆17Oct 22, 2024Updated last year
amazon-science / object-centric-multiple-object-tracking
View on GitHub
☆37Oct 18, 2023Updated 2 years ago
neu-vi / MARDM
View on GitHub
☆124Jul 22, 2026Updated last week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Owen718 / AWRCP
View on GitHub
ICCV'23 | Adverse Weather Removal with Codebook Priors
☆10Aug 28, 2023Updated 2 years ago
neu-vi / SMooDi
View on GitHub
☆116Sep 3, 2025Updated 10 months ago
SHI-Labs / VisPer-LM
View on GitHub
[NeurIPS 2025] Elevating Visual Perception in Multimodal LLMs with Visual Embedding Distillation
☆74Oct 17, 2025Updated 9 months ago
wenhaochai / PoseDA
View on GitHub
[ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation
☆24Aug 26, 2023Updated 2 years ago
UCSB-AI / FedVLN
View on GitHub
[ECCV 2022] Official pytorch implementation of the paper "FedVLN: Privacy-preserving Federated Vision-and-Language Navigation"
☆14Oct 8, 2022Updated 3 years ago
SHI-Labs / VCoder
View on GitHub
[CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models
☆280Apr 17, 2024Updated 2 years ago
rrmenon10 / ExEnt
View on GitHub
[ACL 2022] CLUES: A Benchmark for Learning Classifiers using Natural Language Explanations
☆10Jun 5, 2022Updated 4 years ago
limenlp / FairLocator
View on GitHub
Code and data for the paper: AI Sees Your Location—But With A Bias Toward The Wealthy World
☆19Dec 15, 2025Updated 7 months ago
NVlabs / RelViT
View on GitHub
[ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning
☆62Sep 10, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
wenhaochai / Awesome-DriveLM
View on GitHub
📚 A collection of resources and papers on Large Language Models in autonomous driving
☆27Oct 30, 2023Updated 2 years ago
princetonvisualai / imagecaptioning-bias
View on GitHub
Code for the paper "Understanding and Evaluating Racial Biases in Image Captioning"
☆12Mar 26, 2026Updated 4 months ago
Wayne-Mai / EgoLoc
View on GitHub
For Ego4D VQ3D Task
☆22Jan 9, 2024Updated 2 years ago
g-jing / phy-world-bench
View on GitHub
☆18Jul 22, 2025Updated last year
ymingxie / PARQ
View on GitHub
Pixel-Aligned Recurrent Queries for Multi-View 3D Object Detection (ICCV23)
☆45Oct 19, 2023Updated 2 years ago
kemaloksuz / BoundingBoxGenerator
View on GitHub
Official PyTorch Implementation of BB Generator & pRoI Generator [WACV2020]
☆30Mar 24, 2021Updated 5 years ago
showlab / Awesome-Long-Context
View on GitHub
A curated list of resources about long-context in large-language models and video understanding.
☆32Aug 8, 2023Updated 2 years ago
Owen718 / LongPrompt-LLamaGen
View on GitHub
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…
☆30Oct 21, 2024Updated last year
chenllliang / DnD-Transformer
View on GitHub
[ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…
☆81Dec 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
UCSB-AI / via-video
View on GitHub
☆25May 12, 2026Updated 2 months ago
concept-fusion / concept-fusion.github.io
View on GitHub
Webpage
☆16Feb 16, 2024Updated 2 years ago
viiika / HumanEdit
View on GitHub
[CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…
☆36May 8, 2025Updated last year
csarron / PuMer
View on GitHub
[ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
☆37Oct 3, 2024Updated last year
neu-vi / SportsSloMo
View on GitHub
SportsSloMo: A New Benchmark and Baseline Models for Human-centric Video Frame Interpolation, CVPR 2024 (https://arxiv.org/abs/2308.16876…
☆79Apr 4, 2024Updated 2 years ago
neu-vi / ezflow
View on GitHub
A modular PyTorch library for optical flow estimation using neural networks
☆136Apr 8, 2024Updated 2 years ago
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
neu-vi / LASER
View on GitHub
[CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction
☆80Mar 18, 2026Updated 4 months ago
NVlabs / Bongard-HOI
View on GitHub
[CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning
☆74Nov 7, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kayburns / women-snowboard
View on GitHub
☆19Nov 22, 2022Updated 3 years ago
atinfinity / stable_diffusion.openvino-docker
View on GitHub
This is a Dockerfile to use stable_diffusion.openvino in Docker container.
☆13Aug 29, 2022Updated 3 years ago
hjwdzh / PrimitiveFitting
View on GitHub
☆18Sep 27, 2021Updated 4 years ago
fpv-iplab / EASG
View on GitHub
Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)
☆47Apr 9, 2025Updated last year
guoqincode / Focus-on-Your-Instruction
View on GitHub
[CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
☆116Mar 22, 2024Updated 2 years ago
shengyuhao / DIVOTrack
View on GitHub
A Novel Dataset and Baseline Method for Cross-View Multi-Object Tracking in DIVerse Open Scenes (IJCV 2024)
☆100Nov 13, 2025Updated 8 months ago
kotoba-tech / Open-GPT-4o
View on GitHub
☆10May 16, 2024Updated 2 years ago