THUNLP-MT/ActiView

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/THUNLP-MT/ActiView)

THUNLP-MT / ActiView

☆11

Alternatives and similar repositories for ActiView

Users that are interested in ActiView are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Yutong-Zhou-cv / AgriBench
View on GitHub
[ECCV 2024 Workshop🎈] The first agriculture benchmark to evaluate MM-LLMs.
☆27Jan 1, 2025Updated last year
alipay / POA
View on GitHub
☆22Aug 8, 2024Updated last year
MiliLab / Text-Before-Vision
View on GitHub
[ICML 2026] Text Before Vision: Staged Knowledge Injection Matters for Agentic RLVR in Ultra-High-Resolution Remote Sensing Understanding
☆16Mar 13, 2026Updated 4 months ago
like413 / VisTA
View on GitHub
[arXiv, 2024] Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection
☆36Jul 2, 2025Updated last year
hulianyuyy / iLLaVA
View on GitHub
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23Jun 24, 2026Updated 3 weeks ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
earth-insights / awesome-layout-to-image
View on GitHub
An up-to-date & curated list of awesome layout to image papers, methods & resources.
☆13Jun 28, 2024Updated 2 years ago
jsntcheng / douyin_funny
View on GitHub
抖音抓包-抖音是个有趣的东西
☆14Oct 16, 2024Updated last year
UMass-Embodied-AGI / FlexAttention
View on GitHub
[ECCV 2024] FlexAttention for Efficient High-Resolution Vision-Language Models
☆49Jan 8, 2025Updated last year
THUNLP-MT / ModelCompose
View on GitHub
Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)
☆31Jan 8, 2025Updated last year
Bili-Sakura / awesome-remote-sensing-visual-generative-models
View on GitHub
A curated list of awesome remote sensing visual generative models, papers, datasets, and resources. This repository focuses exclusively o…
☆20Jul 8, 2026Updated last week
StriveZs / ALPS
View on GitHub
ALPS: An Auto-Labeling and Pre-training Scheme for Remote Sensing Segmentation With Segment Anything Model
☆21Aug 20, 2024Updated last year
ChocoWu / SeTok
View on GitHub
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81Apr 19, 2025Updated last year
om-ai-lab / ZoomEye
View on GitHub
[EMNLP-2025 Oral] ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration
☆91Nov 20, 2025Updated 8 months ago
MitsuiChen14 / DGTRS
View on GitHub
☆32Jun 10, 2026Updated last month
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
reyoung / blog
View on GitHub
☆10Jun 2, 2022Updated 4 years ago
iecashhy / RS-vHeat
View on GitHub
☆15Oct 14, 2025Updated 9 months ago
SivanDoveh / IPLoc
View on GitHub
Repository for the paper: Teaching VLMs to Localize Specific Objects from In-context Examples
☆40Nov 27, 2024Updated last year
Junjue-Wang / EarthVQA
View on GitHub
[AAAI 2024] EarthVQA: Towards Queryable Earth via Relational Reasoning-Based Remote Sensing Visual Question Answering
☆155Jan 26, 2026Updated 5 months ago
SpeechEE / SpeechEE
View on GitHub
☆11Aug 20, 2025Updated 11 months ago
loiccoyle / phomo
View on GitHub
📷 Python package and CLI utility to create photo mosaics - now with GPU support
☆18Mar 6, 2026Updated 4 months ago
vision3d-lab / lightsplat
View on GitHub
[CVPR 2026] LightSplat: Fast and Memory-Efficient Open-Vocabulary 3D Scene Understanding in Five Seconds
☆27Mar 30, 2026Updated 3 months ago
GeoX-Lab / RS-GPT4V
View on GitHub
☆37Jul 1, 2024Updated 2 years ago
mengzaiqiao / awesome-natural-language-reasoning
View on GitHub
A collection of research papers related to Natural Language Reasoning
☆10May 27, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ATR-DBI / Map-EQA
View on GitHub
☆12Oct 10, 2024Updated last year
YijinHuang / FPT
View on GitHub
[TNNLS'25] [MICCAI'24] A Parameter and Memory Efficient Transfer Learning Method
☆35Oct 29, 2025Updated 8 months ago
clab / clab-autodiff-examples
View on GitHub
Examples of using the adept autodifferentiation library for standard NLP learning problems
☆17Sep 2, 2014Updated 11 years ago
aim-uofa / Active-o3
View on GitHub
[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
☆83Apr 30, 2026Updated 2 months ago
VisionXLab / DVGBench
View on GitHub
[ISPRS2026] DVGBench: Implicit-to-Explicit Visual Grounding Benchmark in UAV Imagery with Large Vision-Language Models
☆30Mar 24, 2026Updated 3 months ago
YuJungHeo / kbvqa-public
View on GitHub
☆40Nov 29, 2022Updated 3 years ago
zhangguanghao523 / CMMCoT
View on GitHub
[AAAI'26] Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augm…
☆11Dec 5, 2025Updated 7 months ago
tim-learn / UEO
View on GitHub
ICML-2024 highlight paper "Realistic Unsupervised CLIP Fine-tuning with Universal Entropy Optimization"
☆19Jul 18, 2024Updated 2 years ago
zhu-xlab / ChatEarthNet
View on GitHub
☆41Jun 29, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
91097luke / phileo-bench
View on GitHub
Repo for testing foundation models
☆12Jan 19, 2024Updated 2 years ago
GasolSun36 / SURf
View on GitHub
[EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information
☆11Oct 11, 2024Updated last year
YuanLi95 / KECPM
View on GitHub
Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))
☆12Aug 27, 2024Updated last year
Z-Zheng / dynamic_highres_poverty
View on GitHub
Dynamic, high-resolution poverty measurement in data-scarce environments
☆11Dec 8, 2024Updated last year
metpallyv / DecisionTrees
View on GitHub
Goal of this project is to build Classification Decision Trees and Regression Decision trees without using any Machine learning libraries
☆10Dec 28, 2018Updated 7 years ago
FSoft-AI4Code / VisualCoder
View on GitHub
[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
☆10Feb 9, 2025Updated last year
jmoortgat / DeepRiverFCN
View on GitHub
Codes for Arctic river segmentation using various fully convolutional neural networks.
☆10Dec 27, 2022Updated 3 years ago