qiujihao19/Artemis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/qiujihao19/Artemis)

qiujihao19 / Artemis

[NeurIPS 2024] Artemis: Towards Referential Understanding in Complex Videos

☆27

Alternatives and similar repositories for Artemis

Users that are interested in Artemis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mingrui-wu / OSI-Bench
View on GitHub
Official repo of From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs
☆24Jun 23, 2026Updated last month
sunsmarterjie / ChatterBox
View on GitHub
[AAAI2025] ChatterBox: Multi-round Multimodal Referring and Grounding, Multimodal, Multi-round dialogues
☆61May 2, 2025Updated last year
PKU-YuanGroup / N-LoRA
View on GitHub
【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".
☆38Dec 5, 2024Updated last year
PKU-YuanGroup / GPT-as-Language-Tree
View on GitHub
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective
☆46Jan 18, 2025Updated last year
PKU-YuanGroup / AsFT
View on GitHub
Code for the paper "AsFT: Anchoring Safety During LLM Fune-Tuning Within Narrow Safety Basin".
☆37Jul 10, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
qiujihao19 / LongVideo-R1
View on GitHub
[CVPR 2026] LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding
☆50Jul 7, 2026Updated 3 weeks ago
AkitsukiM / VMamba-DOTA
View on GitHub
☆31Sep 24, 2024Updated last year
Lyu6PosHao / HME
View on GitHub
Here is the official code for Nature Communications "Navigating Chemical-Linguistic Sharing Space with Heterogeneous Molecular Encoding".
☆23May 23, 2026Updated 2 months ago
PKU-YuanGroup / EvaGaussians
View on GitHub
☆60Mar 16, 2025Updated last year
PKU-YuanGroup / Next-Patch-Prediction
View on GitHub
[AAAI26] Next Patch Prediction
☆129Jan 2, 2025Updated last year
callsys / ControlCap
View on GitHub
[ECCV 2024] ControlCap: Controllable Region-level Captioning
☆81Oct 25, 2024Updated last year
PKU-YuanGroup / UniSandBox
View on GitHub
Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward
☆60Nov 27, 2025Updated 8 months ago
VIStA-H / GPT-4V_Social_Media
View on GitHub
GPT-4V(ision) as A Social Media Analysis Engine
☆39Dec 20, 2024Updated last year
yuweihao / LV-BERT
View on GitHub
LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)
☆18May 10, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
PKU-YuanGroup / GS2E
View on GitHub
[NeurIPS 2025 D&B🔥] Implementation of "GS2E: Gaussian Splatting is an Effective Data Generator for Event Stream Generation"
☆20Jun 1, 2025Updated last year
PKU-YuanGroup / OSP-Next
View on GitHub
OSP-Next
☆68Jun 22, 2026Updated last month
PKU-YuanGroup / SwapAnyone
View on GitHub
An official implementation of SwapAnyone.
☆77Mar 14, 2025Updated last year
PKU-YuanGroup / Look-Back
View on GitHub
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆100Jul 10, 2025Updated last year
Tencent-Hunyuan / GEAR
View on GitHub
☆65Jul 1, 2026Updated 3 weeks ago
PKU-YuanGroup / LLMBind
View on GitHub
LLMBind: A Unified Modality-Task Integration Framework
☆19Jun 16, 2024Updated 2 years ago
KangarooGroup / Kangaroo
View on GitHub
official impelmentation of Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
☆67Aug 30, 2024Updated last year
callsys / DynRefer
View on GitHub
[CVPR 2025] DynRefer: Delving into Region-level Multimodal Tasks via Dynamic Resolution
☆59Mar 4, 2025Updated last year
SuperFCR / E-4DGS
View on GitHub
[🔥ACMMM 2025] mplemetation of "E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras"
☆19Aug 14, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
IDEA-XL / ChemCoTBench
View on GitHub
LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry
☆55Oct 9, 2025Updated 9 months ago
callsys / GenPromp
View on GitHub
[ICCV 2023] Generative Prompt Model for Weakly Supervised Object Localization
☆57Nov 10, 2023Updated 2 years ago
MzeroMiko / XDLM
View on GitHub
[ICML 2026 Spotlight] Code for miXed Discrete Diffusion Language Model
☆27Mar 16, 2026Updated 4 months ago
hitsz-zuoqi / VideoMV
View on GitHub
☆16Mar 25, 2024Updated 2 years ago
CASIA-IVA-Lab / VideoNIAH
View on GitHub
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
☆57Mar 9, 2025Updated last year
PKU-YuanGroup / Video-Bench
View on GitHub
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
☆140Dec 31, 2023Updated 2 years ago
mlvlab / ST-VLM
View on GitHub
☆13Mar 28, 2025Updated last year
AZZMM / CC-Diff
View on GitHub
Implementation of paper "CC-Diff: Enhancing Contextual Coherence in Remote Sensing Image Synthesis"
☆28Dec 19, 2025Updated 7 months ago
google-research-datasets / 2.5vrd
View on GitHub
This dataset contains about 110k images annotated with the depth and occlusion relationships between arbitrary objects. It enables resear…
☆16Apr 28, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
PKU-YuanGroup / OpenAI4S
View on GitHub
9.9 元豆包API复刻 Claude Science
☆129Updated this week
munanning / MADAv2
View on GitHub
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation
☆25Jul 8, 2023Updated 3 years ago
PKU-YuanGroup / Hallucination-Attack
View on GitHub
Attack to induce LLMs within hallucinations
☆163May 17, 2024Updated 2 years ago
ncTimTang / AKS
View on GitHub
[CVPR 2025] Adaptive Keyframe Sampling for Long Video Understanding
☆228Dec 19, 2025Updated 7 months ago
zhentao-zou / MURE
View on GitHub
Beyond Textual CoT: Interleaved Text-image chains with Deep Confidence Reasoning for Image Editing
☆19Jun 24, 2026Updated last month
cxh0519 / Progressive3D
View on GitHub
Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [IC…
☆123Jun 27, 2024Updated 2 years ago
marinero4972 / CyberV
View on GitHub
☆20Jun 10, 2025Updated last year