CASIA-IVA-Lab/VRoPE

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CASIA-IVA-Lab/VRoPE)

CASIA-IVA-Lab / VRoPE

[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.

☆28

Alternatives and similar repositories for VRoPE

Users that are interested in VRoPE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

CASIA-IVA-Lab / PrefixGrouper
View on GitHub
An efficient GRPO training util.
☆56Jun 13, 2025Updated last year
ivattyue / Ada-K
View on GitHub
Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"
☆12Mar 1, 2025Updated last year
CASIA-IVA-Lab / SC-Tune
View on GitHub
Official code for CVPR 2024 paper, "SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models"
☆16Apr 22, 2024Updated 2 years ago
Rubics-Xuan / IVG
View on GitHub
This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…
☆15May 21, 2024Updated 2 years ago
berkegokmen1 / counterfactual-chexray-disease-editing
View on GitHub
Official Implementation for the paper titled: "Counterfactual Disease Removal and Generation in Chest X-Rays Using Diffusion Models"
☆15Dec 8, 2025Updated 7 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
abyildirim / md-projtex
View on GitHub
Text-guided 3D texture generation using training-free multi-diffusion in UV space.
☆13Apr 7, 2025Updated last year
CASIA-IVA-Lab / MRES
View on GitHub
This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…
☆74Jun 3, 2024Updated 2 years ago
CASIA-IVA-Lab / VideoNIAH
View on GitHub
VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs
☆57Mar 9, 2025Updated last year
ZhenglinZhou / Zero-1-to-A
View on GitHub
[CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
☆43Mar 21, 2025Updated last year
wrudman / NOTICE
View on GitHub
☆14Apr 10, 2025Updated last year
CASIA-IVA-Lab / COSA
View on GitHub
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model
☆43Dec 25, 2024Updated last year
ali-vilab / CDT
View on GitHub
Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach
☆17Apr 2, 2025Updated last year
RAIVNLab / VideoNet
View on GitHub
CVPR '26 Highlight
☆25May 6, 2026Updated 2 months ago
zeeshannisar / COVID-19
View on GitHub
☆12Jul 20, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Wiselnn570 / VideoRoPE
View on GitHub
[ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++
☆223Apr 15, 2026Updated 3 months ago
irvingzhang0512 / open-images-downloader
View on GitHub
☆14Aug 13, 2021Updated 4 years ago
adobe-research / llava-score
View on GitHub
☆11Oct 2, 2024Updated last year
insait-institute / COM4D
View on GitHub
[CVPR2026] Code repository for the paper "Inferring Compositional 4D Scenes without Ever Seeing One".
☆25Apr 12, 2026Updated 3 months ago
oneThousand1000 / 3DPortraitGAN
View on GitHub
(IEEE TCSVT) 3DPortraitGAN: Learning One-Quarter Headshot 3D GANs from a Single-View Portrait Dataset with Diverse Body Poses
☆33Jul 9, 2025Updated last year
baaivision / DIVA
View on GitHub
[ICLR 2025] Diffusion Feedback Helps CLIP See Better
☆301Jan 23, 2025Updated last year
EzioBy / 3dpe
View on GitHub
[ECCV 2024] 3DPE: Real-time 3D-aware Portrait Editing from a Single Image
☆22Sep 15, 2025Updated 10 months ago
lose4578 / CircleRoPE
View on GitHub
☆15Sep 1, 2025Updated 10 months ago
google-research-datasets / maverics
View on GitHub
MAVERICS (Manually-vAlidated Vq^2a Examples fRom Image-Caption datasetS) is a suite of test-only benchmarks for visual question answering…
☆13Feb 18, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jiahai-feng / binding-iclr
View on GitHub
☆19Mar 5, 2024Updated 2 years ago
jingyu198 / Hyper3D
View on GitHub
☆17Jun 29, 2026Updated 3 weeks ago
three-bee / triplane_edit
View on GitHub
[CVPR 2025 Highlight] Reference-Based 3D-Aware Image Editing with Triplanes
☆44Apr 4, 2025Updated last year
berkegokmen1 / slurm-ssh-compute-node
View on GitHub
Connect to Slurm Compute Nodes Directly from VSCode. Do your debugging and development easily.
☆31Oct 8, 2024Updated last year
ysbsb / awesome-quantization
View on GitHub
Awesome Quantization Paper lists with Codes
☆10Feb 24, 2021Updated 5 years ago
Hongcheng-Gao / HAVEN
View on GitHub
Code and data for paper "Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation".
☆25Oct 22, 2025Updated 8 months ago
shuoyang129 / eamat
View on GitHub
Entity-Aware and Motion-Aware Transformers for Language-driven Action Localization(IJCAI-22)
☆12Oct 11, 2022Updated 3 years ago
Rubics-Xuan / Med-DANet
View on GitHub
Med-DANet Series (ECCV 2022 & WACV 2024)
☆13Jan 2, 2024Updated 2 years ago
StanfordMIMI / LieRE
View on GitHub
[ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.
☆36Aug 13, 2025Updated 11 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
xucao-42 / Neuralangelo_DFD
View on GitHub
Accelerating SDF gradient computation in NeuS-like multi-view reconstruction with directional finite difference (DFD) and patch-based sam…
☆34Mar 24, 2024Updated 2 years ago
daniel-code / TubeViT
View on GitHub
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95Updated this week
Jiahao000 / VICT
View on GitHub
[CVPR 2025] Test-Time Visual In-Context Tuning
☆30Dec 31, 2025Updated 6 months ago
TIGER-AI-Lab / VISTA
View on GitHub
The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]
☆20Feb 27, 2025Updated last year
TingtingLiao / unique3d-diffusion
View on GitHub
☆46Sep 27, 2024Updated last year
CASIA-IVA-Lab / VALOR
View on GitHub
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
☆311Dec 25, 2024Updated last year
CJReinforce / JOWA
View on GitHub
Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
☆30Dec 1, 2024Updated last year