[EMNLP 2025 Main] Official implementation of VRoPE: Rotary Position Embedding for Video Large Language Models.
☆28Nov 18, 2025Updated 7 months ago
Alternatives and similar repositories for VRoPE
Users that are interested in VRoPE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"☆12Mar 1, 2025Updated last year
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆17Apr 2, 2025Updated last year
- An efficient GRPO training util.☆55Jun 13, 2025Updated last year
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆57Mar 9, 2025Updated last year
- ☆14Apr 10, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆17Updated this week
- ☆11Oct 2, 2024Updated last year
- ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without rely…☆55Sep 4, 2023Updated 2 years ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026)☆48Jun 11, 2025Updated last year
- ☆19Mar 5, 2024Updated 2 years ago
- ☆20Mar 31, 2025Updated last year
- The official code repository for the paper "CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments…☆32Jun 14, 2026Updated 2 weeks ago
- [CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion☆43Mar 21, 2025Updated last year
- This repo holds the official code and data for "Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression Segmentati…☆73Jun 3, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆221Apr 15, 2026Updated 2 months ago
- This repo holds the official code and data for "Beyond Literal Descriptions: Understanding and Locating Open-World Objects Aligned with H…☆16May 21, 2024Updated 2 years ago
- The code for "VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by VIdeo SpatioTemporal Augmentation" [CVPR2025]☆20Feb 27, 2025Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- ☆62Mar 3, 2025Updated last year
- (ICCV2023) A large-pose Flickr face dataset comprised of 19,590 high-quality real large-pose portrait images.☆81Mar 25, 2024Updated 2 years ago
- Text-guided 3D texture generation using training-free multi-diffusion in UV space.☆13Apr 7, 2025Updated last year
- Code for "Is CLIP ideal? No. Can we fix it? Yes!"☆57Dec 12, 2025Updated 6 months ago
- [SIGGRAPH Asia 2023] Official pytorch implementation of "360° Reconstruction From a Single Image Using Space Carved Outpainting"☆17Sep 15, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆70Jan 28, 2026Updated 5 months ago
- [CVPR 2025] VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification☆50Mar 24, 2025Updated last year
- implementation of dualformer☆25Mar 1, 2025Updated last year
- [AAAI 2026] Few-step Flow for 3D Generation via Marginal-Data Transport Distillation☆54Apr 29, 2026Updated 2 months ago
- Simple implementation of Retrieval-Augmented Generation System☆28Oct 24, 2024Updated last year
- the official implementation of the paper: Neural Parameterization for Dynamic Human Head Editing☆31Feb 14, 2024Updated 2 years ago
- (Siggraph Asia 2023) Project Page of "HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image"☆10Dec 9, 2023Updated 2 years ago
- Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic Cognition☆31May 14, 2025Updated last year
- All code for FlairGPT: Repurposing LLMs for Interior Designs, Eurographics 2025☆21Mar 6, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Official code of AAAI'23 paper AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio written in PyTorch☆43Dec 14, 2023Updated 2 years ago
- Learning to Generate UTR Sequences for Optimized Ribosome Load and Gene Expression☆23May 11, 2025Updated last year
- [ICLR 2026] Code for our paper "Next Visual Granularity Generation".☆54Jan 26, 2026Updated 5 months ago
- BVH animation viewer.☆10Sep 1, 2012Updated 13 years ago
- Official implementation for "Diffusion Instruction Tuning"☆35Apr 1, 2026Updated 3 months ago
- [ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model☆43Dec 25, 2024Updated last year
- Official pytorch implementation of paper "High-quality Animatable Eyelid Shapes from Lightweight Captures" (SIGGRAPH Asia 2024).☆39Dec 11, 2024Updated last year