Use 2 lines to empower absolute time awareness for Qwen2.5VL's MRoPE
☆29Sep 20, 2025Updated 8 months ago
Alternatives and similar repositories for DATE
Users that are interested in DATE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of paper "LOVE-R1: Advancing Long Video Understanding with Adaptive Zoom-in Mechanism via Multi-Step Reasoning"☆24Nov 1, 2025Updated 7 months ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- [AAAI 2025] SSLFusion: Scale and Space Aligned Latent Fusion Model for Multimodal 3D Object Detection☆18Nov 14, 2025Updated 7 months ago
- [AAAI2025] Revisiting Tampered Scene Text Detection in the Era of Generative AI☆69Jun 7, 2026Updated last week
- VideoEval-Pro: Robust and Realistic Long Video Understanding Evaluation [TMLR26]☆16Jun 1, 2026Updated 2 weeks ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆192Sep 23, 2025Updated 8 months ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- ☐ ☐ A simple, out-of-the-box and cross-platform bbox annotation tool by Python. Try it by `pip install easybox`☆10May 28, 2021Updated 5 years ago
- The code for paper Interpreting Key Mechanisms of Factual Recall in Transformer-Based Language Models.☆13Apr 10, 2024Updated 2 years ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Apr 18, 2026Updated last month
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆12May 26, 2024Updated 2 years ago
- ☆19Jun 14, 2024Updated 2 years ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆46Jun 24, 2025Updated 11 months ago
- ☆10Oct 20, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Visual Instruction Tuning for Qwen2 Base Model☆44Jun 29, 2024Updated last year
- Registration of 3D triangular meshes onto a 2D image can be performed using optimisation and fast X-ray simulation on GPU. Automatic esti…☆11Aug 28, 2019Updated 6 years ago
- content for Using Combine - notes on learning Combine with UIKit and SwiftUI☆15Feb 12, 2024Updated 2 years ago
- Unofficial implementation of MVSS-Net (ICCV 2021) with Pytorch including training code.☆70Sep 26, 2023Updated 2 years ago
- Badminton Analytics using CV - so far, highlight creation and automatic score updation using TrackNet and YOLO☆16Aug 2, 2024Updated last year
- [CVPRW 2026] Official implementation of "BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports"☆35Jun 6, 2026Updated last week
- Extend bert-nmt to context-aware translation.☆11May 24, 2021Updated 5 years ago
- RelayGS: Reconstructing Dynamic Scenes with Large-Scale and Complex Motions via Relay Gaussians☆14Dec 5, 2024Updated last year
- Implementation of "Novel view synthesis with Diffusion models" by Google in JAX distributed☆11May 25, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation☆62Jun 26, 2025Updated 11 months ago
- Source code of the paper "The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields".☆17Mar 3, 2025Updated last year
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆39Apr 17, 2025Updated last year
- ☆10Nov 27, 2024Updated last year
- ☆149Nov 17, 2025Updated 6 months ago
- [IJCV 2025] OmniDrag: Enabling Motion Control for Omnidirectional Image-to-Video Generation☆16Feb 13, 2026Updated 4 months ago
- Code for "Spatial-Temporal Enhanced Transformer Towards Multi-Frame 3D Object Detection" (TPAMI2024)☆13Apr 3, 2025Updated last year
- [ACL 2026 Main] Official repository for paper: OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agents☆47Apr 7, 2026Updated 2 months ago
- Neural network approximators of linear algebra operations on GPU with PyTorch☆17May 30, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- AvatarShield: Visual Reinforcement Learning for Human-Centric Video Forgery Detection☆23Jun 3, 2025Updated last year
- ☆12Nov 28, 2022Updated 3 years ago
- Starter code for working with the YouTube-8M dataset.☆16Jun 9, 2017Updated 9 years ago
- [ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment☆35Feb 14, 2026Updated 4 months ago
- [SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation☆36Aug 5, 2025Updated 10 months ago
- InvTorch: Memory-Efficient Invertible Functions☆17Oct 31, 2024Updated last year
- C++ implementation for 《"GrabCut" — Interactive Foreground Extraction using Iterated Graph Cuts》☆12Jul 25, 2023Updated 2 years ago