Codebase for the ICCV 2025 paper "One Trajectory, One Token: Grounded Video Tokenization via Panoptic Sub-object Trajectory"
☆117Aug 13, 2025Updated 7 months ago
Alternatives and similar repositories for trajvit
Users who are interested in trajvit are comparing it to the libraries listed below.
- The official code for Improving Multimodal Learning via Imbalanced Learning☆50Updated this week
- ☆33Aug 25, 2025Updated 7 months ago
- ☆33Nov 26, 2025Updated 4 months ago
- (ICCV 2025) OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation☆16Oct 11, 2025Updated 5 months ago
- ☆169Jul 11, 2025Updated 8 months ago
- m&ms: A Benchmark to Evaluate Tool-Use for multi-step multi-modal tasks☆45Sep 26, 2024Updated last year
- ☆19Sep 2, 2025Updated 6 months ago
- [ICCV 2025] A Benchmark for Multi-Step Reasoning in Long Narrative Videos☆25Aug 8, 2025Updated 7 months ago
- [ICCV 23] A Simple Vision Transformer for Weakly Semi-supervised 3D Object Detection☆13Apr 12, 2024Updated last year
- This is the official repository of Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities☆39Updated this week
- Repository for the PyOpenGL Project (LaunchPad Mirror)☆16Jul 9, 2019Updated 6 years ago
- Douyin e-commerce: basic product data and sales data for the top-X products by industry, with mini-program and API lookup☆45Mar 7, 2025Updated last year
- ☆17Jan 26, 2025Updated last year
- the project of VR☆11Jul 3, 2019Updated 6 years ago
- Fast CUDA implementation of (differentiable) otam for PyTorch using Numba☆16Jun 21, 2021Updated 4 years ago
- 🎓 [NWEN301] Pintos - Alarm Clock☆808Jul 20, 2025Updated 8 months ago
- [NeurIPS'24 spotlight] Official Repo for AcoustiX used in Acoustic volume rendering for neural impulse response fields.☆37Dec 15, 2025Updated 3 months ago
- [CVPR 2026] Variation-aware Vision Token Dropping for Faster Large Vision-Language Models☆28Mar 18, 2026Updated last week
- ☆292Updated this week
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- csghub-server is the backend server for CSGHub which helps users manage datasets, models, and also run Model Inference, Finetune and App…☆1,415Updated this week
- ☆29Jun 28, 2025Updated 8 months ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆21Jul 16, 2025Updated 8 months ago
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- Paper materials from the Mo-AI Club machine learning developer salon☆23Jul 31, 2024Updated last year
- [ICML'25] The Price of Freedom: Exploring Expressivity and Runtime Tradeoffs in Equivariant Tensor Products☆18Jul 16, 2025Updated 8 months ago
- Lets you enter a meta world of The Palace Museum☆20Aug 30, 2025Updated 6 months ago
- Official implementation for the paper: InterTrack☆42Oct 2, 2025Updated 5 months ago
- EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video (ICRA2025)☆280Dec 4, 2025Updated 3 months ago
- Enterprise-grade, commercial-friendly agentic workflow platform for building next-generation SuperAgents.☆10,426Mar 20, 2026Updated last week
- Labeled Movie Trailer Dataset☆16Mar 23, 2018Updated 8 years ago
- Implementation of NIPS2023: Unleashing the Full Potential of Product Quantization for Large-Scale Image Retrieval☆11Nov 12, 2024Updated last year
- 53AI Hub is an open-source AI portal, which enables you to quickly build an operational-level AI portal to launch and operate AI agents, p…☆9,074Mar 4, 2026Updated 3 weeks ago
- Temporal Compact Bilinear Pooling (TCBP)☆11May 27, 2020Updated 5 years ago
- ☆29Jun 5, 2025Updated 9 months ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- ☆13Nov 10, 2025Updated 4 months ago
- ☆13Jul 20, 2024Updated last year
- Assignments from 16-825 Learning for 3D Vision at Carnegie Mellon University☆13Apr 5, 2023Updated 2 years ago