【Accepted by ACM MM'25 🎉🎉】MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning
☆193Sep 26, 2025Updated 6 months ago
Alternatives and similar repositories for MS-DETR
Users that are interested in MS-DETR are comparing it to the libraries listed below.
- ☆33Dec 1, 2025Updated 4 months ago
- ☆42Oct 20, 2025Updated 6 months ago
- [ICCV 2025] Factorized Learning for Temporally Grounded Video-Language Models☆24Jan 1, 2026Updated 3 months ago
- Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".☆40Jun 9, 2025Updated 10 months ago
- Claude Code skill for improving website AEO (AI Engine Optimization) and GEO (Generative Engine Optimization) scores — 16 foundational ch…☆843Apr 2, 2026Updated 2 weeks ago
- Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.☆21Apr 11, 2025Updated last year
- Code implementation of paper "MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval (AAAI2025)"☆25Feb 2, 2025Updated last year
- ☆14Oct 30, 2023Updated 2 years ago
- ☆161Apr 14, 2025Updated last year
- ☆243Oct 26, 2025Updated 5 months ago
- (CVPR25) Exploring Contextual Attribute Density in Referring Expression Counting☆20Dec 3, 2025Updated 4 months ago
- 14-stage Fusion Pipeline for LLM token compression — reversible compression, AST-aware code analysis, intelligent content routing. Zero L…☆2,208Apr 1, 2026Updated 2 weeks ago
- Autonomous novel writing AI Agent — agents write, audit, and revise novels with human review gates☆4,522Updated this week
- Forgetting-Aware Curriculum for VLM Self-Evolution — adversarial difficulty scheduling with forgetting detection across 6 VQA skill clust…☆235Apr 1, 2026Updated 2 weeks ago
- Path planning using Q-learning and DQN with experience replay☆34Apr 7, 2026Updated last week
- [AAAI 2024] GMMFormer: Gaussian-Mixture-Model Based Transformer for Efficient Partially Relevant Video Retrieval☆20May 10, 2024Updated last year
- Decoupled Memory Selection for Multi-target Video Segmentation of SAM3☆49Jan 16, 2026Updated 3 months ago
- ☆10Dec 3, 2024Updated last year
- [NeurIPS 2024] OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling.☆31Nov 13, 2025Updated 5 months ago
- ICCV'23 Dual Learning with Dynamic Knowledge Distillation for Partially Relevant Video Retrieval☆19Aug 22, 2025Updated 7 months ago
- code for GuidedNet☆13Feb 16, 2023Updated 3 years ago
- Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models☆122Mar 24, 2026Updated 3 weeks ago
- [EMNLP 2025 Industry] Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning☆36Oct 22, 2025Updated 5 months ago
- Open-source implementation of SeaSeed.ai v1.0 (codename: Clawerse). AI ocean world platform for multi-agent social, tasks and compute col…☆317Feb 15, 2026Updated 2 months ago
- This repo is created to serve simulation and analysis on the data collected in Jiuyang Bai's dissertation research.☆508Mar 2, 2026Updated last month
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆46Aug 14, 2023Updated 2 years ago
- Safety Is All You Need! Meet OctoGuard, the toughest boss for Lobster AI employees — a security governance and audit system designed for …☆102Apr 4, 2026Updated 2 weeks ago
- ☆242Apr 3, 2026Updated 2 weeks ago
- The ODinMJ RGB-T dataset is an object detection RGB-T dataset for mountain jungle scenes.☆30May 29, 2024Updated last year
- ☆50Sep 13, 2024Updated last year
- The first agentic payment network: policy-controlled, gasless, and real money-ready. OmniClaw CLI + Financial Policy Engine let autonomo…☆348Updated this week
- M-SpecGene: Generalized Foundation Model for RGBT Multispectral Vision (ICCV 2025)☆33Nov 19, 2025Updated 5 months ago
- Building a 5GHz WiFi Spoofer with the Realtek RTL8720dn☆70Apr 8, 2026Updated last week
- Model for the manuscript named "Spectral Response Function Guided Deep Optimization-driven Network for Spectral Super-resolution" pbulish…☆16Feb 1, 2021Updated 5 years ago
- [NeurIPS'25] Time-R1: Post-Training Large Vision Language Model for Temporal Video Grounding☆90Dec 14, 2025Updated 4 months ago
- Dataset & Code for ACM Multimedia 2023 paper. "SemanticRT: A Large-Scale Dataset and Method for Robust Semantic Segmentation in Multispec…☆15Apr 14, 2025Updated last year
- Pytorch implementation of the paper 'Gaussian Mixture Proposals with Pull-Push Learning Scheme to Capture Diverse Events for Weakly Super…☆20Jan 19, 2024Updated 2 years ago
- ☆28Mar 12, 2026Updated last month
- RGBD Pretraining code used in DFormer [ICLR 2024]☆21Jul 8, 2025Updated 9 months ago