zjr2000 / Untrimmed-Video-Feature-ExtractorView external linksLinks
A simple and effective feature extractor for untrimmed videos
☆13Sep 1, 2022Updated 3 years ago
Alternatives and similar repositories for Untrimmed-Video-Feature-Extractor
Users that are interested in Untrimmed-Video-Feature-Extractor are comparing it to the libraries listed below
Sorting:
- Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos☆28Dec 8, 2023Updated 2 years ago
- Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)☆29Jan 1, 2024Updated 2 years ago
- Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)☆32May 15, 2023Updated 2 years ago
- ☆13Nov 19, 2020Updated 5 years ago
- ☆13Jun 26, 2022Updated 3 years ago
- LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))☆56Jun 9, 2025Updated 8 months ago
- Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)☆70Jan 4, 2026Updated last month
- Event Sequence Generation Network☆14Jun 22, 2021Updated 4 years ago
- Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction…☆78Jun 18, 2023Updated 2 years ago
- [AAAI 2025] Empowering LLMs with Pseudo-Untrimmed Videos for Audio-Visual Temporal Understanding☆34Mar 21, 2025Updated 10 months ago
- Unified Audio-Visual Perception for Multi-Task Video Localization☆30Apr 19, 2024Updated last year
- Progress-Aware Online Action Segmentation for Egocentric Procedural Task Videos☆28Sep 9, 2024Updated last year
- TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs☆103Feb 2, 2026Updated last week
- Some papers about *diverse* image (a few videos) captioning☆26Apr 4, 2023Updated 2 years ago
- Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.☆33Apr 12, 2024Updated last year
- [CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling☆205Dec 27, 2023Updated 2 years ago
- SurgLaVi: Large-Scale Hierarchical Datasets for Surgical Vision–Language Representation Learning☆23Feb 2, 2026Updated last week
- 如何做好科研写好科研文章?发顶刊顶会总结☆85Jul 17, 2023Updated 2 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 8 months ago
- Feature Extraction Toolbox from CUHKÐZ&SIAT submission to ActivityNet 2016☆32Mar 31, 2019Updated 6 years ago
- Code for I3D Feature Extraction☆160Aug 7, 2019Updated 6 years ago
- ☆40May 7, 2024Updated last year
- Graph Convolutional Module for Temporal Action Localization in Videos☆10Jul 4, 2020Updated 5 years ago
- CVPR2022 update everyday!☆11Apr 12, 2022Updated 3 years ago
- ☆19Jul 22, 2025Updated 6 months ago
- This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.☆12Aug 10, 2023Updated 2 years ago
- ☆12Jul 22, 2024Updated last year
- [ICML 2025 Oral] This is the official repository of the paper "What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensi…☆20Jun 12, 2025Updated 8 months ago
- LSTC: Boosting Atomic Action Detection with Long-Short-Term Context☆10Sep 1, 2022Updated 3 years ago
- Repo for the walking robot's vision based navigation code☆10Jun 6, 2023Updated 2 years ago
- 基于MQTT协议,物联网云平台的智慧路灯管理系统,在PC机上(根据相应的开发技术选取开发环境)进行项目软件的Web开发,采集端的数据采用MQTT.fx进行模拟,数据通过MQTT协议进行传输到服务器,再获取服务器数据,并最终显示在前端应用中。☆10Jul 6, 2020Updated 5 years ago
- ☆13Nov 21, 2025Updated 2 months ago
- ☆10Oct 7, 2023Updated 2 years ago
- [NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language☆47Apr 11, 2023Updated 2 years ago
- Official repo for CVPR 2022 (Oral) paper: Revisiting the "Video" in Video-Language Understanding. Contains code for the Atemporal Probe (…☆51May 29, 2024Updated last year
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆50Jun 21, 2024Updated last year
- [CVPR 2024] KEPP: Why Not Use Your Textbook? Knowledge-Enhanced Procedure Planning of Instructional Videos☆12Sep 24, 2024Updated last year
- Modular and simple vision language navigation framework☆12Aug 16, 2021Updated 4 years ago
- NICE challenge 2023 Track2 2nd result(total 4th) (CVPR 2023) sponsered by LG AI/Shutterstock/SNU☆11Jun 22, 2023Updated 2 years ago