zjr2000/Untrimmed-Video-Feature-Extractor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zjr2000/Untrimmed-Video-Feature-Extractor)

zjr2000 / Untrimmed-Video-Feature-Extractor

A simple and effective feature extractor for untrimmed videos

☆13

Alternatives and similar repositories for Untrimmed-Video-Feature-Extractor

Users that are interested in Untrimmed-Video-Feature-Extractor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zjr2000 / GVL
View on GitHub
Official implementation for paper Learning Grounded Vision-Language Representation for Versatile Understanding in Untrimmed Videos
☆28Dec 8, 2023Updated 2 years ago
NIneeeeeem / LangDC
View on GitHub
[EMNLP 2025 Oral] Official codebase for Seeing More, Saying More: Lightweight Language Experts are Dynamic Video Token Compressors.
☆18Sep 7, 2025Updated 10 months ago
zjr2000 / SPES
View on GitHub
Official Implementation for paper "Pretraining A Large Language Model using Distributed GPUs: A Memory-Efficient Decentralized Paradigm"
☆23May 8, 2026Updated 2 months ago
zjr2000 / LLMVA-GEBC
View on GitHub
Winner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
☆29Jan 1, 2024Updated 2 years ago
TencentARC / FLM
View on GitHub
Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)
☆31May 15, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zjr2000 / Awesome-Multimodal-Chatbot
View on GitHub
Awesome Multimodal Assistant is a curated list of multimodal chatbots/conversational assistants that utilize various modes of interaction…
☆79Jun 18, 2023Updated 3 years ago
XuMengyaAmy / SwinMLP_TranCAP
View on GitHub
☆13Jun 26, 2022Updated 4 years ago
ttgeng233 / UnAV
View on GitHub
Dense-Localizing Audio-Visual Events in Untrimmed Videos: A Large-Scale Benchmark and Baseline (CVPR 2023)
☆73Jan 4, 2026Updated 6 months ago
mchenwang / PupilOptixLab
View on GitHub
OptiX ray tracing toy framework
☆12Mar 22, 2024Updated 2 years ago
ttengwang / ESGN
View on GitHub
Event Sequence Generation Network
☆14Jun 22, 2021Updated 5 years ago
XuMengyaAmy / ReportDALS
View on GitHub
☆16Nov 19, 2020Updated 5 years ago
ttgeng233 / LongVALE
View on GitHub
LongVALE: Vision-Audio-Language-Event Benchmark Towards Time-Aware Omni-Modal Perception of Long Videos. (CVPR 2025))
☆61Jun 9, 2025Updated last year
ttgeng233 / UniAV
View on GitHub
Unified Audio-Visual Perception for Multi-Task Video Localization
☆33Apr 19, 2024Updated 2 years ago
Ferrum5 / Sorry-Android
View on GitHub
生成为所欲为动图，灵感来自于sorry项目
☆11Mar 28, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nianhuhu / Experiment4
View on GitHub
基于MQTT协议，物联网云平台的智慧路灯管理系统，在PC机上（根据相应的开发技术选取开发环境）进行项目软件的Web开发，采集端的数据采用MQTT.fx进行模拟，数据通过MQTT协议进行传输到服务器，再获取服务器数据，并最终显示在前端应用中。
☆11Jul 6, 2020Updated 6 years ago
RyanLiut / awesome-diverse-captioning
View on GitHub
Some papers about *diverse* image (a few videos) captioning
☆25Apr 4, 2023Updated 3 years ago
liziwl / operating_system_Lab
View on GitHub
CS302, SUSTech
☆11Oct 7, 2019Updated 6 years ago
happywu / CycleContrast
View on GitHub
Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency
☆17Dec 2, 2021Updated 4 years ago
TencentYoutuResearch / ActionDetection-LSTC
View on GitHub
LSTC: Boosting Atomic Action Detection with Long-Short-Term Context
☆10Sep 1, 2022Updated 3 years ago
dingfengshi / TriDet
View on GitHub
[CVPR2023] Code for the paper, TriDet: Temporal Action Detection with Relative Boundary Modeling
☆219Dec 27, 2023Updated 2 years ago
Alvin-Zeng / GCM
View on GitHub
Graph Convolutional Module for Temporal Action Localization in Videos
☆10Jul 4, 2020Updated 6 years ago
amusi / mlhub123
View on GitHub
给国人的机器学习&深度学习网站资源汇总（Machine Learning Resources）
☆17Dec 17, 2022Updated 3 years ago
dingmyu / VRDP
View on GitHub
[NeurIPS 2021] Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language
☆47Apr 11, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
jy0205 / STCAT
View on GitHub
[NeurIPS 2022] Embracing Consistency: A One-Stage Approach for Spatio-Temporal Video Grounding
☆54Mar 5, 2024Updated 2 years ago
SoccerNet / sn-caption
View on GitHub
Repository containing all necessary codes to get started on the SoccerNet Dense Video Captioning challenge.
☆36Apr 12, 2024Updated 2 years ago
franciszchen / SCA-Net
View on GitHub
☆10Oct 7, 2023Updated 2 years ago
Finspire13 / pytorch-i3d-feature-extraction
View on GitHub
Code for I3D Feature Extraction
☆164Aug 7, 2019Updated 6 years ago
lijunlang / IEMCS
View on GitHub
☆15Jun 8, 2018Updated 8 years ago
LuoweiZhou / anet2016-cuhk-feature
View on GitHub
Feature Extraction Toolbox from CUHK&ETHZ&SIAT submission to ActivityNet 2016
☆32Mar 31, 2019Updated 7 years ago
jbistanbul / MiniROAD
View on GitHub
☆42May 7, 2024Updated 2 years ago
gurkirt / TrackAwareActionDetection
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆11Apr 19, 2024Updated 2 years ago
MCG-NJU / RTD-Action
View on GitHub
[ICCV 2021] Relaxed Transformer Decoders for Direct Action Proposal Generation
☆92Apr 5, 2022Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
wenz116 / DRFT
View on GitHub
End-to-end Multi-modal Video Temporal Grounding, NeurIPS 2021
☆18Oct 24, 2021Updated 4 years ago
HumamAlwassel / TSP
View on GitHub
TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)
☆119Sep 16, 2023Updated 2 years ago
TencentARC / ARC-Chapter
View on GitHub
Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries
☆44Nov 19, 2025Updated 8 months ago
JJBOY / C3D-pytorch
View on GitHub
a C3D impelementation using pytorch
☆14Nov 27, 2018Updated 7 years ago
GauravGajbhiye / SCAMET_RSIC
View on GitHub
This is tensorflow 2.2 based SCAMET framework for remote sensing image captioning.
☆13Aug 10, 2023Updated 2 years ago
cleary-lab / CISI
View on GitHub
code for composite in situ imaging (cisi) analysis
☆12Oct 26, 2020Updated 5 years ago
rxtan2 / Koala-video-llm
View on GitHub
☆37Sep 16, 2024Updated last year