Explore VLM-Eval, a framework for evaluating Video Large Language Models, enhancing your video analysis with cutting-edge AI technology.
☆37Jan 20, 2024Updated 2 years ago
Alternatives and similar repositories for Awesome-Video-LLMs
Users that are interested in Awesome-Video-LLMs are comparing it to the libraries listed below
Sorting:
- ☆10Nov 23, 2023Updated 2 years ago
- [MICCAI-STACOM 2022] This repository is the official implementation of "Learning correspondences of cardiac motion from images using biom…☆13Sep 30, 2022Updated 3 years ago
- The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".☆21Mar 9, 2023Updated 3 years ago
- [PR 2024] TFS-ViT: Token-Level Feature Stylization for Domain Generalization☆25Mar 29, 2023Updated 2 years ago
- The official codebase of FineAction dataset. We will update the data and code of our FineAction.☆22Apr 10, 2025Updated 11 months ago
- FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding. (WACV2025)☆35Apr 17, 2025Updated 10 months ago
- 小飞机翻墙教程☆24Nov 14, 2019Updated 6 years ago
- Data for the MTEB leaderboard☆46Updated this week
- Egocentric Video Understanding Dataset (EVUD)☆33Jul 4, 2024Updated last year
- Code for our EMNLP 2022 paper: Generative Entity Typing with Curriculum Learning.☆13Aug 19, 2023Updated 2 years ago
- Official Code for All-in-One Medical Image Re-Identification (CVPR2025)☆18Jan 11, 2026Updated last month
- Apparel Classification for Indian Ethnic Clothes☆12Feb 10, 2023Updated 3 years ago
- ☆44Oct 20, 2023Updated 2 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 3 months ago
- Code for paper 'MulViMotion: Shape-aware 3D Myocardial Motion Tracking from Multi-View Cardiac MRI'☆12Sep 2, 2022Updated 3 years ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- Official Code for Large-vocabulary forensic pathological analyses via prototypical cross-modal contrastive learning☆16Jul 24, 2025Updated 7 months ago
- Open source IMU based motion tracking glove.☆14Jun 3, 2022Updated 3 years ago
- Implementation for the paper "Unified Multimodal Model with Unlikelihood Training for Visual Dialog"☆13May 12, 2023Updated 2 years ago
- Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification☆10May 30, 2024Updated last year
- Recreating the phase functioned neural network in unreal engine 5☆15May 12, 2024Updated last year
- Adapters Strike Back (CVPR 2024)☆44Jul 24, 2024Updated last year
- VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)☆42Dec 16, 2025Updated 2 months ago
- This project is distributed as a free Unreal Engine Plugin. It consists in a single c++ actor component that handles the playback of anim…☆12Mar 10, 2024Updated last year
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- Build a simple CMD chat interface with llama.cpp and C++☆14Sep 19, 2025Updated 5 months ago
- ☆12Mar 5, 2025Updated last year
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆48Sep 26, 2024Updated last year
- ☆12Sep 19, 2021Updated 4 years ago
- Code and resources for EMNLP 2022 paper on 'Robustness of Fusion-based Multimodal Classifiers to Cross-Modal Content Dilutions'☆10Mar 11, 2024Updated last year
- Managed L2D tool libs. (In Dev)☆12Apr 20, 2019Updated 6 years ago
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- yolov8在hisi3536a推理☆11Dec 15, 2023Updated 2 years ago
- Colour Manga using AI☆14Apr 8, 2025Updated 11 months ago
- ☆10May 12, 2023Updated 2 years ago
- Docker&vLLM官方镜像部署DeepSeek模型,在生产环境中提供类OpenAI接口服务。☆15Jul 17, 2025Updated 7 months ago
- the test code of FIP (Fast inertial poser)☆14Jul 10, 2024Updated last year
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago