Official Code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation
☆144Oct 30, 2024Updated last year
Alternatives and similar repositories for GPT4Video
Users that are interested in GPT4Video are comparing it to the libraries listed below.
- Improves Llama-2's proficiency in comprehension, generation, and translation of Chinese.☆438Mar 25, 2024Updated 2 years ago
- ☆20Jun 17, 2024Updated last year
- ☆14May 31, 2024Updated last year
- TVsub: DCU-Tencent Chinese-English Dialogue Corpus☆46Feb 14, 2018Updated 8 years ago
- STDFormer: Spatio Temporal Disentanglement Learning for 3D Human Mesh Recovery from Monocular Videos with Transformer☆45Mar 14, 2024Updated 2 years ago
- Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding☆639Dec 10, 2024Updated last year
- ☆12Jan 5, 2024Updated 2 years ago
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback☆77Sep 12, 2024Updated last year
- ☆21Oct 10, 2023Updated 2 years ago
- ☆32Jan 25, 2024Updated 2 years ago
- ☆25Dec 21, 2023Updated 2 years ago
- ☆16Nov 28, 2023Updated 2 years ago
- Feature Decay Algorithms☆11Mar 5, 2014Updated 12 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated last year
- ☆13Dec 18, 2023Updated 2 years ago
- ☆18Dec 29, 2023Updated 2 years ago
- [NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin L…☆57Mar 4, 2024Updated 2 years ago
- ☆16Sep 6, 2024Updated last year
- ☆17Jan 10, 2024Updated 2 years ago
- Official pytorch implementation for SingleInsert☆28Apr 19, 2024Updated 2 years ago
- ☆80Nov 24, 2024Updated last year
- ☆157Oct 31, 2024Updated last year
- ☆14Dec 26, 2023Updated 2 years ago
- LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)☆862Jul 29, 2024Updated last year
- ☆71Mar 3, 2024Updated 2 years ago
- ☆133Feb 13, 2024Updated 2 years ago
- Multilingual Corpus of Web Fiction☆203Jun 28, 2024Updated last year
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024)☆10Aug 26, 2024Updated last year
- ☆30Dec 19, 2023Updated 2 years ago
- ☆86Jan 2, 2024Updated 2 years ago
- ☆50Jun 26, 2025Updated 9 months ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- ☆23Mar 21, 2024Updated 2 years ago
- Project for SNARE benchmark☆11Jun 5, 2024Updated last year
- [CVPR 2024] Official PyTorch implementation of the paper "One For All: Video Conversation is Feasible Without Video Instruction Tuning"☆35Feb 2, 2024Updated 2 years ago
- ☆11Aug 28, 2023Updated 2 years ago
- Google's Gemini implemented with GPT-4 Vision, Whisper and Resemble AI☆26Dec 9, 2023Updated 2 years ago
- ☆18May 29, 2024Updated last year
- ☆194Oct 14, 2024Updated last year