DCDmllm / WorldGPT
WorldGPT: Empowering LLM as Multimodal World Model
☆117 Updated 11 months ago
Alternatives and similar repositories for WorldGPT
Users who are interested in WorldGPT are comparing it to the libraries listed below.
- [ECCV 2024] Efficient Inference of Vision Instruction-Following Models with Elastic Cache ☆42 Updated 11 months ago
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models ☆96 Updated last year
- 🚀 [NeurIPS24] Make Vision Matter in Visual-Question-Answering (VQA)! Introducing NaturalBench, a vision-centric VQA benchmark (NeurIPS'2… ☆84 Updated 3 weeks ago
- ☆31 Updated this week
- (ECCV 2024) Empowering Multimodal Large Language Model as a Powerful Data Generator ☆111 Updated 3 months ago
- GPT4Vis: What Can GPT-4 Do for Zero-shot Visual Recognition? ☆186 Updated last year
- u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model ☆132 Updated 3 months ago
- RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response ☆42 Updated 6 months ago
- ☆67 Updated 4 months ago
- SemiEvol: Semi-supervised Fine-tuning for LLM Adaptation