albertwy / GPT-4V-EvaluationLinks
Data for evaluating GPT-4V
☆11Updated 2 years ago
Alternatives and similar repositories for GPT-4V-Evaluation
Users that are interested in GPT-4V-Evaluation are comparing it to the libraries listed below
Sorting:
- ☆87Updated last year
- ☆59Updated last year
- ☆14Updated 2 years ago
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆129Updated 2 weeks ago
- my commonly-used tools☆63Updated 11 months ago
- [2025-TMLR] A Survey on the Honesty of Large Language Models☆64Updated last year
- 😎 curated list of awesome LMM hallucinations papers, methods & resources.☆150Updated last year
- Paper, dataset and code list for multimodal dialogue.☆22Updated last year
- [EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.☆83Updated last year
- ☆12Updated last year
- Code for ACL 2024 paper "Soft Self-Consistency Improves Language Model Agents"☆25Updated last year
- ☆25Updated 2 years ago
- MultiInstruct: Improving Multi-Modal Zero-Shot Learning via Instruction Tuning☆134Updated 2 years ago
- ☆61Updated last year
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆132Updated 2 years ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆66Updated 4 months ago
- The official GitHub page for ''Evaluating Object Hallucination in Large Vision-Language Models''☆239Updated 4 months ago
- Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue (ACL 2024)☆24Updated 2 months ago
- ☆87Updated 2 years ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆168Updated last year
- [ICLR 2024] Analyzing and Mitigating Object Hallucination in Large Vision-Language Models☆155Updated last year
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆46Updated 2 years ago
- ☆17Updated 2 years ago
- [NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆33Updated last year
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Updated last year
- MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)☆45Updated 6 months ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆13Updated 8 months ago
- code for "Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization"☆59Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆138Updated last year