zengxingchen / ChartQA-MLLM
☆43Updated last week
Related projects: ⓘ
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆73Updated 2 months ago
- Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention M…☆95Updated last month
- Codes for Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models☆93Updated last month
- LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture☆115Updated 2 weeks ago
- This repository contains the code for the paper: SirLLM: Streaming Infinite Retentive LLM☆53Updated 3 months ago
- E5-V: Universal Embeddings with Multimodal Large Language Models☆148Updated 2 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆89Updated 4 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆56Updated 6 months ago
- ☆55Updated 3 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024)☆134Updated 3 months ago
- AWM: Agent Workflow Memory☆121Updated last week
- Official PyTorch implementation code for realizing the technical part of Traversal of Layers (TroL) presenting new propagation operation …☆82Updated 2 months ago
- ☆50Updated 2 months ago
- Code associated with the arXiv preprint: "Image, tell me your story!" Predicting the original meta-context of visual misinformation.☆30Updated 3 weeks ago
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆49Updated 4 months ago
- ☆70Updated 6 months ago
- The Official Code Repository for GUI-World.☆33Updated last month
- This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"☆115Updated 3 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 8 months ago
- ☆35Updated last year
- A task generation and model evaluation system.☆51Updated 2 weeks ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents"☆36Updated 5 months ago
- code for Optimus-1☆19Updated last month
- ControlLLM: Augment Language Models with Tools by Searching on Graphs☆184Updated 2 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆56Updated last month
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆131Updated 2 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆55Updated last week
- ☆116Updated 3 months ago
- Official implementation for the paper "LongEmbed: Extending Embedding Models for Long Context Retrieval"☆108Updated 4 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities☆111Updated 2 months ago