potamides / DaTikZ
☆17 · Updated 3 months ago
Alternatives and similar repositories for DaTikZ
Users who are interested in DaTikZ are comparing it to the repositories listed below.
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o… ☆23 · Updated 3 months ago
- Code for the paper "Harnessing Webpage UIs for Text-Rich Visual Understanding" ☆51 · Updated 6 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning ☆35 · Updated last year
- Reasoning by Communicating with Agents ☆29 · Updated 2 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆102 · Updated 3 months ago
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he… ☆31 · Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073 ☆28 · Updated 11 months ago
- Repository for Skill Set Optimization ☆13 · Updated 11 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?" ☆57 · Updated 8 months ago
- List of papers on Self-Correction of LLMs. ☆73 · Updated 6 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆57 · Updated 9 months ago
- ☆32 · Updated last year
- ☆57 · Updated 9 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆23 · Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning ☆61 · Updated 2 months ago
- Code & Dataset for Paper: "Distill Visual Chart Reasoning Ability from LLMs to MLLMs" ☆54 · Updated 8 months ago
- M4 experiment logbook ☆58 · Updated last year
- ☆63 · Updated 9 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 · Updated this week
- Index of URLs to pdf files all over the internet and scripts ☆24 · Updated 2 years ago
- LL3M: Large Language and Multi-Modal Model in Jax ☆72 · Updated last year
- Code for paper "Accessing higher dimensions for unsupervised word translation" ☆21 · Updated 2 years ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models" ☆42 · Updated 8 months ago
- Code, Data and Red Teaming for ZeroBench ☆46 · Updated last month
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆33 · Updated 3 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆44 · Updated 4 months ago
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs ☆31 · Updated 3 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning" ☆26 · Updated last year
- ☆24 · Updated 4 months ago
- Plancraft is a Minecraft environment and agent suite to test planning capabilities in LLMs ☆15 · Updated 3 weeks ago