potamides / DaTikZ
☆23Updated 10 months ago
Alternatives and similar repositories for DaTikZ
Users who are interested in DaTikZ are comparing it to the repositories listed below
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆107Updated 10 months ago
- List of papers on Self-Correction of LLMs.☆80Updated last year
- Challenge LLMs to Reason About Reasoning: A Benchmark to Unveil Cognitive Depth in LLMs☆51Updated last year
- Code for Paper: Harnessing Webpage UIs For Text Rich Visual Understanding☆53Updated last year
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆43Updated last year
- Multimodal language model benchmark, featuring challenging examples☆183Updated last year
- Pretraining Efficiently on S2ORC!☆179Updated last year
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Evaluating LLMs with fewer examples☆169Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆95Updated 11 months ago
- ☆150Updated 2 years ago
- ☆49Updated 2 years ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆82Updated 2 years ago
- ☆161Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- ☆39Updated last year
- This is the official repository for Inheritune.☆120Updated 11 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Updated 6 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- M4 experiment logbook☆58Updated 2 years ago
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆85Updated last year
- Language models scale reliably with over-training and on downstream tasks☆99Updated last year
- The dataset and code for paper: TheoremQA: A Theorem-driven Question Answering dataset☆160Updated last year
- ☆52Updated last year
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆147Updated last year
- ☆71Updated last year
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES"☆118Updated 7 months ago
- ☆63Updated last year
- LL3M: Large Language and Multi-Modal Model in Jax☆74Updated last year