target-benchmark / targetLinks
TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL
☆22Updated this week
Alternatives and similar repositories for target
Users that are interested in target are comparing it to the libraries listed below
Sorting:
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆35Updated last year
- ☆54Updated last year
- ☆20Updated 4 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 5 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Updated 11 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆28Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 9 months ago
- Code and Data for "Language Modeling with Editable External Knowledge"☆34Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting☆27Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 9 months ago
- Exploring limitations of LLM-as-a-judge☆19Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 6 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆47Updated 5 months ago
- ☆20Updated 3 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆45Updated last year
- ☆45Updated last month
- Code repo for MathAgent☆17Updated last year
- ☆57Updated 7 months ago
- ☆22Updated last month
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆38Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆43Updated last year
- Code and Data for "MIRAI: Evaluating LLM Agents for Event Forecasting"☆66Updated last year
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆39Updated 8 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆97Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆33Updated 9 months ago
- Evaluation of neuro-symbolic engines☆38Updated 11 months ago
- Official Code Release for "Training a Generally Curious Agent"☆28Updated 2 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆58Updated 4 months ago