target-benchmark / target
TARGET is a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL
☆18Updated this week
Alternatives and similar repositories for target:
Users that are interested in target are comparing it to the libraries listed below
- ☆39Updated 6 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated 3 weeks ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆45Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆43Updated last week
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆32Updated last year
- ☆48Updated 8 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆44Updated last month
- ☆19Updated 3 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆70Updated 3 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆43Updated this week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset☆14Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing☆21Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆54Updated 6 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆52Updated 5 months ago
- ☆15Updated 7 months ago
- Code for Benchmarking Language Model Agents for Data-Driven Science☆23Updated 4 months ago
- ☆60Updated 3 weeks ago
- PyTorch implementation for MRL☆18Updated last year
- RuleRAG: Rule-guided Retrieval-Augmented Generation with Language Models for Question Answering☆18Updated 3 months ago
- ☆23Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 5 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Code and Data for "Language Modeling with Editable External Knowledge"☆31Updated 8 months ago