uiuc-kang-lab / ELT-BenchLinks
☆17Updated last week
Alternatives and similar repositories for ELT-Bench
Users that are interested in ELT-Bench are comparing it to the libraries listed below
Sorting:
- Combining Base and Instruction-Tuned Language Models for Better Synthetic Data Generation☆33Updated 4 months ago
- Verifiers for LLM Reinforcement Learning☆61Updated 2 months ago
- ☆35Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆55Updated 4 months ago
- ☆24Updated 9 months ago
- ☆65Updated 2 months ago
- A method for steering llms to better follow instructions☆46Updated 3 weeks ago
- ☆20Updated 3 months ago
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆23Updated 2 months ago
- ☆17Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆68Updated 3 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 8 months ago
- ☆20Updated 2 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆18Updated last month
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIX- TURE OF THOUGHT REPRESENTATIONS FOR COST- EFFICIENT REA…☆23Updated last year
- Process Reward Models That Think☆41Updated last month
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- ☆36Updated 2 months ago
- ☆45Updated last month
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆26Updated 6 months ago
- ☆18Updated 3 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models☆22Updated 2 months ago
- Large language models for document ranking.☆59Updated last month
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆33Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆45Updated 2 months ago
- ☆32Updated 7 months ago
- ☆37Updated 8 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆71Updated this week
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆26Updated last year