stanford-crfm / halie
☆13 Updated last year
Alternatives and similar repositories for halie:
Users who are interested in halie are comparing it to the libraries listed below.
- ☆34 Updated 5 months ago
- Reasoning by Communicating with Agents ☆23 Updated 3 months ago
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents ☆23 Updated 2 years ago
- ☆38 Updated 7 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated 11 months ago
- ☆37 Updated 6 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆14 Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for… ☆22 Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆75 Updated 3 months ago
- ☆27 Updated 2 weeks ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 Updated last year
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022). ☆24 Updated last year
- ☆64 Updated 11 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆90 Updated last year
- ☆46 Updated 6 months ago
- ☆26 Updated 2 years ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models ☆22 Updated 11 months ago
- ☆69 Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions ☆27 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 Updated 6 months ago
- LLMs as Collaboratively Edited Knowledge Bases ☆43 Updated 10 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" ☆53 Updated last year
- InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales ☆64 Updated 2 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆30 Updated last month
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆40 Updated 3 weeks ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024) ☆35 Updated 2 weeks ago
- Large-language Model Evaluation framework with Elo Leaderboard and A-B testing ☆50 Updated 2 months ago
- Code for paper 'Data-Efficient FineTuning' ☆29 Updated last year
- ☆19 Updated 2 months ago
- ☆47 Updated last year