Jiaxin-Pei / Prompting-with-Social-Roles
☆31 Updated 3 months ago
Alternatives and similar repositories for Prompting-with-Social-Roles:
Users who are interested in Prompting-with-Social-Roles are comparing it to the repositories listed below:
- Critique-out-Loud Reward Models ☆47 Updated 3 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆124 Updated 10 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 Updated 6 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆53 Updated 10 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 Updated 11 months ago
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆66 Updated 2 weeks ago
- ☆56 Updated 3 months ago
- ☆34 Updated 5 months ago
- Repository for the ACL 2024 conference website ☆17 Updated 3 months ago
- Inspecting and Editing Knowledge Representations in Language Models ☆111 Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆82 Updated 5 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆52 Updated last month
- ☆20 Updated 6 months ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper… ☆113 Updated 3 months ago
- ☆38 Updated 7 months ago
- Official codebase for permutation self-consistency. ☆16 Updated 11 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models" ☆53 Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆41 Updated last month
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆68 Updated last month
- ☆50 Updated 2 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆113 Updated 4 months ago
- Official Repo for ICLR 2024 paper MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback by Xingyao Wang*, Ziha… ☆110 Updated 7 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆129 Updated 2 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A… ☆42 Updated 11 months ago
- ☆64 Updated 11 months ago
- Retrieval-Augmented Generation battle! ☆48 Updated last month
- ☆27 Updated 10 months ago
- ☆38 Updated 3 months ago
- Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration" ☆31 Updated 4 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024) ☆45 Updated 3 months ago