Jiaxin-Pei / Prompting-with-Social-Roles
☆28 Updated last month
Related projects
Alternatives and complementary repositories for Prompting-with-Social-Roles
- ☆43 Updated last month
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆40 Updated 4 months ago
- The Prism Alignment Project ☆37 Updated 6 months ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆122 Updated 8 months ago
- Retrieval Augmented Generation Generalized Evaluation Dataset ☆51 Updated this week
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆61 Updated 4 months ago
- Benchmarking library for RAG ☆123 Updated this week
- Scalable Meta-Evaluation of LLMs as Evaluators ☆41 Updated 9 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆78 Updated 3 months ago
- Critique-out-Loud Reward Models ☆37 Updated last month
- Retrieval-Augmented Generation battle! ☆43 Updated last week
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval ☆57 Updated 3 weeks ago
- Evaluating LLMs with fewer examples ☆134 Updated 7 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners ☆111 Updated 2 months ago
- ☆41 Updated 3 weeks ago
- Repository for the ACL 2024 conference website ☆17 Updated last month
- Codebase accompanying the Summary of a Haystack paper. ☆72 Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆115 Updated last week
- Datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆62 Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆71 Updated 6 months ago
- ☆38 Updated 7 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆65 Updated 8 months ago
- Official codebase for permutation self-consistency. ☆16 Updated 9 months ago
- ☆43 Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆74 Updated 10 months ago
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation. ☆74 Updated 10 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆77 Updated 8 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆49 Updated 8 months ago
- A package to generate summaries of long-form text and evaluate the coherence of these summaries. Official package for our ICLR 2024 paper… ☆107 Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆23 Updated 8 months ago