Jiaxin-Pei / Prompting-with-Social-RolesLinks
☆43Updated last year
Alternatives and similar repositories for Prompting-with-Social-Roles
Users that are interested in Prompting-with-Social-Roles are comparing it to the libraries listed below
Sorting:
- ☆59Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- NAACL 2024. Code & Dataset for "🌁 Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake…☆45Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆102Updated last year
- Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"☆142Updated 3 months ago
- Code accompanying "How I learned to start worrying about prompt formatting".☆113Updated 7 months ago
- ☆108Updated last year
- Evaluating LLMs with fewer examples☆170Updated last year
- ☆49Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆145Updated last year
- A package dedicated for running benchmark agreement testing☆18Updated 3 months ago
- Repository for the ACL 2024 conference website☆18Updated 11 months ago
- ☆75Updated last year
- Learning to route instances for Human vs AI Feedback (ACL Main '25)☆26Updated 5 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆58Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- ☆55Updated last year
- ☆123Updated last week
- a curated list of the role of small models in the LLM era☆111Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Code for Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks (WWW 2024))☆58Updated last month
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆47Updated last year
- ☆38Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆49Updated last year
- Critique-out-Loud Reward Models☆70Updated last year
- Framework and toolkits for building and evaluating collaborative agents that can work together with humans.☆117Updated last month
- Fact-Checking the Output of Generative Large Language Models in both Annotation and Evaluation.☆110Updated 2 years ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆65Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Updated last year