jiho283 / Simulator
Official repository of DialSim
☆24Updated 2 months ago
Alternatives and similar repositories for Simulator
Users who are interested in Simulator are comparing it to the repositories listed below
- Official code for EMNLP 2024 paper "Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models"☆37Updated last year
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆27Updated last year
- ☆20Updated 9 months ago
- ☆53Updated last year
- When Reasoning Meets Its Laws☆33Updated this week
- Code and Dataset for Learning to Solve Complex Tasks by Talking to Agents☆24Updated 3 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Updated 2 years ago
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering☆31Updated 3 months ago
- ☆14Updated last year
- ☆52Updated 7 months ago
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea…☆76Updated last year
- ☆38Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆66Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…☆31Updated last year
- This repo explores how to use AMR to address tasks that are difficult for LLMs☆13Updated last year
- Exploring limitations of LLM-as-a-judge☆19Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Updated last year
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆38Updated last year
- Official code and dataset for our EMNLP 2024 Findings paper: Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Kn…☆19Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆40Updated 9 months ago
- ☆49Updated 9 months ago
- R3: Robust Rubric-Agnostic Reward Models☆20Updated 5 months ago
- [EACL 2023] CoTEVer: Chain of Thought Prompting Annotation Toolkit for Explanation Verification☆41Updated 2 years ago
- OLAPH: Improving Factuality in Biomedical Long-form Question Answering☆37Updated last year
- [ACL 2025] Analyzing LLMs' Multilingual Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations☆14Updated 2 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆44Updated 10 months ago
- Measuring and Controlling Persona Drift in Language Model Dialogs☆20Updated last year
- ☆39Updated 7 months ago
- [EMNLP 2024 Findings] Benchmarking Language Model Agents for Data-Driven Science☆34Updated last year
- Code and data from the paper 'Human Feedback is not Gold Standard'☆19Updated last year