lyt719 / LLM-evaluation-datasets
☆18Updated last year
Alternatives and similar repositories for LLM-evaluation-datasets:
Users that are interested in LLM-evaluation-datasets are comparing it to the libraries listed below
- Controllable Text Generation for Large Language Models: A Survey☆157Updated 5 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆104Updated 5 months ago
- Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation☆45Updated last month
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆158Updated 3 months ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Updated last year
- A collection for math word problem (MWP) works, including datasets, algorithms and so on.☆40Updated 8 months ago
- notes for Multi-hop Reading Comprehension and open-domain question answering☆85Updated 2 years ago
- ☆38Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆161Updated last year
- Large Language Models(LLMs) of Code☆18Updated last year
- ☆14Updated 10 months ago
- ☆24Updated 11 months ago
- Code and data for "ConflictBank: A Benchmark for Evaluating the Influence of Knowledge Conflicts in LLM" (NeurIPS 2024 Track Datasets and…☆36Updated 4 months ago
- ☆80Updated last year
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆41Updated 7 months ago
- ☆24Updated last month
- ☆130Updated 10 months ago
- ☆21Updated last year
- [Preprint] A Neural-Symbolic Self-Training Framework☆102Updated 6 months ago
- Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]☆190Updated 7 months ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆39Updated 3 months ago
- Repository for the paper "Cognitive Mirage: A Review of Hallucinations in Large Language Models"☆47Updated last year
- Repo for ACL2023 paper "Plug-and-Play Knowledge Injection for Pre-trained Language Models"☆60Updated 10 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆68Updated last month
- [ACL 2024] Making Long-Context Language Models Better Multi-Hop Reasoners☆16Updated 8 months ago
- Official Repo of paper "KnowCoder: Coding Structured Knowledge into LLMs for Universal Information Extraction". In the paper, we propose …☆66Updated 6 months ago
- EMNLP'2023: Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration☆36Updated 11 months ago
- ☆37Updated 5 months ago
- The awesome agents in the era of large language models☆59Updated last year
- A trend starts from "Chain of Thought Prompting Elicits Reasoning in Large Language Models".☆41Updated last year