katiekang1998 / llm_hallucinations
☆11Updated 3 months ago
Related projects: ⓘ
- ☆21Updated 4 months ago
- ☆32Updated 5 months ago
- ☆23Updated last year
- ☆24Updated 4 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆61Updated last year
- Dataset for Unified Editing, EMNLP 2023. This is a model editing dataset where edits are natural language phrases.☆22Updated 2 weeks ago
- ☆77Updated last year
- trending projects & awesome papers about data-centric llm studies.☆14Updated last month
- BeHonest: Benchmarking Honesty in Large Language Models☆27Updated last month
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆38Updated 2 months ago
- Methods and evaluation for aligning language models temporally☆24Updated 6 months ago
- The information of NLP PhD application in the world.☆34Updated 3 weeks ago
- Official codebase for “In-Context Learning with Many Demonstration Examples”☆16Updated last year
- [ACL 2023] Learning Multi-step Reasoning by Solving Arithmetic Tasks. https://arxiv.org/abs/2306.01707☆22Updated last year
- [EMNLP 2022] TaCube: Pre-computing Data Cubes for Answering Numerical-Reasoning Questions over Tabular Data☆16Updated last year
- A Portal Site for Structured Knowledge Grounding(SKG) Resources.☆9Updated last year
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆30Updated 2 months ago
- ☆35Updated last year
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆17Updated last year
- Constrained Decoding Project☆17Updated 10 months ago
- The official repository for the paper "From Zero to Hero: Examining the Power of Symbolic Tasks in Instruction Tuning".☆61Updated last year
- ☆32Updated last year
- ☆70Updated 10 months ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models"☆54Updated 8 months ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Updated last year
- Safety-J: Evaluating Safety with Critique☆13Updated last month
- ☆26Updated last year
- The official implementation of "ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization…☆11Updated 7 months ago
- ☆13Updated 6 months ago