WisdomShell / FreeEval
☆15Updated 7 months ago
Alternatives and similar repositories for FreeEval:
Users that are interested in FreeEval are comparing it to the libraries listed below
- [ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models☆36Updated 8 months ago
- [EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"☆110Updated 6 months ago
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆171Updated this week
- LLM hallucination paper list☆312Updated last year
- ☆19Updated 10 months ago
- ☆52Updated last month
- [COLM'24] Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration☆25Updated 5 months ago
- A method of ensemble learning for heterogeneous large language models.☆42Updated 7 months ago
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆89Updated 10 months ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining.☆132Updated last year
- ☆14Updated last year
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆124Updated 8 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning☆162Updated last year
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆19Updated 5 months ago
- LLM Unlearning☆151Updated last year
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆116Updated 4 months ago
- A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…☆65Updated last month
- The repository for paper <Evaluating Open-QA Evaluation>☆24Updated 11 months ago
- Large Language Models Meet NL2Code: A Survey☆36Updated 4 months ago
- A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.☆58Updated 2 months ago
- ☆32Updated 5 months ago
- Must-read Papers on Large Language Model (LLM) Continual Learning☆141Updated last year
- paper list on reasoning in NLP☆185Updated last year
- Generative Judge for Evaluating Alignment☆232Updated last year
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆23Updated 2 months ago
- ☆79Updated last week
- Codes for the EMNLP 2023 Findings paper "Self-Polish: Enhance Reasoning in Large Language Models via Problem Refining" by Zhiheng Xi, Sen…☆30Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆107Updated last year
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆51Updated 7 months ago
- ☆81Updated 3 months ago