[2025-TMLR] A Survey on the Honesty of Large Language Models
☆64Dec 8, 2024Updated last year
Alternatives and similar repositories for LLM-Honesty-Survey
Users who are interested in LLM-Honesty-Survey are comparing it to the repositories listed below
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆131Dec 19, 2025Updated 2 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆72Nov 24, 2024Updated last year
- [EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.☆21Sep 23, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Code for LLM_Catastrophic_Forgetting via SAM.☆11Jun 7, 2024Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆30Mar 5, 2024Updated 2 years ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 7 months ago
- [NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"☆161Mar 2, 2026Updated last week
- Lightweight Adapting for Black-Box Large Language Models☆25Feb 15, 2024Updated 2 years ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆52Jul 11, 2025Updated 7 months ago
- ☆28May 29, 2024Updated last year
- Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using pytorch and huggingface Trainers.☆11Apr 5, 2023Updated 2 years ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year
- This repo explores how AMR can address tasks difficult for LLMs☆13Jan 15, 2024Updated 2 years ago
- Official implementation of our NeurIPS 2024 paper "Boundary Matters: A Bi-Level Active Finetuning Method"☆14Feb 11, 2025Updated last year
- ☆29May 24, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"☆17Mar 2, 2026Updated last week
- A framework to train language models to learn invariant representations.☆14Jan 24, 2022Updated 4 years ago
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year
- Continuously tracking ChatGPT-related technical resources and industry developments.☆11Apr 24, 2023Updated 2 years ago
- [ICCV 2025] Prompt-A-Video☆22Feb 2, 2025Updated last year
- Code for paper "ProgGen: Generating Named Entity Recognition Datasets Step-by-step with Self-Reflexive Large Language Models"☆17Mar 29, 2024Updated last year
- Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'☆44Jan 19, 2026Updated last month
- Synthesizing realistic and diverse text-datasets from augmented LLMs☆16Jan 26, 2026Updated last month
- LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer☆49Jan 6, 2026Updated 2 months ago
- ☆47May 25, 2025Updated 9 months ago
- [NeurIPS'24 LanGame workshop] On The Planning Abilities of OpenAI's o1 Models: Feasibility, Optimality, and Generalizability☆42Jul 7, 2025Updated 8 months ago
- Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…☆18Dec 23, 2024Updated last year
- Uncertainty-Aware Reliable Text Classification (KDD 2021)☆18Oct 4, 2022Updated 3 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- Code release for "TempLM: Distilling Language Models into Template-Based Generators"☆14Jul 21, 2022Updated 3 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆79Dec 27, 2025Updated 2 months ago
- [NAACL 2022] This is the code repo for our paper `ACTUNE: Uncertainty-based Active Self-Training for Active Fine-Tuning of Pretrained Lan…☆15Nov 16, 2022Updated 3 years ago
- Social-AI papers across computing communities, courses, and dissertations.☆21Jun 10, 2025Updated 8 months ago
- Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"☆44Mar 20, 2024Updated last year
- Uncertainty quantification for in-context learning of large language models☆15Apr 1, 2024Updated last year