SihengLi99/LLM-Honesty-Survey

[2025-TMLR] A Survey on the Honesty of Large Language Models

☆66

Alternatives and similar repositories for LLM-Honesty-Survey

Users interested in LLM-Honesty-Survey are comparing it to the repositories listed below.

ChartMimic / ChartMimic
[ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation
☆132 · Dec 19, 2025 · Updated 7 months ago
ToolBeHonest / ToolBeHonest
[EMNLP 2024] A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models.
☆22 · Sep 23, 2024 · Updated last year
SihengLi99 / RePO
RePO: Replay-Enhanced Policy Optimization
☆24 · Jun 12, 2025 · Updated last year
TianHongZXY / qaap
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12 · Dec 18, 2023 · Updated 2 years ago
jiyt17 / ReDiff
Codebase of 'From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model'
☆45 · Jun 27, 2026 · Updated last month
Multimedia-Analytics-Laboratory / dpdmd
[ICML 2026] The official code of Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis
☆87 · Jun 2, 2026 · Updated last month
Li-Hyn / LLM_CatastrophicForgetting
Code for LLM_Catastrophic_Forgetting via SAM.
☆11 · Jun 7, 2024 · Updated 2 years ago
qishisuren123 / AnyCap
A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…
☆54 · Jul 24, 2025 · Updated last year
scandukuri / assistant-gate
☆28 · May 29, 2024 · Updated 2 years ago
DavidFanzz / SCMoE
☆29 · May 24, 2024 · Updated 2 years ago
AmourWaltz / UAlign
Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"
☆15 · Mar 25, 2025 · Updated last year
janphilippfranken / sami
Self-Supervised Alignment with Mutual Information
☆20 · May 24, 2024 · Updated 2 years ago
tml-epfl / icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33 · Jan 23, 2025 · Updated last year
TianHongZXY / RLVR-Decomposed
[NeurIPS 2025] Implementation for the paper "The Surprising Effectiveness of Negative Reinforcement in LLM Reasoning"
☆166 · Mar 2, 2026 · Updated 4 months ago
LHRYANG / FSD
Implementation of the LREC-COLING 2024 paper "A Frustratingly Simple Decoding Method for Neural Text Generation"
☆19 · Feb 23, 2024 · Updated 2 years ago
Utaotao / ProFit
☆35 · Jan 20, 2026 · Updated 6 months ago
deeplearning-wisc / picle
Official code for ICML 2024 paper on Persona In-Context Learning (PICLe)
☆28 · Jun 27, 2024 · Updated 2 years ago
congvvc / InstructSeg
[ICCV 2025] Official implementation of "InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models"
☆56 · Feb 10, 2025 · Updated last year
alchemistyzz / PeRL
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30 · Mar 30, 2026 · Updated 3 months ago
AmourWaltz / Awesome-Reliable-LLM
☆193 · Mar 8, 2026 · Updated 4 months ago
congvvc / LaSagnA
Project for "LaSagnA: Language-based Segmentation Assistant for Complex Queries".
☆63 · Apr 29, 2024 · Updated 2 years ago
DAMO-NLP-SG / CMM
✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
☆54 · Jul 11, 2025 · Updated last year
yuli0103 / LayoutDiT
LayoutDiT: Exploring Content-Graphic Balance in Layout Generation with Diffusion Transformer
☆49 · Jan 6, 2026 · Updated 6 months ago
hanshen95 / penalized-bilevel-gradient-descent
An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.
☆19 · Feb 13, 2023 · Updated 3 years ago
TianHongZXY / CoRe
[ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)
☆51 · Dec 15, 2023 · Updated 2 years ago
syncdoth / Chain-of-Hindsight-PyTorch
Unofficial implementation of Chain of Hindsight (https://arxiv.org/abs/2302.02676) using PyTorch and Hugging Face Trainers.
☆11 · Apr 5, 2023 · Updated 3 years ago
holarissun / Prompt-OIRL
Code for the paper "Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning"
☆45 · Mar 20, 2024 · Updated 2 years ago
activatedgeek / calibration-tuning
☆53 · Apr 9, 2025 · Updated last year
LHRYANG / Generalization_of_FT-LLM
Implementation of the NAACL 2024 paper "Unveiling the Generalization Power of Fine-Tuned Large Language Models"
☆11 · Mar 14, 2024 · Updated 2 years ago
SihengLi99 / TextBind
[2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild
☆47 · Sep 19, 2023 · Updated 2 years ago
chenzhiling9954 / Critical-Tokens-Matter
☆48 · May 25, 2025 · Updated last year
DYR1 / MoGU
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆18 · Jan 14, 2025 · Updated last year
haotiansun14 / BBox-Adapter
Lightweight Adapting for Black-Box Large Language Models
☆26 · Feb 15, 2024 · Updated 2 years ago
HanNight / AdaCAD
Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"
☆16 · Mar 2, 2026 · Updated 4 months ago
clownrat6 / Novel_Theft
EPUB parsing and packaging for the light novel library 轻小说文库
☆21 · May 3, 2020 · Updated 6 years ago
ali-vilab / Wan-Move
[NeurIPS 2025] Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance
☆648 · Jan 5, 2026 · Updated 6 months ago
baixianghuang / survey-authorship
Paper list for the paper "Authorship Attribution in the Era of Large Language Models: Problems, Methodologies, and Challenges (SIGKDD Exp…
☆19 · May 25, 2026 · Updated 2 months ago
jiyt17 / Prompt-A-Video
[ICCV 2025] Prompt-A-Video
☆24 · Feb 2, 2025 · Updated last year
causalNLP / amr_llm
This repo explores how AMR can help address tasks that are difficult for LLMs
☆13 · Jan 15, 2024 · Updated 2 years ago