BugMakerzzz / toxic_cot

☆12

Alternatives and similar repositories for toxic_cot

Users who are interested in toxic_cot are comparing it to the repositories listed below.

ZNLP / Language-Imbalance-Driven-Rewarding
[ICLR 2025] Language Imbalance Driven Rewarding for Multilingual Self-improving
☆24 · Updated 2 months ago
zepingyu0512 / neuron-attribution
Code for the EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆47 · Updated last year
D2I-ai / eigenscore
☆38 · Updated 11 months ago
RUCAIBox / HaluEval-2.0
☆47 · Updated last year
lancopku / label-words-are-anchors
Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning
☆165 · Updated last year
nusnlp / FSPO
Official code for our paper "Reasoning Models Hallucinate More: Factuality-Aware Reinforcement Learning for Large Reasoning Models"
☆19 · Updated 3 weeks ago
xhan77 / context-aware-decoding
☆53 · Updated last year
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆121 · Updated last year
HowieHwong / DataGen
[ICLR'25] DataGen: Unified Synthetic Dataset Generation via Large Language Models
☆64 · Updated 8 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆148 · Updated last year
AmourWaltz / Reliable-LLM
☆170 · Updated last year
zhenyu-02 / LogitLens4LLMs
A versatile toolkit for applying Logit Lens to modern large language models (LLMs). Currently supports Llama-3.1-8B and Qwen-2.5-7B, enab…
☆127 · Updated 3 months ago
Jeryi-Sun / ReDEeP-ICLR
Implementation of the paper "ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability"
☆50 · Updated 5 months ago
Alsace08 / Chain-of-Embedding
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
☆83 · Updated 11 months ago
iie-ycx / DEER
This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.
☆177 · Updated 4 months ago
RUCAIBox / Language-Specific-Neurons
☆85 · Updated 11 months ago
pkunlp-icler / IKE
☆25 · Updated 2 years ago
sail-sg / sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
☆136 · Updated last year
oneal2000 / MIND
Source code of our paper MIND, ACL 2024 Long Paper
☆56 · Updated last week
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆67 · Updated 11 months ago
hahahawu / Long-to-Short-via-Model-Merging
Model merging is a highly efficient approach for long-to-short reasoning.
☆89 · Updated last month
mlwu22 / RED
Implementation code for the ACL 2024 paper: Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆14 · Updated last year
Hongcheng-Gao / Awesome-Long2short-on-LRMs
Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…
☆256 · Updated 3 months ago
circle-hit / SAPT
Code for ACL 2024 accepted paper titled "SAPT: A Shared Attention Framework for Parameter-Efficient Continual Learning of Large Language …
☆36 · Updated 10 months ago
YJiangcm / LTE
[ACL 2024] Learning to Edit: Aligning LLMs with Knowledge Editing
☆36 · Updated last year
getao / icae
The repo for In-context Autoencoder
☆149 · Updated last year
hkust-nlp / Activation_Decoding
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)
☆62 · Updated last year
LightChen233 / reasoning-boundary
☆69 · Updated 5 months ago
XMUDeepLIT / QGC
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆18 · Updated last year
Blueyee / Efficient-CoT-LRMs
Chain of Thought (CoT) is so hot! So long! We need a short reasoning process!
☆70 · Updated 7 months ago