Samsung / NL-ITILinks

☆12

Alternatives and similar repositories for NL-ITI

Users that are interested in NL-ITI are comparing it to the libraries listed below

Sorting:

epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆78Updated last year
princeton-nlp / LLMBar
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
☆127Updated last year
OpenMOSS / Language-Model-SAEs
For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.
☆136Updated this week
alisawuffles / proxy-tuning
Code associated with Tuning Language Models by Proxy (Liu et al., 2024)
☆114Updated last year
HillZhang1999 / ICD
Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"
☆66Updated last year
WHGTyen / BIG-Bench-Mistake
A dataset of LLM-generated chain-of-thought steps annotated with mistake location.
☆81Updated 11 months ago
lil-lab / icrl
☆24Updated 5 months ago
zankner / CLoud
Critique-out-Loud Reward Models
☆68Updated 9 months ago
GAIR-NLP / scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
☆42Updated last year
TIGER-AI-Lab / MAmmoTH2
Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]
☆145Updated 8 months ago
penguinnnnn / awesome-llm-and-society
Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.
☆49Updated last year
ictnlp / TACS
Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts
☆17Updated 10 months ago
chenzhiling9954 / Critical-Tokens-Matter
☆38Updated last month
pillowsofwind / Course-Correction
[EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"
☆19Updated 9 months ago
swj0419 / detect-pretrain-code
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆228Updated last year
RUCAIBox / Language-Specific-Neurons
☆75Updated 6 months ago
aryopg / mmlu-redux
☆18Updated 8 months ago
lyy1994 / awesome-data-contamination
The Paper List on Data Contamination for Large Language Models Evaluation.
☆95Updated 3 months ago
oneal2000 / MIND
Source code of our paper MIND, ACL 2024 Long Paper
☆43Updated last year
bytedance / BytevalKit-LLM
☆24Updated 2 weeks ago
DAMO-NLP-SG / multilingual_analysis
[NeurIPS 2024] How do Large Language Models Handle Multilingualism?
☆34Updated 8 months ago
OATML / semantic-entropy-probes
☆36Updated 11 months ago
NanshineLoong / Self-Evolving-Benchmark
A framework for evolving and testing question-answering datasets with various models.
☆16Updated last year
declare-lab / red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
☆102Updated last year
AI21Labs / factor
Code and data for the FACTOR paper
☆48Updated last year
chujiezheng / LLM-Extrapolation
Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"
☆74Updated last month
zhu-minjun / PAlign
Personality Alignment of Language Models
☆37Updated 2 weeks ago
DAMO-NLP-SG / contrastive-cot
Contrastive Chain-of-Thought Prompting
☆64Updated last year
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆61Updated 7 months ago
Glaciohound / LM-Steer
Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)
☆121Updated this week