vdlad / Remarkable-Robustness-of-LLMs
Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
☆16 · Updated 6 months ago
Alternatives and similar repositories for Remarkable-Robustness-of-LLMs:
Users interested in Remarkable-Robustness-of-LLMs are comparing it to the libraries listed below
- ☆23 · Updated last month
- ☆67 · Updated 5 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing ☆28 · Updated 2 months ago
- ☆48 · Updated 11 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting. ☆35 · Updated last year
- Code for Adaptive Data Optimization ☆21 · Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆53 · Updated 4 months ago
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆31 · Updated 2 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (arXiv 2401.01335) ☆29 · Updated 10 months ago
- Monet: Mixture of Monosemantic Experts for Transformers ☆43 · Updated this week
- ☆46 · Updated 2 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 · Updated 11 months ago
- Evaluation of neuro-symbolic engines ☆34 · Updated 5 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024) ☆40 · Updated 3 weeks ago
- Codebase accompanying the Summary of a Haystack paper. ☆75 · Updated 3 months ago
- Minimum Description Length probing for neural network representations ☆18 · Updated last week
- Finding semantically meaningful and accurate prompts. ☆46 · Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆68 · Updated last month
- Public code repo for the paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆97 · Updated 3 months ago
- Repository for the CONFLARE (CONformal LArge language model REtrieval) Python package. ☆17 · Updated 9 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment ☆38 · Updated last year
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers" ☆95 · Updated 9 months ago
- ☆39 · Updated 3 years ago
- Learning to route instances for Human vs AI Feedback ☆16 · Updated this week
- TARGET: a benchmark for evaluating Table Retrieval for Generative Tasks such as Fact Verification and Text-to-SQL ☆17 · Updated this week
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆64 · Updated 7 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆42 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆26 · Updated 7 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆39 · Updated 6 months ago
- ☆46 · Updated 6 months ago