vdlad / Remarkable-Robustness-of-LLMs
Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
☆17Updated 9 months ago
Alternatives and similar repositories for Remarkable-Robustness-of-LLMs:
Users that are interested in Remarkable-Robustness-of-LLMs are comparing it to the libraries listed below
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Exploration of automated dataset selection approaches at large scales.☆39Updated last month
- ☆48Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- ☆60Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing☆23Updated 3 months ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales"☆104Updated 6 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Updated last year
- ☆16Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Updated 2 months ago
- Knowledge Unlearning for Large Language Models☆25Updated 3 weeks ago
- a curated list of the role of small models in the LLM era☆100Updated 7 months ago
- Evaluation of neuro-symbolic engines☆35Updated 8 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆70Updated 3 weeks ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆84Updated last month
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆64Updated 3 months ago
- Universal Neurons in GPT2 Language Models☆27Updated 10 months ago
- ☆15Updated 2 weeks ago
- ☆41Updated 3 weeks ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆73Updated 4 months ago
- ☆51Updated last week
- The repository contains code for Adaptive Data Optimization☆23Updated 4 months ago
- Language models scale reliably with over-training and on downstream tasks☆96Updated last year
- ☆78Updated 8 months ago
- NeurIPS 2024 tutorial on LLM Inference☆41Updated 4 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆84Updated last year
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 4 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 11 months ago
- ☆22Updated 2 months ago