vdlad / Remarkable-Robustness-of-LLMs
Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
☆17 · Updated 9 months ago
Alternatives and similar repositories for Remarkable-Robustness-of-LLMs:
Users who are interested in Remarkable-Robustness-of-LLMs are comparing it to the repositories listed below:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 · Updated 7 months ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref… ☆47 · Updated last month
- ☆48 · Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆42 · Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆22 · Updated 2 months ago
- ☆24 · Updated 6 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆67 · Updated 9 months ago
- ☆20 · Updated last month
- Exploration of automated dataset selection approaches at large scales. ☆35 · Updated last month
- Public code repo for paper "SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales" ☆103 · Updated 6 months ago
- ☆67 · Updated 7 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆62 · Updated last week
- NeurIPS 2024 tutorial on LLM Inference ☆39 · Updated 3 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs ☆52 · Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆28 · Updated last month
- ☆19 · Updated 4 months ago
- ☆22 · Updated 3 months ago
- Functional Benchmarks and the Reasoning Gap ☆84 · Updated 6 months ago
- ☆60 · Updated 11 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆33 · Updated last year
- ☆20 · Updated 10 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting. ☆35 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆76 · Updated 6 months ago
- Knowledge Unlearning for Large Language Models ☆25 · Updated this week
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆24 · Updated 5 months ago
- Evaluation of neuro-symbolic engines ☆35 · Updated 8 months ago
- PyTorch implementation for MRL ☆18 · Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions ☆42 · Updated 9 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆82 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆46 · Updated last year