fbarez / neuroplasticity
☆14Updated 5 months ago
Related projects: ⓘ
- Few-shot Learning with Auxiliary Data☆26Updated 9 months ago
- ☆27Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆59Updated 10 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆28Updated 6 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆27Updated 10 months ago
- ☆29Updated 2 weeks ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆34Updated 6 months ago
- ☆23Updated last year
- PyTorch implementation for MRL☆17Updated 6 months ago
- ☆29Updated 10 months ago
- ☆49Updated last year
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 2 years ago
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆14Updated 8 months ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆16Updated 6 months ago
- Embedding Recycling for Language models☆38Updated last year
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆34Updated 2 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆15Updated 3 weeks ago
- ☆47Updated 3 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆34Updated 8 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆39Updated 3 weeks ago
- ☆50Updated last month
- Is In-Context Learning Sufficient for Instruction Following in LLMs?☆19Updated 3 months ago
- Universal Neurons in GPT2 Language Models☆25Updated 3 months ago
- [ACL 2023]: Training Trajectories of Language Models Across Scales https://arxiv.org/pdf/2212.09803.pdf☆22Updated 10 months ago
- Sparse and discrete interpretability tool for neural networks☆51Updated 7 months ago
- ☆44Updated 2 months ago
- Evaluation of neuro-symbolic engines☆29Updated last month
- Teaching Models to Express Their Uncertainty in Words☆36Updated 2 years ago