bpwu1 / confidence-regulation-neuronsLinks
Confidence Regulation Neurons in Language Models (NeurIPS 2024)
☆10Updated 5 months ago
Alternatives and similar repositories for confidence-regulation-neurons
Users that are interested in confidence-regulation-neurons are comparing it to the libraries listed below
Sorting:
- ☆20Updated 2 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability☆11Updated 4 months ago
- Exploration of automated dataset selection approaches at large scales.☆46Updated 4 months ago
- ☆12Updated 3 months ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆27Updated 7 months ago
- ☆51Updated 3 months ago
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆25Updated 2 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆38Updated 8 months ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning".☆59Updated last week
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆14Updated 7 months ago
- Repo for Anonymous purpose, pls don't distribute☆10Updated 9 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 5 months ago
- ☆12Updated 5 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆102Updated 3 weeks ago
- ☆18Updated 4 months ago
- ☆14Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆44Updated 3 months ago
- ☆26Updated 3 months ago
- Common tools for data processing☆16Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆35Updated 9 months ago
- Long Context Extension and Generalization in LLMs☆57Updated 9 months ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆33Updated 2 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated 3 months ago
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆32Updated last year
- ☆10Updated last week
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆30Updated 5 months ago
- [NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623☆86Updated 9 months ago
- ☆14Updated last year
- ☆28Updated 8 months ago
- [ICLR 2024] Unveiling the Pitfalls of Knowledge Editing for Large Language Models☆22Updated last year