code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning
☆13 Nov 17, 2024 Updated last year
Alternatives and similar repositories for in-context-mechanism
Users who are interested in in-context-mechanism are comparing it to the repositories listed below
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis ☆12 Nov 17, 2024 Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models ☆51 Nov 17, 2024 Updated last year
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020. ☆30 Jun 28, 2020 Updated 5 years ago
- Bayesian Low-Rank Adaptation for Large Language Models ☆37 Jun 22, 2024 Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling ☆36 Jul 12, 2024 Updated last year
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context ☆41 Aug 16, 2024 Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models. ☆26 Oct 20, 2025 Updated 4 months ago
- Geometric Certifications of Neural Nets ☆42 Nov 22, 2022 Updated 3 years ago
- ☆105 Oct 30, 2023 Updated 2 years ago
- Learning from Indirect Observations ☆11 Jul 16, 2021 Updated 4 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena… ☆19 May 28, 2025 Updated 9 months ago
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023) ☆11 Nov 30, 2023 Updated 2 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2… ☆14 Dec 7, 2024 Updated last year
- ☆12 Jun 18, 2024 Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion" ☆14 Mar 28, 2024 Updated last year
- ☆15 Nov 22, 2023 Updated 2 years ago
- ☆10 Feb 12, 2024 Updated 2 years ago
- Bayesian scaling laws for in-context learning. ☆15 Mar 12, 2025 Updated 11 months ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)" ☆11 May 23, 2023 Updated 2 years ago
- Efficient joint input optimization and inference with DEQ ☆10 Nov 25, 2021 Updated 4 years ago
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation. ☆11 Oct 20, 2020 Updated 5 years ago
- Can VLMs understand students' hand-drawn math work? ☆16 Jan 20, 2026 Updated last month
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs" ☆19 Jul 11, 2024 Updated last year
- ☆14 May 21, 2024 Updated last year
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024) ☆16 Sep 28, 2024 Updated last year
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization" ☆10 Nov 21, 2024 Updated last year
- ☆40 Jan 16, 2026 Updated last month
- PyTorch-based text classification for imbalanced data ☆12 Dec 26, 2021 Updated 4 years ago
- ☆13 Jun 22, 2025 Updated 8 months ago
- Adaptive-binning for evaluation of confidence calibration ☆12 Jul 28, 2019 Updated 6 years ago
- Fine-tuning Quantized Neural Networks with Zeroth-order Optimization ☆16 Sep 17, 2025 Updated 5 months ago
- Implementation Code of TextHoaxer ☆15 Aug 21, 2022 Updated 3 years ago
- Repository for the code of the paper "Neural Networks Regularization Through Class-wise Invariant Representation Learning". ☆12 Oct 1, 2017 Updated 8 years ago
- Implementation of various generative models ☆14 Oct 1, 2018 Updated 7 years ago
- Inverse Scaling in Test-Time Compute ☆25 Dec 3, 2025 Updated 3 months ago
- ☆12 Jul 30, 2025 Updated 7 months ago
- An exploration of LLM steering ☆24 Jun 15, 2024 Updated last year
- Probabilistic Solution of Differential Equations ☆13 Jun 19, 2022 Updated 3 years ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation ☆15 Apr 23, 2025 Updated 10 months ago