zepingyu0512 / in-context-mechanismView external linksLinks
code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning
☆13Nov 17, 2024Updated last year
Alternatives and similar repositories for in-context-mechanism
Users that are interested in in-context-mechanism are comparing it to the libraries listed below
Sorting:
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆50Nov 17, 2024Updated last year
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.☆30Jun 28, 2020Updated 5 years ago
- Bayesian Low-Rank Adaptation for Large Language Models☆36Jun 22, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆41Aug 16, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 3 months ago
- Geometric Certifications of Neural Nets☆42Nov 22, 2022Updated 3 years ago
- ☆104Oct 30, 2023Updated 2 years ago
- ☆15Nov 22, 2023Updated 2 years ago
- ☆14May 21, 2024Updated last year
- ☆40Jan 16, 2026Updated last month
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated 11 months ago
- Efficient joint input optimization and inference with DEQ☆10Nov 25, 2021Updated 4 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- ☆13Jun 22, 2025Updated 7 months ago
- ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"☆10Nov 21, 2024Updated last year
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 8 months ago
- 基于pytorch的不平衡数据的文本分类☆12Dec 26, 2021Updated 4 years ago
- Official repository for "DEnsity: Open-domain Dialogue Evaluation Metric using Density Estimation (ACL2023 Findings)"☆11May 23, 2023Updated 2 years ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- Code for paper "Concrete Subspace Learning based Interference Elimination for Multi-task Model Fusion"☆14Mar 28, 2024Updated last year
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- Minimum viable code for the Decodable Information Bottleneck paper. Pytorch Implementation.☆11Oct 20, 2020Updated 5 years ago
- ☆12Jun 18, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023)☆11Nov 30, 2023Updated 2 years ago
- Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning☆12Jan 21, 2024Updated 2 years ago
- ☆17Feb 2, 2024Updated 2 years ago
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation☆13Sep 2, 2024Updated last year
- Probabilistic Solution of Differential Equations☆13Jun 19, 2022Updated 3 years ago
- ☆15Jul 5, 2024Updated last year
- Official Implementation of Paper "Learning to Jump: Thinning and Thickening Latent Counts for Generative Modeling" (ICML 2023)☆10Jun 6, 2023Updated 2 years ago
- Materials for paper "Are Large Language Models Temporally Grounded?"☆13Nov 16, 2023Updated 2 years ago
- Truth-Conditional Captions for Time Series Data. EMNLP 2021. Harsh Jhamtani, Taylor Berg-Kirkpatrick☆13Feb 9, 2022Updated 4 years ago
- Implementation of various generative models☆14Oct 1, 2018Updated 7 years ago
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Aug 4, 2023Updated 2 years ago
- Inverse Scaling in Test-Time Compute☆25Dec 3, 2025Updated 2 months ago