zepingyu0512/in-context-mechanism

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zepingyu0512/in-context-mechanism)

zepingyu0512 / in-context-mechanism

code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning

☆13

Alternatives and similar repositories for in-context-mechanism

Users that are interested in in-context-mechanism are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zepingyu0512 / arithmetic-mechanism
View on GitHub
code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis
☆12Nov 17, 2024Updated last year
zepingyu0512 / neuron-attribution
View on GitHub
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆52Nov 17, 2024Updated last year
aryamanarora / bayesian-laws-icl
View on GitHub
Bayesian scaling laws for in-context learning.
☆16Mar 12, 2025Updated last year
machinelearning4health / TextHoaxer
View on GitHub
Implementation Code of TextHoaxer
☆15Aug 21, 2022Updated 3 years ago
NYUSHCS / UniGLM
View on GitHub
☆15Jul 5, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
LLM-MI-Research / Actionable-MI
View on GitHub
☆15Jan 20, 2026Updated 6 months ago
taishan1994 / pytorch_unbalanced_text_classification
View on GitHub
基于pytorch的不平衡数据的文本分类
☆12Dec 26, 2021Updated 4 years ago
declare-lab / della
View on GitHub
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37Jul 12, 2024Updated 2 years ago
showlab / AVA-AVD
View on GitHub
☆22Nov 24, 2022Updated 3 years ago
hannamw / eap-ig-faithfulness
View on GitHub
Code for "Automatic Circuit Finding and Faithfulness"
☆19Jul 11, 2024Updated 2 years ago
jonasrauber / linear-region-attack
View on GitHub
A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb…
☆12Aug 5, 2020Updated 5 years ago
revbucket / geometric-certificates
View on GitHub
Geometric Certifications of Neural Nets
☆42Nov 22, 2022Updated 3 years ago
INK-USC / hierarchical-explanation-neural-sequence-models
View on GitHub
Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.
☆29Jun 28, 2020Updated 6 years ago
yizhongw / truthfulqa_reeval
View on GitHub
☆12Mar 7, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Tizzzzy / Demonstration_Selection_Overview
View on GitHub
✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"
☆16Nov 8, 2024Updated last year
kyegomez / MultiModal-ToT
View on GitHub
Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement
☆17Nov 11, 2024Updated last year
Xemin0 / ReadingEmbedding
View on GitHub
Implementation of "From Word Embedding to Reading Embedding Using Large Language Model, EEG and Eye-tracking"
☆46Mar 5, 2025Updated last year
jyansir / Text2Tree
View on GitHub
(EMNLP 2023 Findings) Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification.
☆16Feb 27, 2024Updated 2 years ago
Ksuriuri / LLMCI
View on GitHub
Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers
☆14Jun 7, 2024Updated 2 years ago
stzhang-patrick / ArcMMLU
View on GitHub
☆16Feb 2, 2024Updated 2 years ago
cavedweller509 / SentenceVAE
View on GitHub
Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context
☆42Aug 16, 2024Updated last year
cvenhoff / vlm-mapping
View on GitHub
☆19Jun 20, 2025Updated last year
dhchenx / umls-graph
View on GitHub
Build a medical knowledge graph based on Unified Language Medical System (UMLS)
☆28Dec 25, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
VProv / uncertainty_example
View on GitHub
☆13Oct 12, 2020Updated 5 years ago
forwchen / LLaVA-MoLE
View on GitHub
☆10Mar 4, 2024Updated 2 years ago
YivanZhang / lio
View on GitHub
Learning from Indirect Observations
☆11Jul 16, 2021Updated 5 years ago
riccardotommasini / imkg
View on GitHub
The Internet Memes Knowledge Graph
☆18Oct 18, 2024Updated last year
SethEBaldwin / mdscuda
View on GitHub
CUDA implementation of Multidimensional Scaling
☆15May 8, 2021Updated 5 years ago
bond005 / impartial_text_cls
View on GitHub
Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.
☆14Mar 24, 2023Updated 3 years ago
roeehendel / icl_task_vectors
View on GitHub
☆106Oct 30, 2023Updated 2 years ago
PeterGriffinJin / Patton
View on GitHub
Patton: Language Model Pretraining on Text-rich Networks (ACL 2023 main oral)
☆32Feb 10, 2025Updated last year
QxLabIreland / AQP
View on GitHub
☆23Jun 13, 2022Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jugechengzi / Rationalization-MGR
View on GitHub
ACL 2023 *oral* paper "MGR: Multi-generator based Rationalization"
☆10Nov 21, 2024Updated last year
matchten / LoRA-Models-for-SAEs
View on GitHub
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆17Mar 31, 2025Updated last year
edenbiran / HoppingTooLate
View on GitHub
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆33Mar 2, 2025Updated last year
revbucket / lipMIP
View on GitHub
Mixed integer programming for computing lipschitz constants of ReLU Networks
☆17Feb 10, 2023Updated 3 years ago
davinhill / BivariateShapley
View on GitHub
Bivariate Shapley is a Shapley-based method of identifying directional feature interactions and feature redundancy
☆20May 19, 2025Updated last year
yaohungt / Barlow-Twins-HSIC
View on GitHub
☆57Dec 20, 2021Updated 4 years ago
tapilab / aaai-2021-counterfactuals
View on GitHub
☆13Jul 6, 2021Updated 5 years ago