code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning
☆13Nov 17, 2024Updated last year
Alternatives and similar repositories for in-context-mechanism
Users that are interested in in-context-mechanism are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models☆52Nov 17, 2024Updated last year
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- Implementation Code of TextHoaxer☆15Aug 21, 2022Updated 3 years ago
- ☆15Jul 5, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于pytorch的不平衡数据的文本分类☆12Dec 26, 2021Updated 4 years ago
- Code for "Automatic Circuit Finding and Faithfulness"☆17Jul 11, 2024Updated last year
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling☆36Jul 12, 2024Updated last year
- A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb…☆12Aug 5, 2020Updated 5 years ago
- Geometric Certifications of Neural Nets☆42Nov 22, 2022Updated 3 years ago
- Source code for "Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models", ICLR 2020.☆29Jun 28, 2020Updated 5 years ago
- UniGraph: Learning a Unified Cross-Domain Foundation Model for Text-Attributed Graphs (KDD'25)☆26Jun 6, 2025Updated 9 months ago
- ☆12Mar 7, 2024Updated 2 years ago
- ✨✨ Official repo for "Comparative Analysis of Demonstration Selection Algorithms for LLM In-Context Learning"☆16Nov 8, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆27Oct 20, 2025Updated 5 months ago
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- ☆17Feb 2, 2024Updated 2 years ago
- Build a medical knowledge graph based on Unified Language Medical System (UMLS)☆26Dec 25, 2021Updated 4 years ago
- ☆13Oct 12, 2020Updated 5 years ago
- Enable Next-sentence Prediction for Large Language Models with Faster Speed, Higher Accuracy and Longer Context☆42Aug 16, 2024Updated last year
- ☆10Mar 4, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation of "From Word Embedding to Reading Embedding Using Large Language Model, EEG and Eye-tracking"☆46Mar 5, 2025Updated last year
- ☆18Jun 20, 2025Updated 9 months ago
- CUDA implementation of Multidimensional Scaling☆15May 8, 2021Updated 4 years ago
- ☆50Dec 24, 2024Updated last year
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆33Mar 2, 2025Updated last year
- (EMNLP 2023 Findings) Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification.☆15Feb 27, 2024Updated 2 years ago
- ☆105Oct 30, 2023Updated 2 years ago
- The Internet Memes Knowledge Graph☆15Oct 18, 2024Updated last year
- Text classifier, based on the BERT and a Bayesian neural network, which can train on small labeled texts and doubt its decision.☆14Mar 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- Patton: Language Model Pretraining on Text-rich Networks (ACL 2023 main oral)☆32Feb 10, 2025Updated last year
- Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024☆16Jul 11, 2024Updated last year
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago
- Official repository for "EduBench: A Comprehensive Benchmarking Dataset for Evaluating Large Language Models in Diverse Educational Scena…☆19May 28, 2025Updated 10 months ago
- Learning from Indirect Observations☆11Jul 16, 2021Updated 4 years ago
- ☆24Jun 13, 2022Updated 3 years ago