JacksonWuxs/Interpret_Instruction_Tuning_LLMs

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JacksonWuxs/Interpret_Instruction_Tuning_LLMs)

JacksonWuxs / Interpret_Instruction_Tuning_LLMs

Understanding Why and How Instruction Tuning Changes Pre-trained Models

☆25

Alternatives and similar repositories for Interpret_Instruction_Tuning_LLMs

Users that are interested in Interpret_Instruction_Tuning_LLMs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sycny / SelfSynthX
View on GitHub
[ICLR2025] Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data
☆29Mar 3, 2025Updated last year
Abbey4799 / CELLO
View on GitHub
Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)
☆51Apr 19, 2024Updated 2 years ago
JacksonWuxs / PromptRec
View on GitHub
Prompting Small Language Models for Personalized Cold-Start Recommendation
☆32Mar 9, 2024Updated 2 years ago
ninghaohello / Interpretation2Adversary
View on GitHub
Adversarial learning by utilizing model interpretation
☆10Oct 19, 2018Updated 7 years ago
theshi-1128 / llm-defense
View on GitHub
An easy-to-use Python framework to defend against jailbreak prompts.
☆21Mar 22, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NYUSHCS / UniGLM
View on GitHub
☆15Jul 5, 2024Updated 2 years ago
pikinder / nn-patterns
View on GitHub
TODO ;)
☆12Aug 13, 2018Updated 7 years ago
alexjfoote / Neuron2Graph
View on GitHub
Tools for exploring Transformer neuron behaviour, including input pruning and diversification.
☆10Jun 6, 2023Updated 3 years ago
gao-xiao-bai / StrategyLLM
View on GitHub
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
☆22Dec 11, 2024Updated last year
mndu / REAT
View on GitHub
PyTorch code for WWW 19 paper: On Attribution of Recurrent Neural Network Predictions via Additive Decomposition
☆11Mar 18, 2021Updated 5 years ago
sycny / GiGaMAE
View on GitHub
[CIKM2023] GiGaMA: Generalizable Graph Masked Autoencoder via Collaborative Latent Space Reconstruction
☆18Aug 31, 2023Updated 2 years ago
JacksonWuxs / UsableXAI_LLM
View on GitHub
Using Explanations as a Tool for Advanced LLMs
☆71Sep 11, 2024Updated last year
nuric / pix2rule
View on GitHub
From pixels to symbolic rule learning
☆12Nov 12, 2021Updated 4 years ago
leanprover-community / mathlib-changelog
View on GitHub
☆15Apr 1, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
nishantsubramani / steering_vectors
View on GitHub
Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings
☆11Mar 14, 2022Updated 4 years ago
JacksonWuxs / BeeDrive
View on GitHub
BeeDrive: Open Source Privacy File Transfering System for Teams and Individual Developers
☆12Mar 12, 2024Updated 2 years ago
leonweber / spyrolog
View on GitHub
Prolog interpreter with support for weak unification. Fork of https://bitbucket.org/cfbolz/pyrolog/
☆15Jun 23, 2020Updated 6 years ago
studio-ousia / textent
View on GitHub
Representation Learning of Entities and Documents from Knowledge Base Descriptions
☆18Oct 6, 2018Updated 7 years ago
chanind / causal-tracer
View on GitHub
Causal tracing for language models
☆12Apr 2, 2024Updated 2 years ago
Hengrui-Gu / PokeMQA
View on GitHub
[ACL 2024] PokeMQA: Programmable knowledge editing for Multi-hop Question Answering
☆19Jun 8, 2024Updated 2 years ago
sycny / ZIP
View on GitHub
[NeurIPS2023] Black-box Backdoor Defense via Zero-shot Image Purification
☆17Oct 31, 2023Updated 2 years ago
steelsojka / eslint-import-alias
View on GitHub
ESLint rule for restricting imports to path aliases
☆22May 4, 2024Updated 2 years ago
Vilin97 / linear-algebra-done-right
View on GitHub
☆12Jun 30, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
emorynlp / ddr
View on GitHub
Deep Dependency Representation
☆16May 9, 2018Updated 8 years ago
chanind / amr-logic-converter
View on GitHub
Convert Abstract Meaning Representation (AMR) into first-order logic
☆17Aug 7, 2024Updated last year
ShanleiMu / DeepLearningResource
View on GitHub
Deep learning resource
☆21Jul 4, 2020Updated 6 years ago
mndu / guided-feature-inversion
View on GitHub
PyTorch code for KDD 18 paper: Towards Explanation of DNN-based Prediction with Guided Feature Inversion
☆21Feb 4, 2019Updated 7 years ago
anlausch / LIBERT
View on GitHub
Code from the paper "Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity"
☆19May 8, 2020Updated 6 years ago
justforyou16007 / AllenNLP-Tutorials-Chinese
View on GitHub
中文AllenNLP教程（持续更新）
☆14Jun 3, 2019Updated 7 years ago
mmasdeu / topologygame
View on GitHub
Learn Lean and topology
☆26Apr 28, 2023Updated 3 years ago
dtch1997 / steering-bench
View on GitHub
Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"
☆22Dec 14, 2024Updated last year
Dakingrai / neuron-analysis-cot-arithmetic-reasoning
View on GitHub
☆14Feb 24, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
MJ-Jang / BECEL
View on GitHub
☆10Jan 28, 2024Updated 2 years ago
THU-KEG / COPEN
View on GitHub
The official code and dataset for EMNLP 2022 paper "COPEN: Probing Conceptual Knowledge in Pre-trained Language Models".
☆21Mar 9, 2023Updated 3 years ago
mpsae / MP-SAE
View on GitHub
☆17May 19, 2026Updated 2 months ago
hungntt / LangXAI
View on GitHub
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks
☆19Oct 3, 2024Updated last year
jonchardy / typedoc-plugin-no-inherit
View on GitHub
Exclude inherited members from a Typedoc class using @noInheritDoc annotation
☆20Apr 7, 2026Updated 3 months ago
ahxt / G2R
View on GitHub
[WWW2022] Geometric Graph Representation Learning via Maximizing Rate Reduction
☆26May 27, 2022Updated 4 years ago
J3rome / py-requirements-guesser
View on GitHub
Attempt to guess requirements.txt modules versions based on Git history
☆22Jul 30, 2021Updated 4 years ago