The codebase for Inducing Causal Structure for Interpretable Neural Networks
☆11Dec 3, 2021Updated 4 years ago
Alternatives and similar repositories for Interchange-Intervention-Training
Users that are interested in Interchange-Intervention-Training are comparing it to the libraries listed below
Sorting:
- Souce code of "Inter-seasons and Inter-households Domain Adaptation Based on DANNs and Pseudo Labeling for Non-Intrusive Occupancy Detect…☆14Feb 5, 2025Updated last year
- ☆36Jul 14, 2022Updated 3 years ago
- CEBaB: Estimating the Causal Effects of Real-World Concepts on NLP Model Behavior☆13Sep 21, 2022Updated 3 years ago
- Evaluate interpretability methods on localizing and disentangling concepts in LLMs.☆57Oct 30, 2025Updated 4 months ago
- ☆31Oct 13, 2021Updated 4 years ago
- ☆13Oct 5, 2025Updated 4 months ago
- 🪝PISCES - Precise In-Parameter Suppression for Concept EraSure in Large Language Models☆12May 30, 2025Updated 9 months ago
- ☆14Apr 29, 2025Updated 10 months ago
- Forecasting library in python☆13Sep 6, 2019Updated 6 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- ☆85Updated this week
- Reasoning-based Evaluation and Ranking of Translations.☆19Jul 18, 2025Updated 7 months ago
- Single shot neural network pruning before training the model, based on connection sensitivity☆11Aug 7, 2019Updated 6 years ago
- A reinforcement learning agent that learns to solve mazes using Group Relative Policy Optimization (GRPO).☆12Feb 9, 2025Updated last year
- A library to manipulate Inkscape SVG content using Python 3☆10Apr 28, 2021Updated 4 years ago
- This is the code of our work CISS Certified Robustness Against Natural Language Attacks by Causal Intervention published on ICML 2022☆11Dec 6, 2022Updated 3 years ago
- The source code of paper “HAZY RE-ID: AN INTERFERENCE SUPPRESSION MODEL FOR DOMAIN ADAPTATION PERSON RE-IDENTIFICATION UNDER INCLEMENT WE…☆12May 26, 2021Updated 4 years ago
- Official repository for "Stylized Adversarial Training" (TPAMI 2022)☆11Dec 30, 2022Updated 3 years ago
- Influence of fake news in Twitter during the 2016 US presidential election☆10Jan 7, 2021Updated 5 years ago
- ☆11Apr 4, 2022Updated 3 years ago
- ☆10Mar 19, 2024Updated last year
- Codes for "Benchmarking the Generation of Fact Checking Explanations"☆10Aug 16, 2024Updated last year
- Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"☆33Nov 1, 2025Updated 4 months ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- 一个基于 GitHub Actions 的自动化工具,每天早上自动追踪和分析 arXiv 最新论文,并通过邮件发送分析报告。该工具使用 DeepSeek AI 进行论文分析和总结。☆21Jun 20, 2025Updated 8 months ago
- Evaluation results for Machine Translation within the BigScience project☆11May 15, 2023Updated 2 years ago
- A simplified version of MPN☆13May 21, 2021Updated 4 years ago
- https://arxiv.org/abs/2404.10917☆14Mar 18, 2025Updated 11 months ago
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- A command line tool for comparing JSON files by degree of similarity.☆12Oct 28, 2019Updated 6 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- A simple Python package for deep learning using forward automatic differentiation based on JAX.☆14Aug 17, 2022Updated 3 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)☆12Mar 7, 2024Updated last year
- ☆13Jun 9, 2021Updated 4 years ago
- Official Implementation of "Transferring Inductive Biases Through Knowledge Distillation"☆15Jun 3, 2020Updated 5 years ago
- Multi-Figurative Language Generation (COLING 2022)☆12Jan 30, 2023Updated 3 years ago