Decomposing and Editing Predictions by Modeling Model Computation
☆139 · Jun 12, 2024 · Updated 2 years ago
Alternatives and similar repositories for modelcomponents
Users who are interested in modelcomponents are comparing it to the libraries listed below.
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models" ☆25 · Dec 12, 2023 · Updated 2 years ago
- ☆53 · Jan 24, 2024 · Updated 2 years ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆96 · May 25, 2023 · Updated 3 years ago
- ☆25 · May 20, 2020 · Updated 6 years ago
- Official repository for our NeurIPS 2021 paper "Unadversarial Examples: Designing Objects for Robust Vision" ☆104 · Jul 25, 2024 · Updated last year
- A tool to assist in the interpretation of learned features in sparse autoencoders (in particular the four SAE's trained by Joseph Bloom o… ☆19 · Oct 4, 2024 · Updated last year
- Python library for argument and configuration management ☆57 · Feb 7, 2023 · Updated 3 years ago
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024) ☆34 · Nov 15, 2023 · Updated 2 years ago
- ☆32 · May 24, 2023 · Updated 3 years ago
- ☆17 · Dec 19, 2024 · Updated last year
- This repository provides code for "On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness". ☆46 · Nov 6, 2022 · Updated 3 years ago
- PyTorch Implementation of "ASTRA: An Action Spotting TRAnsformer for Soccer Videos", ACM MMSports 2023. | 3rd place solution for SoccerNe… ☆44 · May 20, 2024 · Updated 2 years ago
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- A fast, effective data attribution method for neural networks in PyTorch ☆241 · Nov 18, 2024 · Updated last year
- ☆37 · Jun 10, 2021 · Updated 5 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆59 · Aug 15, 2023 · Updated 2 years ago
- Code for T-MARS data filtering ☆35 · Aug 23, 2023 · Updated 2 years ago
- Finding trojans in aligned LLMs. Official repository for the competition hosted at SaTML 2024. ☆117 · Jun 13, 2024 · Updated last year
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆78 · Sep 4, 2023 · Updated 2 years ago
- Distilling Model Failures as Directions in Latent Space ☆48 · Feb 8, 2023 · Updated 3 years ago
- A simple and efficient baseline for data attribution ☆11 · Nov 10, 2023 · Updated 2 years ago
- On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020] ☆35 · Jul 3, 2021 · Updated 4 years ago
- ☆23 · Jan 25, 2024 · Updated 2 years ago
- Improving Alignment and Robustness with Circuit Breakers ☆262 · Sep 24, 2024 · Updated last year
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024). ☆13 · Aug 6, 2024 · Updated last year
- Official repository for "Stylized Adversarial Training" (TPAMI 2022) ☆11 · Dec 30, 2022 · Updated 3 years ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling ☆31 · Feb 6, 2026 · Updated 4 months ago
- The official implementation of Cross-Task Experience Sharing (COPS) ☆29 · Oct 23, 2024 · Updated last year
- Official repo for EMNLP 2023 paper "Explain-then-Translate: An Analysis on Improving Program Translation with Self-generated Explanations… ☆29 · Dec 5, 2023 · Updated 2 years ago
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT" ☆45 · Jun 4, 2026 · Updated last week
- Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation ☆45 · Feb 27, 2023 · Updated 3 years ago
- Fine-grained ImageNet annotations ☆30 · May 25, 2020 · Updated 6 years ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature ☆195 · May 23, 2026 · Updated 2 weeks ago
- ☆25 · Jun 22, 2023 · Updated 2 years ago
- Sparse and discrete interpretability tool for neural networks ☆64 · Feb 12, 2024 · Updated 2 years ago
- Attribute statements generated by LLMs to preceding tokens using attention weights. ☆26 · Apr 22, 2025 · Updated last year
- Comparison of gradient estimation techniques for black-box adversarial examples ☆11 · Oct 31, 2018 · Updated 7 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 · Apr 17, 2024 · Updated 2 years ago
- Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat… ☆16 · Jun 28, 2023 · Updated 2 years ago