epfml / getting-startedLinks
☆25Updated 2 months ago
Alternatives and similar repositories for getting-started
Users that are interested in getting-started are comparing it to the libraries listed below
Sorting:
- Efficient empirical NTKs in PyTorch☆22Updated 3 years ago
- ☆106Updated 8 months ago
- Layer-wise Relevance Propagation for Large Language Models and Vision Transformers [ICML 2024]☆192Updated 3 months ago
- Conformal Language Modeling☆32Updated last year
- 👋 Overcomplete is a Vision-based SAE Toolbox☆90Updated 2 months ago
- ☆32Updated 10 months ago
- ☆32Updated last year
- A fast, effective data attribution method for neural networks in PyTorch☆218Updated 10 months ago
- ☆127Updated last week
- ☆81Updated 7 months ago
- Using sparse coding to find distributed representations used by neural networks.☆274Updated last year
- ☆240Updated last year
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Updated 6 months ago
- ☆51Updated 3 weeks ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆31Updated 8 months ago
- ☆31Updated 8 months ago
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆86Updated 3 weeks ago
- [ICLR 23 spotlight] An automatic and efficient tool to describe functionalities of individual neurons in DNNs☆55Updated last year
- (ICML 2023) Discover and Cure: Concept-aware Mitigation of Spurious Correlation☆42Updated last year
- DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)☆76Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆104Updated 2 years ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆63Updated 2 years ago
- An Open Source Implementation of Anthropic's Paper: "Towards Monosemanticity: Decomposing Language Models with Dictionary Learning"☆49Updated last year
- Localizing Memorized Sequences in Language Models☆18Updated 6 months ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆32Updated 2 years ago
- ☆63Updated 7 months ago
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods☆136Updated 3 months ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- Code for the paper "Post-hoc Concept Bottleneck Models". Spotlight @ ICLR 2023☆84Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆40Updated 11 months ago