☆19Jan 28, 2024Updated 2 years ago
Alternatives and similar repositories for TransformerLens-intro
Users that are interested in TransformerLens-intro are comparing it to the libraries listed below.
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neurons☆13Feb 13, 2023Updated 3 years ago
- TransformerLens + HuggingFace☆11Nov 4, 2023Updated 2 years ago
- ☆92Dec 18, 2025Updated 3 months ago
- Official Code for our paper: "Language Models Learn to Mislead Humans via RLHF"☆19Oct 11, 2024Updated last year
- Mechanistic Interpretability Visualizations using React☆338Dec 18, 2024Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Universal Neurons in GPT2 Language Models☆30May 28, 2024Updated last year
- Understanding Rare Spurious Correlations in Neural Network☆12Jun 5, 2022Updated 3 years ago
- [NeurIPS 2023] "Learning to Augment Distributions for Out-of-distribution Detection"☆11Nov 14, 2023Updated 2 years ago
- ☆16Dec 18, 2023Updated 2 years ago
- ☆15Dec 19, 2022Updated 3 years ago
- Python package to accelerate research on generalized out-of-distribution (OOD) detection.☆15Jun 19, 2024Updated last year
- Code for CVPR 2024 Oral "Neural Lineage"☆17Jun 18, 2024Updated last year
- ICML 2024 Paper "Adversarial Robustness Limits via Scaling-Law and Human-Alignment Studies"☆18Jul 10, 2024Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆217Apr 6, 2026Updated last week
- A library for mechanistic interpretability of GPT-style language models☆3,304Updated this week
- Yazi plugin to paste clipboard content to file.☆15Feb 16, 2026Updated 2 months ago
- ☆24Nov 11, 2024Updated last year
- my hammerspoon config☆11Jun 8, 2025Updated 10 months ago
- PyTorch adversarial attack baselines for ImageNet, CIFAR10, and MNIST (state-of-the-art attacks comparison)☆20Mar 12, 2021Updated 5 years ago
- Certified robustness of deep neural networks☆19Aug 20, 2024Updated last year
- The code for the Ensemble everything everywhere: Multi-scale aggregation for adversarial robustness paper☆22Nov 8, 2024Updated last year
- Run test on demand with support for many test runners☆10Aug 13, 2024Updated last year
- Code for "Don't trust your eyes: on the (un)reliability of feature visualizations" (ICML 2024)☆34Nov 15, 2023Updated 2 years ago
- Robust Principles: Architectural Design Principles for Adversarially Robust CNNs☆24Jan 13, 2024Updated 2 years ago
- ☆1,031Mar 29, 2026Updated 2 weeks ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations☆15Apr 6, 2026Updated last week
- Sparkline weather forecasts in Emacs☆26Mar 17, 2026Updated 3 weeks ago
- Chat interface and library for interacting with different LLMs via Emacs.☆13Mar 19, 2025Updated last year
- Putting Visual Object Recognition in Context☆18Aug 3, 2021Updated 4 years ago
- GPI-Space: Memory Driven Computing and Big Data☆10Mar 17, 2026Updated 3 weeks ago
- 🍰☆12Apr 2, 2026Updated last week
- Config files for a shortcut system based on Karabiner (via Goku), Yabai, and Übersicht (via Nero.)☆11Jan 16, 2026Updated 3 months ago
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.☆12May 3, 2023Updated 2 years ago
- A benchmark of programming tasks for LLMs that supports almost any programming language.☆13Jun 30, 2025Updated 9 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆145Sep 14, 2022Updated 3 years ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Oct 20, 2024Updated last year
- PSI-MOD ontology for modified and unmodified amino acid residues☆15Jan 8, 2026Updated 3 months ago
- Emacs org mode export backend for beamer lectures.☆10Sep 18, 2025Updated 6 months ago