☆114 · Feb 10, 2026 · Updated last month
Alternatives and similar repositories for subliminal-learning
Users that are interested in subliminal-learning are comparing it to the libraries listed below.
- ☆17 · Dec 21, 2023 · Updated 2 years ago
- A collection of different ways to implement accessing and modifying internal model activations for LLMs ☆20 · Oct 18, 2024 · Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind) ☆20 · Jan 19, 2025 · Updated last year
- ☆52 · Oct 23, 2023 · Updated 2 years ago
- AlgZoo: uninterpreted models with fewer than 1,500 parameters ☆47 · Jan 19, 2026 · Updated 2 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆69 · Updated this week
- James' cookbook of evaluations and finetuning experiments ☆23 · Feb 19, 2026 · Updated last month
- ☆48 · Sep 29, 2024 · Updated last year
- Code for our paper "Localizing Lying in Llama" ☆13 · Apr 24, 2025 · Updated 11 months ago
- Modified to support crosscoder training. ☆25 · Feb 4, 2026 · Updated last month
- ☆20 · Jan 21, 2023 · Updated 3 years ago
- ☆12 · Oct 23, 2022 · Updated 3 years ago
- Utilities for the HuggingFace transformers library ☆75 · Jan 21, 2023 · Updated 3 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts. ☆14 · May 5, 2022 · Updated 3 years ago
- ppx_system is a syntax extension to know the operating system at compile time ☆12 · May 9, 2023 · Updated 2 years ago
- Simple partially ordered sets for Julia ☆10 · Jul 29, 2024 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 · May 23, 2024 · Updated last year
- @ngrok/mantle ui component library | https://develop.mantle.ngrok.com ☆13 · Updated this week
- ☆119 · Feb 11, 2025 · Updated last year
- ☆25 · Feb 23, 2026 · Updated last month
- A library for training crosscoders ☆16 · May 28, 2025 · Updated 10 months ago
- Particle Syntax Website ☆16 · Sep 16, 2024 · Updated last year
- I read and summarized an academic paper every day for a year. ☆11 · Dec 27, 2020 · Updated 5 years ago
- I added selfplay functionality to openai gyms ☆10 · Jan 16, 2021 · Updated 5 years ago
- Prompts used in the Automated Auditing Blog Post ☆146 · Jul 24, 2025 · Updated 8 months ago
- Efficiently inverting a probabilistic graphics program of face generation with an inference network. Includes computational models and ne… ☆17 · Feb 17, 2026 · Updated last month
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban ☆17 · Jun 29, 2025 · Updated 8 months ago
- ☆27 · Oct 6, 2024 · Updated last year
- ☆13 · Jun 30, 2020 · Updated 5 years ago
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you… ☆37 · May 14, 2024 · Updated last year
- Runtime library and schema compiler for the Avro serialization format ☆21 · Dec 13, 2021 · Updated 4 years ago
- Geometry of the moduli space of a closed linkage ☆12 · Feb 7, 2026 · Updated last month
- A powerful keybind library and daemon for Linux. ☆11 · Jul 24, 2022 · Updated 3 years ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆246 · Updated this week
- Brutaltester compatible referee for coders strike back ☆12 · Nov 27, 2018 · Updated 7 years ago
- ☆33 · Mar 27, 2025 · Updated last year
- Scala Native 3 bindings for SFML library ☆15 · Jul 9, 2023 · Updated 2 years ago
- ☆18 · May 14, 2022 · Updated 3 years ago
- A library for mechanistic interpretability of GPT-style language models ☆3,223 · Updated this week