☆111Feb 10, 2026Updated 3 weeks ago
Alternatives and similar repositories for subliminal-learning
Users that are interested in subliminal-learning are comparing it to the libraries listed below
Sorting:
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆20Oct 18, 2024Updated last year
- a network tunneling proxy written in go☆34Jan 1, 2026Updated 2 months ago
- data from 2,961 in-person dates☆14Aug 31, 2023Updated 2 years ago
- Code for our paper "Localizing Lying in Llama"☆13Apr 24, 2025Updated 10 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆63Feb 26, 2026Updated last week
- ☆35May 9, 2025Updated 9 months ago
- ☆37Feb 11, 2025Updated last year
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- ☆17Dec 21, 2023Updated 2 years ago
- ☆48Sep 29, 2024Updated last year
- PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)☆20Jan 19, 2025Updated last year
- ☆20Jan 21, 2023Updated 3 years ago
- ☆37Jul 4, 2025Updated 8 months ago
- Modified to support crosscoder training.☆25Feb 4, 2026Updated last month
- ☆27Oct 6, 2024Updated last year
- Open source replication of Anthropic's Crosscoders for Model Diffing☆64Oct 27, 2024Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28May 23, 2024Updated last year
- ☆399Aug 21, 2025Updated 6 months ago
- A power user focused interface for LLM base models.☆53Feb 17, 2026Updated 2 weeks ago
- Inference API for many LLMs and other useful tools for empirical research☆107Feb 27, 2026Updated last week
- Create and deploy virtual-experiments - co-processing computational workflows☆10Jan 28, 2026Updated last month
- ☆142Aug 20, 2025Updated 6 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆244Updated this week
- never forget anything again! combine AI and intelligent tooling for a local knowledge base to track catalogue, annotate, and plan for you…☆37May 14, 2024Updated last year
- Prompts used in the Automated Auditing Blog Post☆139Jul 24, 2025Updated 7 months ago
- PARADIS, a lightweight and flexible weather forecast model that tries to Keep It Simple.☆26Feb 4, 2026Updated last month
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- The System Stacks for Linux* OS are a collection of production ready docker images for Deep Learning, Media and Storage optimized for 2nd…☆34Jan 10, 2023Updated 3 years ago
- ☆14Dec 5, 2025Updated 3 months ago
- ext_mpi_collectives☆11Apr 1, 2025Updated 11 months ago
- The public web API of the National Museum of Australia☆11Sep 12, 2023Updated 2 years ago
- Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use☆61Aug 19, 2025Updated 6 months ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- Machine Learning for Alignment Bootcamp☆82Apr 27, 2022Updated 3 years ago
- ☆52Oct 23, 2023Updated 2 years ago
- ☆36Apr 30, 2024Updated last year
- Code and data for the Walert large language model-based chatbot☆12Aug 14, 2025Updated 6 months ago
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- Library on Arduino to add over the air (OTA) Update Capabilities to bw16/rtl8720DN☆11Aug 6, 2024Updated last year