☆12 · Jul 12, 2024 · Updated last year
Alternatives and similar repositories for sleeper-agents
Users who are interested in sleeper-agents are comparing it to the repositories listed below.
- Decoder only transformer, built from scratch with PyTorch ☆33 · Oct 22, 2023 · Updated 2 years ago
- Repository with sample code using Apollo's suggested engineering practices ☆15 · Dec 16, 2024 · Updated last year
- ControlArena is a collection of settings, model organisms and protocols for running control experiments. ☆160 · Feb 27, 2026 · Updated last week
- Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography. ☆25 · Jan 26, 2024 · Updated 2 years ago
- Improving Steering Vectors by Targeting Sparse Autoencoder Features ☆27 · Nov 20, 2024 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 · May 23, 2024 · Updated last year
- ☆33 · Jun 4, 2025 · Updated 9 months ago
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022 ☆15 · Apr 17, 2023 · Updated 2 years ago
- Enhanced Explainable Neural Network ☆10 · Dec 25, 2021 · Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- ☆36 · Apr 30, 2024 · Updated last year
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… ☆13 · Nov 1, 2022 · Updated 3 years ago
- JSSP dataset for LLMs ☆16 · May 29, 2025 · Updated 9 months ago
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see README.… ☆10 · Mar 7, 2024 · Updated 2 years ago
- ☆10 · Jan 25, 2019 · Updated 7 years ago
- ☆11 · Dec 14, 2022 · Updated 3 years ago
- Temporal summarization framework ☆10 · Dec 4, 2023 · Updated 2 years ago
- ☆11 · Jan 13, 2026 · Updated last month
- ☆12 · Feb 27, 2023 · Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes ☆11 · Oct 24, 2023 · Updated 2 years ago
- ☆13 · Feb 4, 2025 · Updated last year
- ☆10 · Oct 26, 2022 · Updated 3 years ago
- ☆10 · Nov 15, 2023 · Updated 2 years ago
- A Chrome extension that generates binaural beats. ☆23 · Aug 23, 2023 · Updated 2 years ago
- ☆10 · Apr 15, 2022 · Updated 3 years ago
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks ☆14 · Mar 2, 2023 · Updated 3 years ago
- Deep Generative Model (Torch) ☆11 · Apr 19, 2016 · Updated 9 years ago
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network ☆15 · Jun 11, 2024 · Updated last year
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR) ☆12 · Jul 10, 2022 · Updated 3 years ago
- Solver for discrete Mixed Observable Markov Decision Processes ☆11 · Oct 30, 2020 · Updated 5 years ago
- ☆10 · Aug 16, 2023 · Updated 2 years ago
- ☆133 · Oct 16, 2025 · Updated 4 months ago
- Code for the paper: Kernel Distributionally Robust Optimization ☆13 · Feb 21, 2021 · Updated 5 years ago
- TextMate plugin (Cocoa) shell for running 'ack' ☆25 · Jul 5, 2013 · Updated 12 years ago
- ☆12 · Feb 19, 2025 · Updated last year
- ☆10 · Apr 26, 2023 · Updated 2 years ago
- A Hadoop MapReduce application that retrieves data/articles related to sports from sources like NY Times, Commoncrawl, and Twitter and c… ☆13 · Oct 3, 2019 · Updated 6 years ago
- ☆12 · Jan 19, 2024 · Updated 2 years ago
- Module to parse lines from OCR’d New York City directories into separate fields, such as names, occupations, and addresses. ☆10 · Dec 15, 2017 · Updated 8 years ago