Situational Awareness Dataset
☆46Dec 14, 2024Updated last year
Alternatives and similar repositories for sad
Users that are interested in sad are comparing it to the libraries listed below
Sorting:
- ☆20Nov 15, 2024Updated last year
- A TinyStories LM with SAEs and transcoders☆14Apr 3, 2025Updated 10 months ago
- (Model-written) LLM evals library☆18Dec 13, 2024Updated last year
- ☆36Apr 30, 2024Updated last year
- ☆25Sep 5, 2024Updated last year
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- A library for efficient patching and automatic circuit discovery.☆90Dec 31, 2025Updated 2 months ago
- ☆119Jan 19, 2026Updated last month
- ☆17Updated this week
- METR Task Standard☆177Feb 3, 2025Updated last year
- ☆22Sep 2, 2025Updated 5 months ago
- Approximating the joint distribution of language models via MCTS☆22Nov 3, 2024Updated last year
- ☆27Oct 6, 2024Updated last year
- ☆28Jun 13, 2019Updated 6 years ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year
- ☆28May 4, 2023Updated 2 years ago
- ☆89Dec 18, 2025Updated 2 months ago
- Auditing agents for fine-tuning safety☆20Oct 21, 2025Updated 4 months ago
- B-Spline Density Estimation Library - nonparametric density estimation using B-Spline density estimator from univariate sample.☆16Aug 22, 2021Updated 4 years ago
- Attribution-based Parameter Decomposition☆34Jun 11, 2025Updated 8 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆243Feb 23, 2026Updated last week
- Measuring the situational awareness of language models☆40Feb 12, 2024Updated 2 years ago
- ☆35Sep 13, 2023Updated 2 years ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆90Jan 29, 2024Updated 2 years ago
- Project exploring 3D volumetric rendering of NEXRAD radar data.☆11Oct 23, 2023Updated 2 years ago
- Automation tool for testing C* OSS that assembles cassandra-diff, nosqlbench, fqltool☆11Mar 20, 2023Updated 2 years ago
- ☆10Nov 7, 2022Updated 3 years ago
- PatientSim: A Persona-Driven Simulator for Realistic Doctor-Patient Interactions (NeurIPS 2025 D&B track, Spotlight)☆23Feb 11, 2026Updated 2 weeks ago
- A nonparametric variational information bottleneck (NVIB) layer in Pytorch☆11Apr 15, 2025Updated 10 months ago
- Bayesian adaptive stimulus placement of psychometric function for MATLAB.☆10Nov 7, 2018Updated 7 years ago
- Eliza Agent Weaver enables you to develop a set of Character files based on your own lore, and connects the narratives of multiple agents…☆10Dec 12, 2024Updated last year
- An open-source platform for building and deploying real-time, low-latency AI voice agents for call automation for marketing.☆18Oct 16, 2025Updated 4 months ago
- A Python library for building modular, reproducible simulation pipelines in minutes☆32Aug 22, 2025Updated 6 months ago
- Fast, free, easy, and object-agnostic video anonymization☆11Dec 12, 2020Updated 5 years ago
- BanterBot: An OpenAI ChatGPT-powered chatbot with Azure Neural Voices. Supports multilingual speech-to-text and text-to-speech interactio…☆11Jan 23, 2026Updated last month
- automatic music transcription application written in java☆12Jan 13, 2013Updated 13 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆19Jun 2, 2025Updated 8 months ago
- A simple repository showcasing a few LLM Evaluation strategies and leverages W&B Sweeps to optimize the LLM system.☆12Jul 11, 2023Updated 2 years ago