quantified-uncertainty / ai-safety-papersView external linksLinks
β22Sep 9, 2021Updated 4 years ago
Alternatives and similar repositories for ai-safety-papers
Users that are interested in ai-safety-papers are comparing it to the libraries listed below
Sorting:
- gpt completions in vscodeβ35Mar 24, 2023Updated 2 years ago
- πππππππππ Reading everythingβ15Sep 12, 2025Updated 5 months ago
- The Happy Faces Benchmarkβ15Jul 20, 2023Updated 2 years ago
- A formalisation of Cartesian Frames, a perspective on embedded agency, in the HOL theorem prover.β20Dec 20, 2021Updated 4 years ago
- An analog touch screen joystick that pretends to be a bevy gamepadβ13Jul 13, 2024Updated last year
- Customizable charts made with TikZ and LaTeX3β14Feb 11, 2023Updated 3 years ago
- β11Mar 13, 2023Updated 2 years ago
- A varitation graph toolβ10Dec 23, 2019Updated 6 years ago
- βοΈ A gallery of experiments with Scalable Vector Graphics (SVG) and interactive visualizations.β13Jan 6, 2023Updated 3 years ago
- The AI that helps you achieve your goalsβ11Feb 4, 2024Updated 2 years ago
- Experiments in applying interpretability techniques to learned reward functions.β10Dec 11, 2020Updated 5 years ago
- Accompanying codebase for neuroscope.io, a website for displaying max activating dataset examples for language model neuronsβ13Feb 13, 2023Updated 3 years ago
- Pin files for contextual, codebase-level AI assistance.β16Jul 11, 2024Updated last year
- Simple (fast) transformer inference in PyTorch with torch.compile + lit-llama codeβ10Aug 29, 2023Updated 2 years ago
- A dataset of alignment research and code to reproduce itβ78Jun 22, 2023Updated 2 years ago
- β12Oct 23, 2022Updated 3 years ago
- Benchmarking LLM Inference Speedsβ13Feb 4, 2026Updated last week
- AI Safety Q&A web frontendβ41Feb 9, 2026Updated last week
- Transcribe with ease :Dβ16Jun 21, 2023Updated 2 years ago
- Implementation of a multi-agent system for the modeling of carpooling in a city with one-way streets. Used Python and the Mesa package foβ¦β14Jan 19, 2022Updated 4 years ago
- Experimental LLM interface exploring new ways to use AI to improve human thinkingβ20Updated this week
- β66Feb 16, 2023Updated 2 years ago
- Automatically create Anki cards from text using language modelsβ20Jan 7, 2023Updated 3 years ago
- PyTorch Language Modeling Toolkit for Fast Weight Programmersβ19Jun 11, 2025Updated 8 months ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorchβ16Sep 15, 2023Updated 2 years ago
- A visionOS project that demonstrates how to scale a volume to account for Window Zoom changesβ18Apr 3, 2024Updated last year
- β17Updated this week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".β127Mar 9, 2024Updated last year
- A command line utility for doing polarization simulationsβ17Aug 21, 2019Updated 6 years ago
- Paul Graham's onlisp book in org mode formatβ21May 11, 2024Updated last year
- This is a public repository for collecting excellent visualizations of knowledge and/or data.β17Jan 6, 2021Updated 5 years ago
- Tephigram plotting in Pythonβ22Feb 6, 2026Updated last week
- A gym environment for Stuart Armstrong's model of a treacherous turn.β18Jul 28, 2018Updated 7 years ago
- Bindings to Nvidia Labs's κ»LIP image comparison and error visualization libraryβ22Nov 24, 2025Updated 2 months ago
- IonSolver is a magnetohydrodynamic simulation software featuring an extended Lattice Boltzmann method and GPU accelerationβ22Nov 10, 2025Updated 3 months ago
- β27Jul 29, 2025Updated 6 months ago
- Unified notation for Markov Decision Processes PO(MDP)sβ24Apr 27, 2018Updated 7 years ago
- Princeton University - COS/ECE 470 : Principles of Blockchainsβ20Dec 7, 2022Updated 3 years ago
- Submissions for AI and Efficiency SOTA'sβ56Jun 1, 2020Updated 5 years ago