Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
☆90Feb 27, 2025Updated last year
Alternatives and similar repositories for emergent-values
Users that are interested in emergent-values are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This project contains the original white paper for Language Construct Modeling (LCM) v1.13, authored by Vincent Shing Hin Chong. It intro…☆15Jul 23, 2025Updated 9 months ago
- ☆19Jun 21, 2025Updated 10 months ago
- ☆12Jul 21, 2024Updated last year
- Implementation of the paper "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?"☆23May 12, 2024Updated last year
- A collection of different ways to implement accessing and modifying internal model activations for LLMs☆24Oct 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆27Oct 6, 2024Updated last year
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 8 years ago
- A curated list of awesome resources, libraries, frameworks, and tools for multi-agent systems (MAS) research and development.☆32Feb 17, 2025Updated last year
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆33Jul 22, 2024Updated last year
- Utility for creating advisory pidfiles (lock files)☆12May 31, 2023Updated 2 years ago
- ☆19Mar 25, 2025Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆24Jun 22, 2022Updated 3 years ago
- A Julia package aims to provide several extensible interfaces and reusable components for Reinforcement Learning.☆13Feb 8, 2020Updated 6 years ago
- ☆15Mar 3, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Marketplace ML experiment - training without backprop☆27Sep 9, 2025Updated 8 months ago
- Minimax Estimation of Conditional Moment Models☆32Jun 12, 2023Updated 2 years ago
- ☆77May 31, 2023Updated 2 years ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use☆72Aug 19, 2025Updated 8 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆103Sep 21, 2023Updated 2 years ago
- Better profiling reports for Julia☆14Feb 8, 2020Updated 6 years ago
- ☆14Aug 9, 2023Updated 2 years ago
- [NeurIPS XAIA & Springer] Code and notebooks to paper "A Fresh Look at Sanity Checks for Saliency Maps"☆25Jul 12, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- CodebaseMD: A VS Code extension that converts codebases into structured Markdown documentation, optimized for LLMs and agentic coding too…☆15May 22, 2025Updated 11 months ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago
- Subliminal learning in LLMs: language models can transmit hidden preferences through seemingly unrelated training data.☆22Nov 9, 2025Updated 6 months ago
- A modern tool for data exploration☆16Feb 8, 2020Updated 6 years ago
- ☆20May 3, 2025Updated last year
- Collection of papers, tools, datasets for fairness of LLM☆19Oct 7, 2024Updated last year
- ☆12Nov 15, 2022Updated 3 years ago
- [NeurIPS 2024] Large Language Model Unlearning via Embedding-Corrupted Prompts☆40Sep 26, 2024Updated last year
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 10 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ☆12Apr 24, 2024Updated 2 years ago
- mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local☆28Mar 9, 2026Updated 2 months ago
- ☆15May 21, 2022Updated 3 years ago
- Download and Transcribe X Spaces☆11Nov 16, 2024Updated last year
- Official repository for the SAI Simulator, a new tool to explore the effects of stratospheric aerosol injection on the climate.☆18Mar 19, 2026Updated last month
- ☆10Mar 26, 2024Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago