Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
☆89Jun 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for emergent-values
Users that are interested in emergent-values are comparing it to the libraries listed below.
- ☆54May 9, 2025Updated last year
- ☆19Jun 21, 2025Updated 11 months ago
- ☆24May 30, 2024Updated 2 years ago
- Implementation of the paper "Large Language Models as Simulated Economic Agents: What Can We Learn from Homo Silicus?"☆24May 12, 2024Updated 2 years ago
- ☆27Oct 6, 2024Updated last year
- 🤖 Implementation of Self Normalizing Networks (SNN) in PyTorch.☆13Jun 19, 2017Updated 9 years ago
- ☆10Jul 14, 2020Updated 5 years ago
- [EMNLP 2024] "Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective"☆34Jul 22, 2024Updated last year
- An optimized prime sieve in Julia☆14Dec 10, 2024Updated last year
- CS341 for Spring 2024☆11Jul 15, 2024Updated last year
- A Julia package aims to provide several extensible interfaces and reusable components for Reinforcement Learning.☆13Feb 8, 2020Updated 6 years ago
- ☆24Nov 7, 2024Updated last year
- Optimized primitives for collective multi-GPU communication☆11May 8, 2024Updated 2 years ago
- ☆15Mar 3, 2025Updated last year
- Marketplace ML experiment - training without backprop☆28Sep 9, 2025Updated 9 months ago
- MishformerLens intends to be a drop-in replacement for TransformerLens that AST patches HuggingFace Transformers rather than implementing…☆10Oct 7, 2024Updated last year
- ☆77May 31, 2023Updated 3 years ago
- Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use☆76Aug 19, 2025Updated 10 months ago
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆103Sep 21, 2023Updated 2 years ago
- Better profiling reports for Julia☆14Feb 8, 2020Updated 6 years ago
- ☆14Aug 9, 2023Updated 2 years ago
- USTC Spring 2022 "Introduction to Deep Learning" course resources☆10Aug 7, 2022Updated 3 years ago
- CodebaseMD: A VS Code extension that converts codebases into structured Markdown documentation, optimized for LLMs and agentic coding too…☆15May 22, 2025Updated last year
- A modern tool for data exploration☆16Feb 8, 2020Updated 6 years ago
- ☆21May 14, 2026Updated last month
- ☆84Mar 11, 2025Updated last year
- Collection of papers, tools, datasets for fairness of LLM☆20Oct 7, 2024Updated last year
- [ICLR 2025] On Evaluating the Durability of Safeguards for Open-Weight LLMs☆13Jun 20, 2025Updated 11 months ago
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆67Jun 9, 2025Updated last year
- mini cli search engine for your docs, knowledge bases, meeting notes, whatever. Tracking current sota approaches while being all local☆29Mar 9, 2026Updated 3 months ago
- Code to reproduce our paper on probabilistic algorithmic recourse: https://arxiv.org/abs/2006.06831☆37Dec 27, 2022Updated 3 years ago
- ☆15May 21, 2022Updated 4 years ago
- This repository contains the functions for calculating movement smoothness using different metrics.☆19Jan 10, 2019Updated 7 years ago
- Subliminal learning in LLMs: language models can transmit hidden preferences through seemingly unrelated training data.☆24Nov 9, 2025Updated 7 months ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆18Sep 9, 2022Updated 3 years ago
- A collection of useful Claude Code skills☆41Updated this week
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago