ai-safety-graph/AISafetyGraph



AI Safety Graph

☆18

Alternatives and similar repositories for AISafetyGraph

Users who are interested in AISafetyGraph are comparing it to the repositories listed below.


CentreSecuriteIA / BELLS
Benchmarks for the Evaluation of LLM Supervision
☆35 · Jan 19, 2026 · Updated 6 months ago
forpublicai / publicai.network
The website of the Public AI Network
☆25 · Jul 13, 2026 · Updated last week
collect-intel / llm-judge-bias-suite
☆27 · May 20, 2025 · Updated last year
felixbinder / introspection_self_prediction
Code for experiments on self-prediction as a way to measure introspection in LLMs
☆16 · Dec 10, 2024 · Updated last year
HallerPatrick / pecc
[LREC-Coling 2024] PECC: Problem Extraction and Coding Challenges
☆14 · May 30, 2024 · Updated 2 years ago
Helloworld10011 / Adversarial-Reasoning
A new algorithm that formulates jailbreaking as a reasoning problem.
☆26 · Jul 2, 2025 · Updated last year
Cadenza-Labs / sleeper-agents
☆15 · Jul 12, 2024 · Updated 2 years ago
DFRobot / DFRobot_AS7341
We live in a colorful world, but how much do you really know about color? Your eyes may deceive you, while the sensors don’t lie. This AS7…
☆12 · Jan 20, 2022 · Updated 4 years ago
ShiboYao / LatentSemanticImputation
A method to combine entity representation defined in different spaces.
☆11 · Dec 5, 2021 · Updated 4 years ago
aclu-national / tracking-ll144-bias-audits
A crowd-sourced public tracker of bias audits of automated employment decision tools (AEDTs) released by employers related to NYC's Local…
☆19 · Nov 5, 2024 · Updated last year
lpdkt / arch
My Arch Linux dotfiles
☆16 · Jan 22, 2025 · Updated last year
apartresearch / readingwhatwecan
📚📚📚📚📚📚📚📚📚 Reading everything
☆16 · Mar 11, 2026 · Updated 4 months ago
mapmeld / hindi-bert
Hindi NLP work
☆14 · Apr 4, 2022 · Updated 4 years ago
zsviczian / obsidian-codeeditor
Support JS and CSS file editing in Obsidian.
☆26 · Sep 10, 2021 · Updated 4 years ago
segyges / ideas-in-ai
Important ideas
☆18 · Oct 13, 2025 · Updated 9 months ago
redwoodresearch / Text-Steganography-Benchmark
Code for Preventing Language Models From Hiding Their Reasoning, which evaluates defenses against LLM steganography.
☆25 · Jan 26, 2024 · Updated 2 years ago
ProjectTech4DevAI / kaapi-backend
Responsible AI for the development sector
☆18 · Updated this week
OpenTermsArchive / genai-contrib-versions
Document versions for the most popular generative AI services.
☆16 · Updated this week
cerai-iitm / AIEvaluationTool
A comprehensive evaluation tool for verifying conversational AI applications.
☆16 · Updated this week
Aitslab / corona
Data and tools related to coronavirus research
☆11 · Feb 24, 2025 · Updated last year
EffiSciencesResearch / ML4G-2.0
Improved version of the technical workshops for the 10-day ML4G camp on safety of AI systems
☆20 · May 22, 2026 · Updated last month
rgreenblatt / control-evaluations
☆25 · May 25, 2024 · Updated 2 years ago
XuchanBao / behavioral-self-awareness
☆37 · Feb 20, 2025 · Updated last year
dsakovych / deep-learning-coursera
☆19 · Sep 17, 2018 · Updated 7 years ago
CIRISAI / CIRISAgent
☆45 · Updated this week
alignedai / HappyFaces
The Happy Faces Benchmark
☆15 · Jul 20, 2023 · Updated 3 years ago
aschern / semeval2020_task11
SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles
☆34 · Sep 30, 2022 · Updated 3 years ago
AsaCooperStickland / situational-awareness-evals
Measuring the situational awareness of language models
☆41 · Feb 12, 2024 · Updated 2 years ago
timfduffy / syco-bench
Benchmark to estimate model sycophancy
☆33 · Nov 30, 2025 · Updated 7 months ago
Aatrox103 / SAP
☆49 · May 9, 2024 · Updated 2 years ago
dhimmel / clintrials
Cataloging pharmacotherapies in clinical trials from ClinicalTrials.gov
☆26 · Jun 17, 2016 · Updated 10 years ago
bluedotimpact / bluedot
✨ Monorepo containing most of BlueDot Impact's custom software.
☆28 · Updated this week
orsee / orsee
Online Recruitment System for Economic Experiments
☆38 · Jun 18, 2026 · Updated last month
leap-laboratories / PIZZA
An attribution library for LLMs
☆46 · Sep 17, 2024 · Updated last year
cthoyt / pystow
👜 Easily pick a place to store data for your Python code.
☆42 · Jul 4, 2026 · Updated 2 weeks ago
DataCTE / Camel-Coder
Camel-Coder: Collaborative task completion with multiple agents. Role-based prompts, intervention mechanism, and thoughtful suggestions
☆35 · Jul 3, 2023 · Updated 3 years ago
erwanlemerrer / awesome-audit-algorithms
A curated list of algorithms and papers for auditing black-box algorithms.
☆120 · Jun 17, 2026 · Updated last month
moirage / alignment-research-dataset
A dataset of alignment research and code to reproduce it
☆80 · Jun 22, 2023 · Updated 3 years ago
METR / hawk
Run Inspect AI evals in the cloud
☆32 · Updated this week