AI45Lab/Fake-Alignment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AI45Lab/Fake-Alignment)

AI45Lab / Fake-Alignment

☆17

Alternatives and similar repositories for Fake-Alignment

Users that are interested in Fake-Alignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AI45Lab / MLLMGuard
View on GitHub
☆46Jun 19, 2025Updated last year
AI45Lab / ReflectionBench
View on GitHub
[ICML 2025] ReflectionBench: Evaluating Epistemic Agency in Large Language Models
☆21Jun 24, 2025Updated last year
EVIGBYEN / Mousetrap
View on GitHub
☆17Jul 3, 2025Updated last year
AI45Lab / Flames
View on GitHub
Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.
☆68May 21, 2024Updated 2 years ago
haidequanbu / ESC-Eval
View on GitHub
[EMNLP 2024] ”ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models“
☆27Jun 24, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Carol-gutianle / Awesome-llm-unlearning
View on GitHub
☆13Jun 17, 2024Updated 2 years ago
Carol-gutianle / MEOW
View on GitHub
☆16May 16, 2025Updated last year
AI45Lab / OpenRT
View on GitHub
Open-source red teaming framework for MLLMs with 42+ attack methods
☆258Updated this week
KexinHUANG19 / InstructTTSEval
View on GitHub
☆51Jun 25, 2025Updated last year
ZrW00 / GraCeFul
View on GitHub
The code implementation of GraCeFul (Accepted in COLING 2025)
☆13Jan 27, 2025Updated last year
QwenLM / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆40Jan 26, 2025Updated last year
conorheins / bayesian-mechanics-sdes
View on GitHub
☆14Oct 7, 2022Updated 3 years ago
aidatatools / LLM_Sentinel
View on GitHub
A project (LLM Sentinel) that showcases NVIDIA's NeMo-Guardrails and LangChain for improving LLM safety
☆13Jan 22, 2025Updated last year
DigitalTwinMind / ActiveInferenceLacan
View on GitHub
An active inference model of Lacanian psychoanalysis
☆18Jun 7, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
okarthikb / DPO
View on GitHub
Implementation of Direct Preference Optimization
☆17Jul 17, 2023Updated 3 years ago
yzhang1918 / cikm2022rudi
View on GitHub
Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"
☆12Aug 16, 2022Updated 3 years ago
zhou-yuxin / faiss-benchmark
View on GitHub
Faiss benchmark suit
☆17Mar 29, 2024Updated 2 years ago
hrwise-nlp / AppBench
View on GitHub
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16Nov 4, 2024Updated last year
s106916 / graphrag
View on GitHub
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆20Sep 5, 2024Updated last year
zhaojunGUO / Awesome-LLM-Watermark
View on GitHub
Watermarking LLM papers up-to-date
☆12Dec 17, 2023Updated 2 years ago
ZurichNLP / coverage-contrastive-conditioning
View on GitHub
Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…
☆22Apr 13, 2023Updated 3 years ago
AI45Lab / AgentDoG
View on GitHub
A Diagnostic Guardrail Framework for AI Agent Safety and Security
☆669Jun 8, 2026Updated last month
RobertoFalconi / BlackBoxAttackDNN
View on GitHub
Neural Networks exam project. Machine learning algorithm: implementation of FGSM and JSMA attacks by Goodfellow and Papernot.
☆16Jan 13, 2026Updated 6 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Newbeeer / V-information
View on GitHub
Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"
☆31Jul 8, 2020Updated 6 years ago
xdpwjj / IA003CoCoSci
View on GitHub
主题：计算认知科学(Computational Cognitive Science)。此仓库诞生背景为IA003结业BP，仍处于萌芽期，内容设置有待转正。下一次大规模更新估计在三四年之后。
☆17May 22, 2019Updated 7 years ago
zyl123456aB / SPSG_attack
View on GitHub
The code of paper: Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing (CVPR 2024))
☆19Mar 12, 2024Updated 2 years ago
navidmdn / ESConv-SRA
View on GitHub
Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…
☆15Apr 14, 2025Updated last year
Sadcardation / ImageProtector
View on GitHub
Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…
☆19Apr 17, 2026Updated 3 months ago
Leey21 / CipherBank
View on GitHub
☆13Jun 13, 2025Updated last year
ai-agents-2030 / ViMo
View on GitHub
☆26Apr 2, 2026Updated 3 months ago
XSafeAI / AI-safety-report
View on GitHub
The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5
☆53Jan 18, 2026Updated 6 months ago
sinkers-lan / HoloPredictPose
View on GitHub
本项目利用深度学习技术，实时检测人体3D姿态，并基于此预测未来人体动作。采用mmpose框架与多进程技术实现后端快速预测，利用混合现实Hololens2头戴显示器显示人物动作，做到实时抓取，实时预测，实时显示。
☆12Oct 30, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
InzamamRahaman / SiNE
View on GitHub
An implementation of Wang et al.'s Signed Network Embedding in Social Media in PyTorch
☆12Dec 24, 2017Updated 8 years ago
Wuzheng02 / OS-Kairos
View on GitHub
[ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"
☆21Jun 19, 2025Updated last year
MiguelSteph / word2vec-with-gensim
View on GitHub
Build and visualize Word2Vec model on Amazon health and personal care reviews corpus
☆24Sep 10, 2017Updated 8 years ago
Vanixxz / MEDAF
View on GitHub
[AAAI2024] Exploring Diverse Representations for Open Set Recognition
☆35Jun 16, 2024Updated 2 years ago
krystalan / RAGtrans
View on GitHub
[EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledge
☆15Sep 4, 2025Updated 10 months ago
markheimann / Signed-Network-Analysis
View on GitHub
☆10May 9, 2016Updated 10 years ago
cooelf / dive-into-llms
View on GitHub
Dive-into-LLMs Tutorial for Beginners
☆26May 14, 2024Updated 2 years ago