☆17Mar 22, 2024Updated 2 years ago
Alternatives and similar repositories for Fake-Alignment
Users who are interested in Fake-Alignment are comparing it to the libraries listed below.
- ☆46Jun 19, 2025Updated last year
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆65May 21, 2024Updated 2 years ago
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆27Jun 24, 2024Updated 2 years ago
- ☆16May 16, 2025Updated last year
- ☆13Jun 17, 2024Updated 2 years ago
- ☆23Apr 2, 2026Updated 2 months ago
- Open-source red teaming framework for MLLMs with 42+ attack methods☆257Mar 25, 2026Updated 3 months ago
- ☆48Jun 25, 2025Updated last year
- An active inference model of Lacanian psychoanalysis☆18Jun 7, 2025Updated last year
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- Dive-into-LLMs Tutorial for Beginners☆26May 14, 2024Updated 2 years ago
- ☆14Oct 7, 2022Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆42Jan 26, 2025Updated last year
- A project (LLM Sentinel) that showcases NVIDIA's NeMo-Guardrails and LangChain for improving LLM safety☆13Jan 22, 2025Updated last year
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- ☆13Jun 13, 2025Updated last year
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆79May 7, 2026Updated last month
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- Faiss benchmark suite☆17Mar 29, 2024Updated 2 years ago
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 3 years ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆16Nov 4, 2024Updated last year
- Qualifying Exam Preparing☆17May 7, 2025Updated last year
- Code for Words That Make Language Models Perceive☆43Oct 14, 2025Updated 8 months ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"☆31Jul 8, 2020Updated 5 years ago
- The code of the paper: Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing (CVPR 2024)☆19Mar 12, 2024Updated 2 years ago
- Watermarking LLM papers up-to-date☆12Dec 17, 2023Updated 2 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆20Sep 5, 2024Updated last year
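- Topic: Computational Cognitive Science. This repository originated as the IA003 final BP; it is still in its infancy and its content layout has yet to be formalized. The next large-scale update is expected in three or four years.☆17May 22, 2019Updated 7 years ago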
- Neural Networks exam project. Machine learning algorithm: implementation of FGSM and JSMA attacks by Goodfellow and Papernot.☆16Jan 13, 2026Updated 5 months ago
- ☆15Feb 26, 2025Updated last year
- [AAAI2024] Exploring Diverse Representations for Open Set Recognition☆34Jun 16, 2024Updated 2 years ago
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆31Sep 29, 2025Updated 9 months ago
- The Ever-Evolving Science Exam☆34Jan 18, 2026Updated 5 months ago
- ☆11Jun 11, 2025Updated last year
- Build and visualize Word2Vec model on Amazon health and personal care reviews corpus☆24Sep 10, 2017Updated 8 years ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 5 months ago
- Repository for the Paper: Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Inj…☆19Apr 17, 2026Updated 2 months ago
- An implementation of Wang et al.'s Signed Network Embedding in Social Media in PyTorch☆12Dec 24, 2017Updated 8 years ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated last year