☆15Mar 22, 2024Updated last year
Alternatives and similar repositories for Fake-Alignment
Users who are interested in Fake-Alignment are comparing it to the libraries listed below
- ☆45Jun 19, 2025Updated 9 months ago
- Flames is a highly adversarial benchmark in Chinese for LLM's harmlessness evaluation developed by Shanghai AI Lab and Fudan NLP Group.☆63May 21, 2024Updated last year
- [EMNLP 2024] "ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models"☆26Jun 24, 2024Updated last year
- ☆16May 16, 2025Updated 10 months ago
- ☆13Jun 17, 2024Updated last year
- ☆39Jun 25, 2025Updated 8 months ago
- Open-source red teaming framework for MLLMs with 42+ attack methods☆233Updated this week
- Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks☆34Feb 24, 2026Updated 3 weeks ago
- An active inference model of Lacanian psychoanalysis☆16Jun 7, 2025Updated 9 months ago
- Dive-into-LLMs Tutorial for Beginners☆12May 14, 2024Updated last year
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- ☆14Oct 7, 2022Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆38Jan 26, 2025Updated last year
- A project (LLM Sentinel) that showcases NVIDIA's NeMo-Guardrails and LangChain for improving LLM safety☆12Jan 22, 2025Updated last year
- Implementation of Direct Preference Optimization☆17Jul 17, 2023Updated 2 years ago
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"☆12Aug 16, 2022Updated 3 years ago
- Code for Words That Make Language Models Perceive☆37Oct 14, 2025Updated 5 months ago
- Faiss benchmark suite☆17Mar 29, 2024Updated last year
- Data and code accompanying the paper "As Little as Possible, as Much as Necessary: Detecting Over- and Undertranslations with Contrastive…☆22Apr 13, 2023Updated 2 years ago
- This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction☆15Nov 4, 2024Updated last year
- Qualifying exam preparation☆17May 7, 2025Updated 10 months ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"☆30Jul 8, 2020Updated 5 years ago
- Code for the paper: Fully Exploiting Every Real Sample: SuperPixel Sample Gradient Model Stealing (CVPR 2024)☆19Mar 12, 2024Updated 2 years ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆20Sep 5, 2024Updated last year
- Watermarking LLM papers up-to-date☆11Dec 17, 2023Updated 2 years ago
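- Topic: Computational Cognitive Science. This repository originated as the IA003 course-completion BP; it is still in its infancy and its content structure has yet to be finalized. The next major update is expected in three or four years.☆17May 22, 2019Updated 6 years ago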
- Neural Networks exam project. Machine learning algorithm: implementation of FGSM and JSMA attacks by Goodfellow and Papernot.☆16Jan 13, 2026Updated 2 months ago
- ☆14Feb 26, 2025Updated last year
- [AAAI2024] Exploring Diverse Representations for Open Set Recognition☆33Jun 16, 2024Updated last year
- A user-friendly and efficient knowledge distillation framework for LLMs.☆53Updated this week
- Fleming-R1: Toward Expert-Level Medical Reasoning via Reinforcement Learning☆30Sep 29, 2025Updated 5 months ago
- ☆12Jun 11, 2025Updated 9 months ago
- Build and visualize Word2Vec model on Amazon health and personal care reviews corpus☆24Sep 10, 2017Updated 8 years ago
- The evaluation code for A Safety Report on GPT-5.2, Gemini 3 Pro, Qwen3-VL, Grok 4.1 Fast, Nano Banana Pro, and Seedream 4.5☆53Jan 18, 2026Updated 2 months ago
- Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models☆18Oct 16, 2024Updated last year
- An implementation of Wang et al.'s Signed Network Embedding in Social Media in PyTorch☆12Dec 24, 2017Updated 8 years ago
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to v…☆15Apr 14, 2025Updated 11 months ago
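- This project uses deep learning to detect 3D human pose in real time and to predict future human motion from it. The backend uses the mmpose framework and multiprocessing for fast prediction, and a Hololens2 mixed-reality head-mounted display shows the motion, giving real-time capture, prediction, and display.☆12Oct 30, 2023Updated 2 years ago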
- ☆10May 9, 2016Updated 9 years ago