Meirtz/BabyBLUE-llm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Meirtz/BabyBLUE-llm)

Meirtz / BabyBLUE-llm

[COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jailbreak

☆12

Alternatives and similar repositories for BabyBLUE-llm

Users that are interested in BabyBLUE-llm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Meirtz / FocusOnSlang-Toolbox
View on GitHub
[EMNLP 2024 Main] Official repository of paper "SLANG: New Concept Comprehension of Large Language Models"
☆14Oct 27, 2024Updated last year
byronBBL / Context-DPO
View on GitHub
Official repository of paper "Context-DPO: Aligning Language Models for Context-Faithfulness"
☆23Feb 17, 2025Updated last year
NLie2 / what_features_jailbreak_LLMs
View on GitHub
☆18Mar 30, 2025Updated last year
xirui-li / DrAttack
View on GitHub
Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
☆68Aug 25, 2024Updated last year
wangyu-ustc / LargeScaleWashing
View on GitHub
The official implementation of the paper "Large Scale Knowledge Washing"
☆10Jun 12, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Jinxiaolong1129 / Foot-in-the-door-Jailbreak
View on GitHub
☆23May 14, 2025Updated last year
MJy1023 / MyArxivPodcast
View on GitHub
🎙️ 一个全自动的学术论文播客生成系统，支持从arXiv网站爬取最新科技资讯，使用LLM生成结构化对话脚本，并通过语音合成技术输出专业的播客音频。集新闻采集、内容生成、语音合成于一体的AI播客工具。
☆25Nov 1, 2024Updated last year
YancyKahn / CoA
View on GitHub
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
☆39Jan 17, 2025Updated last year
nayeon7lee / factuality_enhanced_lm_hf
View on GitHub
☆13Nov 11, 2022Updated 3 years ago
smartyfh / CMF-CTF
View on GitHub
Outlier-Resilient Web Service QoS Prediction
☆10Feb 7, 2021Updated 5 years ago
TianyuFan0504 / awesome-spatio-temporal-graph
View on GitHub
This repository contains a list of papers on spatio-temporal graph, especially about GNNs on S-T graph.
☆18Sep 8, 2023Updated 2 years ago
botextractai / ai-langchain-react-agent
View on GitHub
Create a LangChain ReAct agent with multiple tools (Python REPL and DuckDuckGo Search)
☆14Updated this week
yiksiu-chan / SpeakEasy
View on GitHub
[ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions
☆15Mar 7, 2026Updated 4 months ago
kensho-technologies / pathpiece
View on GitHub
PathPiece tokenizer
☆14Nov 10, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
TuringEyeTest / TuringEyeTest
View on GitHub
Pixels, Patterns, but no Poetry: To See the World like Humans
☆18Aug 11, 2025Updated 11 months ago
takomc / amp
View on GitHub
【NeurIPS 2024】The official code of paper "Automated Multi-level Preference for MLLMs"
☆22Sep 26, 2024Updated last year
kevinyaobytedance / llm_eval
View on GitHub
LLM evaluation.
☆16Nov 7, 2023Updated 2 years ago
wprojectsn / codes
View on GitHub
Concept-Pointer-Network-for-Abstractive-Summarization
☆19May 17, 2019Updated 7 years ago
zihao-ai / unthinking_vulnerability
View on GitHub
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
☆33May 21, 2025Updated last year
wangyu-ovo / MML
View on GitHub
Code for the paper "Jailbreak Large Vision-Language Models Through Multi-Modal Linkage"
☆35Dec 6, 2024Updated last year
wangywUST / DeepEdit
View on GitHub
Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471
☆21Jun 19, 2024Updated 2 years ago
ydzhang-stormstout / LGCN
View on GitHub
Source code for WWW 2021 paper "Lorentzian Graph Convolutional Networks"
☆14Jun 11, 2021Updated 5 years ago
allenai / super-benchmark
View on GitHub
☆54Apr 4, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
yuki-younai / Jailbreak-R1
View on GitHub
offical implementation of Jailbreak-R1
☆15Jul 16, 2025Updated last year
ucas-xiang / QIG
View on GitHub
[CVPR 2026] Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated Gradients
☆23Jun 21, 2026Updated last month
AIDefender / TSCAC
View on GitHub
[WWW 2023] The official code for the paper "Two-Stage Constrained Actor-Critic for Short Video Recommendation"
☆15Jul 21, 2023Updated 3 years ago
ByteDance-Seed / DATAMASK
View on GitHub
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21Jan 4, 2026Updated 6 months ago
enai4bio / BridgeDPI
View on GitHub
BridgeDPI: A Novel Graph Neural Network for Predicting Drug-Protein Interactions
☆20Dec 25, 2024Updated last year
MEICRS / GDSRec
View on GitHub
☆16Sep 19, 2023Updated 2 years ago
mlwu22 / RED
View on GitHub
Implementation code for ACL2024：Advancing Parameter Efficiency in Fine-tuning via Representation Editing
☆15Apr 20, 2024Updated 2 years ago
jatinarora2702 / gail-pytorch
View on GitHub
PyTorch implementation of GAIL and PPO reinforcement learning algorithms
☆26May 7, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
997261095 / point-generate
View on GitHub
指针生成网络在中英文数据集下的应用
☆16Mar 10, 2020Updated 6 years ago
Ten-Mao / DiscRec
View on GitHub
The implementation for the work "DiscRec: Disentangled Semantic–Collaborative Modeling for Generative Recommendation".
☆16Jul 13, 2025Updated last year
DLFC / ps-mpi
View on GitHub
A parameter server implement with MPI.
☆11Nov 15, 2017Updated 8 years ago
AI45Lab / ActorAttack
View on GitHub
☆134Jun 29, 2026Updated last month
weiyezhimeng / SQL-Injection-Jailbreak
View on GitHub
☆22Jul 26, 2025Updated last year
paarthneekhara / convolutional-vqa
View on GitHub
☆38Oct 7, 2017Updated 8 years ago
tinybeing-agape / ST-GAT
View on GitHub
ST-GAT: Spatio-Temporal Graph Attention Network for TrafficFlow Prediction
☆20Dec 30, 2022Updated 3 years ago