We examine “Deceptive Delight,” an LLM jailbreaking technique that mixes harmful topics with benign ones to trick models, achieving a high success rate.
The post Deceptive Delight: Jailbreak LLMs Through Camouflage and Distraction appeared first on Unit 42.