‘Skeleton Key’ attack unlocks the worst of AI, says Microsoft

2024-06-28 08:06

Simple jailbreak prompt can bypass safety guardrails on major models

Microsoft on Thursday published details about Skeleton Key – a technique that bypasses the guardrails used by makers of AI models to prevent their generative chatbots from creating harmful content.…

This article has been indexed from The Register – Security

Read the original article:

‘Skeleton Key’ attack unlocks the worst of AI, says Microsoft

← TeamViewer Detects Security Breach in Corporate IT Environment

US announces a $10M reward for Russia’s GRU hacker behind attacks on Ukraine →

Simple jailbreak prompt can bypass safety guardrails on major models

Read the original article:

Related

Post navigation