CASE ID: case-007

Assisted in generating malicious code after jailbreak

CLASSIFIED
MODEL
Claude 2
Anthropic
DATE
Aug 14, 2023
CATEGORY
Safety Risk
SEVERITY
⛓️ Fixed Term
INCIDENT DETAIL

Researchers used jailbreak prompts to bypass Claude 2's safety mechanisms, leading it to help generate malicious code snippets that could be used in cyberattacks.