一句话摘要
Prompt shields 新增 Spotlighting 功能,通过标记文档信任级别防御间接注入攻击。
详细描述
Spotlighting enhances protection against indirect prompt injection attacks by tagging input documents with special formatting to indicate lower trust to the model.
原文摘录
Spotlighting is a sub-feature of prompt shields that enhances protection against indirect (embedded document) attacks by tagging input documents with special formatting to indicate lower trust to the model.