Formalizing and Benchmarking Prompt Injection Attacks and Defenses

Metadane

Autorzy: Yupei Liu, Yuqi Jia, Runpeng Geng, Jinyuan Jia, Neil Zhenqiang Gong
Rok: 2024
Zrodlo: 33rd USENIX Security Symposium (USENIX Security 24), pages 1831-1847
Status: #to-read
Kategoria: Security

Notatki

Wyekstrahowane z: hasan-llm-phishing-detection-2025

Praca formalizuje i benchmarkuje ataki prompt injection i mechanizmy obrony. Pokazuje ze obecne LLM fundamentalnie nie rozrozniaja miedzy legalnymi instrukcjami a zlosliwym inputem. Kluczowa dla zrozumienia granic bezpieczenstwa LLM-based systemow detekcji phishingu.

Research

Przeglądaj

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

Formalizing and Benchmarking Prompt Injection Attacks and Defenses

Metadane

Notatki