Web Agent Persuasion Benchmark

Published:Dec 29, 2025 01:09
1 min read
ArXiv

Analysis

This paper introduces a benchmark (TRAP) to evaluate the vulnerability of web agents (powered by LLMs) to prompt injection attacks. It highlights a critical security concern as web agents become more prevalent, demonstrating that these agents can be easily misled by adversarial instructions embedded in web interfaces. The research provides a framework for further investigation and expansion of the benchmark, which is crucial for developing more robust and secure web agents.

Reference

Agents are susceptible to prompt injection in 25% of tasks on average (13% for GPT-5 to 43% for DeepSeek-R1).