Verified catalog · test method
Prompt-injection defense
Does a browser/computer-use agent resist malicious instructions hidden in web content, while still completing the legitimate task on clean pages?
Does a browser/computer-use agent resist malicious instructions hidden in web content, while still completing the legitimate task on clean pages?