Computer-Use Agent Sandboxing Checklist
Computer-use agents can inspect screenshots and control a desktop with mouse and keyboard actions. That makes them useful for legacy interfaces and visual workflows, but it also means untrusted UI text can influence the agent.
Minimum sandbox
Run the agent in a dedicated virtual machine or container. Do not expose the user's real browser profile, password manager, local filesystem, SSH keys, payment sessions, or production admin consoles unless the workflow explicitly requires them and has controls.
Network controls
Use domain allowlists for narrow workflows. Block unknown downloads, local network access, and unexpected redirects where possible. For broad browsing tasks, add stricter approval and logging.
Data controls
Avoid sensitive accounts and secrets in the sandbox. If the agent must authenticate, use scoped test accounts or least-privilege service accounts with revocable credentials.
Action controls
Require human confirmation before purchases, account changes, legal acceptance, external messages, production writes, data deletion, or any task that affects another person.
Prompt-injection controls
Assume webpages and screenshots can contain adversarial instructions. Keep the user's goal visible, label untrusted content, restrict tools, and test pages that explicitly try to override the agent's instructions.
Sources
- Anthropic computer use documentation: https://platform.claude.com/docs/en/agents-and-tools/tool-use/computer-use-tool