Computer-Use Agent Sandboxing Checklist

Computer-use agents can inspect screenshots and control a desktop with mouse and keyboard actions. That makes them useful for legacy interfaces and visual workflows, but it also means untrusted UI text can influence the agent.

Minimum sandbox

Run the agent in a dedicated virtual machine or container. Do not expose the user's real browser profile, password manager, local filesystem, SSH keys, payment sessions, or production admin consoles unless the workflow explicitly requires them and has controls.

Network controls

Use domain allowlists for narrow workflows. Block unknown downloads, local network access, and unexpected redirects where possible. For broad browsing tasks, add stricter approval and logging.

Data controls

Avoid sensitive accounts and secrets in the sandbox. If the agent must authenticate, use scoped test accounts or least-privilege service accounts with revocable credentials.

Action controls

Require human confirmation before purchases, account changes, legal acceptance, external messages, production writes, data deletion, or any task that affects another person.

Prompt-injection controls

Assume webpages and screenshots can contain adversarial instructions. Keep the user's goal visible, label untrusted content, restrict tools, and test pages that explicitly try to override the agent's instructions.

Sources

Anthropic computer use documentation: https://platform.claude.com/docs/en/agents-and-tools/tool-use/computer-use-tool

Computer-Use Agent Sandboxing Checklist

Key insights

Use cases

Limitations & trade-offs

Computer-Use Agent Sandboxing Checklist

Minimum sandbox

Network controls

Data controls

Action controls

Prompt-injection controls

Sources