Guardrails

Guardrails define safety boundaries for AI operations in SignalSmith, ensuring the AI agent operates within approved limits.

Overview

Guardrails are configurable policies that control what the AI agent can do with and without human approval. They prevent accidental data loss, unauthorized activation, and other high-impact operations.

Policy Types

Require Approval

Operations matching this policy require human approval before execution. The agent will pause and wait for an admin to approve or deny the action.

Example: Require approval before deleting any audience or triggering a sync to a production destination.

Deny

Operations matching this policy are blocked entirely. The agent cannot perform them even with approval.

Example: Deny deletion of sources or destinations via the agent.

Rate Limit

Limits how frequently the agent can perform certain operations within a time window.

Example: Limit the agent to triggering at most 5 syncs per hour.

Configuring Policies

Navigate to AI Policies in the sidebar
Click Create Policy
Select the policy type (require_approval, deny, rate_limit)
Select the resource type (audiences, syncs, destinations, etc.)
Select the operation (create, update, delete, trigger)
Click Save

Default Policies

By default, SignalSmith includes conservative guardrails:

Operation	Default Policy
Delete any resource	Require approval
Trigger sync to production	Require approval
Modify destination credentials	Deny
Create/update audiences	Allowed
Read/list operations	Allowed

Approval Workflow

When an operation requires approval:

The agent pauses and displays the pending action
A notification appears for workspace admins
An admin reviews the action details
The admin clicks Approve or Deny
If approved, the agent continues execution
If denied, the agent acknowledges and suggests alternatives

Audit Log

All AI operations are logged in the AI Audit Log, accessible at AI Audit Log in the sidebar. Each entry includes:

The agent session and message
The tool called and parameters
Whether approval was required
The approval decision (if applicable)
The result of the operation

MCP Server Overview