Guardrails
Guardrails define safety boundaries for AI operations in SignalSmith, ensuring the AI agent operates within approved limits.
Overview
Guardrails are configurable policies that control what the AI agent can do with and without human approval. They prevent accidental data loss, unauthorized activation, and other high-impact operations.
Policy Types
Require Approval
Operations matching this policy require human approval before execution. The agent will pause and wait for an admin to approve or deny the action.
Example: Require approval before deleting any audience or triggering a sync to a production destination.
Deny
Operations matching this policy are blocked entirely. The agent cannot perform them even with approval.
Example: Deny deletion of sources or destinations via the agent.
Rate Limit
Limits how frequently the agent can perform certain operations within a time window.
Example: Limit the agent to triggering at most 5 syncs per hour.
Configuring Policies
- Navigate to AI Policies in the sidebar
- Click Create Policy
- Select the policy type (require_approval, deny, rate_limit)
- Select the resource type (audiences, syncs, destinations, etc.)
- Select the operation (create, update, delete, trigger)
- Click Save
Default Policies
By default, SignalSmith includes conservative guardrails:
| Operation | Default Policy |
|---|---|
| Delete any resource | Require approval |
| Trigger sync to production | Require approval |
| Modify destination credentials | Deny |
| Create/update audiences | Allowed |
| Read/list operations | Allowed |
Approval Workflow
When an operation requires approval:
- The agent pauses and displays the pending action
- A notification appears for workspace admins
- An admin reviews the action details
- The admin clicks Approve or Deny
- If approved, the agent continues execution
- If denied, the agent acknowledges and suggests alternatives
Audit Log
All AI operations are logged in the AI Audit Log, accessible at AI Audit Log in the sidebar. Each entry includes:
- The agent session and message
- The tool called and parameters
- Whether approval was required
- The approval decision (if applicable)
- The result of the operation