G

Guardrails

The boundaries you set on AI agents to control what they can discuss, what actions they can take, and when they must escalate to a human.

What it is

Guardrails are the safety constraints configured on Agentforce agents. They include topic restrictions (what the agent can and cannot discuss), action permissions (what operations it can perform), escalation rules (when to hand off to a human), and output filters (what it cannot say). Guardrails are enforced by the Atlas Reasoning Engine before any action is taken.

Why it matters

Without guardrails, AI agents are unpredictable. With them, agents operate within defined boundaries — which is what enterprise buyers, compliance teams, and regulators require. Guardrails are not limitations; they are what make AI deployable in production.

Key components

  • Topic restrictions
  • Action permissions
  • Escalation rules
  • Output filters
  • PII masking

How it connects

You configure guardrails in Agent Builder as part of topic and action definitions. The Trust Layer adds additional guardrails at the platform level.

Good to know

Guardrails should be tested with adversarial prompts — try to make the agent break its own rules. If you can break it in testing, customers will break it in production.

Need Help Implementing This?

We specialize in putting AI and Agentforce to work for Salesforce customers. Let's talk about your use case.

Book Intro Call