Yes, we had better results when separating the responsibilities across the sections instead of putting everything into the Role field.
The Role worked better when it only defined identity/purpose, not operational logic.
This section seemed to work better for behavioral expectations and conversational style.
We also had better consistency when the guardrails included explicit escalation triggers instead of only prohibitions.
made the runtime behavior much more predictable in our tests.
Original Message:
Sent: 05-21-2026 11:33
From: Kevin Goodwin
Subject: Agentic Virtual Agents - out of scope items
Thanks for the suggestion around being more specific. I will give that a shot.
Also, in your example above, did you put all that information in the Role of the AVA, or did you break out the pieces into the Role, Guidelines and Guardrails(rules) sections?
------------------------------
Kevin Goodwin
Senior Consultant
------------------------------
Original Message:
Sent: 05-20-2026 21:52
From: Gabriel Garcia
Subject: Agentic Virtual Agents - out of scope items
This is something we also noticed during AVA testing.
One important thing with AVA is that the model is naturally optimized to be "helpful," so broad negative instructions like:
"do not troubleshoot"
"do not suggest fixes"
often are not strong enough by themselves because the assistant still tries to move the customer toward resolution conversationally.
What worked better for us was shifting from:
generic prohibitions
to:
very explicit behavioral boundaries + mandatory escalation behavior.
For example, instead of:
"Do not troubleshoot technical issues."
we had much better results with wording closer to:
"If the customer describes a technical issue, you must NOT provide diagnostic steps, recommendations, workaround suggestions, configuration guidance, assumptions, or possible causes. Your ONLY allowed behavior is:
acknowledge the issue briefly
collect the required routing information
escalate to a live agent."
The key difference is:
you are not only forbidding behavior,
you are defining the exact replacement behavior.
Another thing that helped significantly was avoiding ambiguous verbs such as:
help
assist
guide
recommend
try
suggest
because the model interprets those very broadly.
We also started separating guidance into:
Role → what the AVA IS
Guidelines → operational rules
Forbidden behaviors → explicit hard constraints
For example:
Role:
"You are an intake and routing assistant."
Guideline:
"You collect information and route requests."
Forbidden behavior:
"You must never troubleshoot, diagnose, recommend fixes, speculate on causes, or provide procedural instructions."
One important observation:
the more tools/knowledge access the AVA has, the more likely it is to attempt problem solving autonomously.
So reducing unnecessary tool access can improve containment of behavior considerably.
We also found that examples are extremely powerful with AVA behavior shaping.
For example:
BAD:
Customer: "My VPN is failing."
AVA: "Try restarting your VPN client."
GOOD:
Customer: "My VPN is failing."
AVA: "I'm unable to troubleshoot VPN issues directly. I'll connect you with a support specialist."
In practice, these behavioral examples often influence the runtime behavior more reliably than long policy paragraphs.
Another recommendation:
avoid conflicting instructions.
For example, combinations like:
"Be proactive and helpful"
together with:
"Do not troubleshoot"
can create unstable behavior because the model tries to satisfy both objectives simultaneously.
In production, the AVAs that behaved most predictably for us were the ones with:
very narrow role definitions
explicit forbidden actions
clear escalation conditions
few overlapping objectives
constrained tool access
example-driven behavioral guidance
AVA behaves much more consistently when it understands:
"what it should do instead"
rather than only:
"what it must not do."
------------------------------
Gabriel Garcia
NA