Hi @Jose Ruiz,
I fail to see how dictionary management would help in any meaningful way with regard to the stated objective of this thread. That objective is accurately collecting an email address from a caller. At present, I cannot do this in a reliable way with any of the Genesys STT engines or our own integration with Azure STT. I have tried both regular bot flows as well as using a exorbitantly expensive virtual agent flows with a fully-described AI slot, as noted in my original post.
I feel like you either skimmed my post or just blindly replied to my "STT isn't working well" sentiment, without looking at the specific context. This has literally nothing to do with Virtual Supervisor/Supervisor Copilot.
As an aside, dictionary management, at present, is a painfully manual, slow process, so we're not really leveraging it. Feel free to prioritize this idea if you want to help get adoption of it. I opened it in March, and it has 32 upvotes at this point. https://genesyscloud.ideas.aha.io/ideas/DARSTA-I-340
------------------------------
Paul McGurn
Senior Manager, Telecom & DevOps
Persistent Systems
------------------------------
Original Message:
Sent: 05-29-2025 20:21
From: Jose Ruiz
Subject: Voice virtual agent to collect email address - impossible?
Hello @Paul McGurn
Thank you for the thoughtful and detailed feedback - you've highlighted a crucial aspect of delivering high-performing AI-driven features like Virtual Supervisor and Supervisor Copilot: high-quality transcription. We completely agree that accurate speech-to-text output is foundational to ensuring reliable interaction scoring, sentiment analysis, real-time translation, and AI-generated summaries.
To address the specific challenges you've raised, I'd like to share more about the Custom Dictionary feature, which is designed to improve transcription accuracy by tuning the model to recognize key business-specific terms - including organizational names, product terminology, and industry jargon.
With Custom Dictionary, you can:
Add custom phrases (e.g., "Thank you for calling Presbyterian") to help the model understand and prioritize relevant business language.
Improve recognition of difficult or misheard terms by adding "sounds like" alternatives. This is particularly helpful when commonly misrecognized phrases yield similar-sounding but incorrect words, as in your example.
We recommend starting with boost-only entries for important terms and then iteratively enhancing them with "sounds like" variants for any recurring misrecognitions - as long as they don't conflict with valid words in the language.
This feature effectively gives you a way to embed business context directly into the transcription model, improving accuracy over time, especially in areas most important to your organization.
I'd be happy to connect directly to review your current transcription challenges and provide tailored recommendations on Custom Dictionary usage and other tuning options. Together, we can ensure that transcription-dependent features are delivering maximum value and reliability for your team.
Please don't hesitate to reach out!
------------------------------
Jose Ruiz
Genesys - Employees
Product Manager
jose.ruiz@genesys.com
------------------------------