Mostafa,
you're right that the real challenge is forwarding the RTP stream to the ASR engine. TTS is not so complicated, because you could synthesize the text, transform it into a .wav file and play it back. It is not very efficient but, in many circumstances, it could work. The really big problem is ASR: if you save the customer's utterance to a file and process it with ASR afterwards, you could end up with a definitely bad experience.
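As a rough sketch of the file-based TTS approach, assuming the TTS engine hands back raw 16-bit linear PCM at 8 kHz mono (an assumption: the real format depends on the engine and on what MCP can play), the audio could be wrapped into a .wav with the Python standard library. The function name and defaults here are only illustrative:

```python
import io
import wave


def pcm_to_wav_bytes(pcm: bytes, sample_rate: int = 8000,
                     channels: int = 1, sample_width: int = 2) -> bytes:
    """Wrap raw linear PCM samples in a WAV container.

    Assumed input: little-endian signed PCM, `sample_width` bytes per sample.
    """
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)  # header sizes are patched when the file closes
    return buf.getvalue()


# Usage: one second of silence, ready to be saved as a prompt file.
wav_bytes = pcm_to_wav_bytes(b"\x00\x00" * 8000)
```

The resulting bytes could then be written to a location the voice application can play a prompt from; how that location is exposed to MCP is outside this sketch.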
The only alternative you have is to develop an MRCP interface that invokes the ASR (and TTS) services through the interface you have (WebSocket). It's not an easy task, but you can do it, especially since there should be some free MRCP code available on the internet.
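Whatever shape such a bridge takes, one small piece of it would be pulling the audio payload out of each incoming RTP packet before pushing it to the WebSocket side. A minimal sketch of that step, following the fixed RTP header layout from RFC 3550 (the names `RtpPacket` and `parse_rtp` are hypothetical, and the MRCP signalling and WebSocket parts are not covered):

```python
import struct
from typing import NamedTuple


class RtpPacket(NamedTuple):
    payload_type: int
    sequence: int
    timestamp: int
    ssrc: int
    payload: bytes


def parse_rtp(packet: bytes) -> RtpPacket:
    """Split one RTP packet into header fields and audio payload."""
    if len(packet) < 12:
        raise ValueError("packet shorter than the fixed RTP header")
    b0, b1, seq, ts, ssrc = struct.unpack("!BBHII", packet[:12])
    if b0 >> 6 != 2:
        raise ValueError("not RTP version 2")
    has_padding = bool(b0 & 0x20)
    has_extension = bool(b0 & 0x10)
    csrc_count = b0 & 0x0F
    offset = 12 + 4 * csrc_count  # skip the optional CSRC list
    if has_extension:
        if len(packet) < offset + 4:
            raise ValueError("truncated header extension")
        # Extension length is counted in 32-bit words after its 4-byte header.
        ext_words = struct.unpack("!H", packet[offset + 2:offset + 4])[0]
        offset += 4 + 4 * ext_words
    end = len(packet)
    if has_padding:
        end -= packet[-1]  # last byte holds the number of padding bytes
    if end < offset:
        raise ValueError("inconsistent RTP lengths")
    return RtpPacket(b1 & 0x7F, seq, ts, ssrc, packet[offset:end])


# Usage: a hand-built packet with 160 bytes of payload (20 ms at 8 kHz).
demo = struct.pack("!BBHII", 0x80, 0, 1, 160, 0x1234) + b"\xff" * 160
parsed = parse_rtp(demo)
```

Payload type 0 is G.711 mu-law (PCMU) in the standard RTP audio profile, so the payload would typically have to be sent in, or converted to, whatever audio format the ASR service accepts.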
But, in the end, the most important thing: you're talking about IBM ASR/TTS. As far as I know, they should provide an MRCP interface and, maybe, also a gRPC interface (you need to upgrade MCP to v9 to use that). We should start a trial with them very soon, but they told me they have already integrated GVP through an MRCP interface. I would suggest double-checking this with them, because it could be the easiest solution.
Hope this helps you a bit.
------------------------------
Giuliano Ferri
Indra Italia spa
------------------------------
Original Message:
Sent: 07-24-2021 12:10
From: Mostafa Awad
Subject: VoiceBot Development With IBM TTS/ASR via WebSocket APIs
Dear all,
Does anybody know the best practice for developing a voicebot that integrates with IBM TTS/ASR? In normal TTS/ASR integrations, GVP handles the integration via MRCPv1/MRCPv2 from MCP. The challenge here is that the TTS/ASR engines provide their integration methods via WebSocket, and I can't find any clue about this in the Genesys docs.
Is there a way to do this from Composer/GIA and, if so, how could it be achieved? If we decide to build a custom application that acts as middleware between MCP and IBM TTS/ASR, there will be two challenges: how to integrate with MCP to retrieve the customer's RTP voice stream for the current call and send it to IBM ASR, and how to send the TTS RTP stream back to MCP so it can play it back to the customer.
Appreciate your help in advance.
#GenesysEngageDev
#Implementation
#Integrations
#Self-Service
------------------------------
Mostafa Awad
IST Integration Services And Technologies Co.
------------------------------