The only way to do that is to use a bunch of tasks.
Task 1: Play Audio for the English part, Set Language to Spanish, then jump to
Task 2: Play Audio for the Spanish part, Set Language to Mandarin, then jump to
Task 3: Play Audio for the Mandarin part, . . .
At the end you probably want to switch back to English, then jump to a task with a Collect Input action.
The downside of doing this (other than being complicated) is that if the caller presses a digit while the flow is still going through the list of actions, the Collect Input action will not hear it because it hasn't started listening yet.
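To make the timing problem concrete, here is a rough toy model in Python. It is not Architect code and does not call any Genesys API; the Task class, the helper functions, and the 4-second audio lengths are all made up for illustration. It only assumes that Collect Input starts listening once every Play Audio in the chain has finished.

```python
from dataclasses import dataclass


@dataclass
class Task:
    """Hypothetical stand-in for one task in the chain (not a Genesys object)."""
    name: str
    language: str
    audio_seconds: float  # assumed playback length, invented for this sketch


def listen_start(tasks):
    """Time at which Collect Input would begin listening: after all audio has played."""
    return sum(t.audio_seconds for t in tasks)


def digit_is_heard(tasks, press_time):
    """A digit only counts if it is pressed once Collect Input is listening."""
    return press_time >= listen_start(tasks)


tasks = [
    Task("Task 1", "English", 4.0),
    Task("Task 2", "Spanish", 4.0),
    Task("Task 3", "Mandarin", 4.0),
]

# A caller who presses a digit 5 seconds in (during the Spanish audio) is missed;
# a caller who waits until all the audio has finished is heard.
early_press_heard = digit_is_heard(tasks, 5.0)
late_press_heard = digit_is_heard(tasks, 12.5)
```

In this model, every language you add pushes the listening point further out, so callers have to sit through more audio before a key press counts.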
Honestly, using prompts is a much better solution.
------------------------------
Melissa Bailey
Genesys - Employees
------------------------------