I cannot figure out how to do the following simple thing... Any hints are welcome.
I am writing a skill that needs to do the following:
- Display something on the screen (an APL doc) along with some voice.
- Wait until the first speech is finished
- Display something else
- Say something else
There is no user input between 2 and 3. In other words, I need two display+voice screens in sequence without user input in between. Think of a common scenario when the user chooses a cheese, then your skill shows the picture of the chosen cheese and describes it with voice (#1 above), and immediately after the voice is completed, it displays the list of wines (#3 above) and asks "Which wine do you want with this cheese?" (#4).
I can probably send the normal response including the initial APL doc (#1) and then wait a bit and send ExecuteCommand to somehow show the second screen and speak the second part of the text, but I have no clue how long to wait until the first speech has completed (the length of the speech varies). This approach will also make the skill code more complex to support both APL and non-APL devices. I hope there is a better way.