I am currently implementing AVS on ARM Cortex M4 based microcontroller. Please observe the two scenarios mentioned below:
- When I ask a simple query to Alexa (ex: "What is the Time?"), the speech response which I receive is having 24 kHz sampling frequency.
- When I ask Alexa to play radio on TuneIN (ex: "Play Arrow MP3 on TuneIN"), the music that plays has 44.1 kHz sampling frequency.
Now my goal is to mix these two audio streams so that music can continue to play in background at slow volume when Alexa answers your query. However, since Alexa speech is at 24 kHz sampling rate and music is at 44.1 kHz sampling rate, it is very difficult to mix the two streams sample by sample.
So my question is this: Is there any way we can configure or request Alexa to send the speech at specific sampling rate instead of the default 24 kHz sampling rate?