- initialize speak device.
- beginspeak, if this step is necessary, why?
- speak / endspeak, what is the request that I have to make to send the data? because in the documentation I only find this [generic byte data - stream type].
Currently, I have the audio that I record in WAV format, what conversion do you recommend to send to the server? I already saw that they mention [generic byte data - stream type], but I would like to have more detailed information to be able to encode it correctly.