Appium is great for automating user touches on screen and analyzing screen displays, but user/app interaction often includes voice commands and messages. I was wondering if it might be possible to automatically test this as well, like you specify a mp3 file in your code and the driver sends it to the device as a voice command, or it detects a sound file was played and returns information about it (name, duration...). I know it may be impossible since Appium relies on official tools from Google and Apple, but i'm just wondering.