1. A device comprising:
a display;
a microphone;
memory that stores computer-executable instructions; and
at least one processor configured to access the memory and execute the computer-executable instructions to:
receive first voice input;
determine that audio content is playing when the first voice input is received;
cause playback of the audio content to be paused;
cause analysis of the first voice input to determine that both a trigger word and a request are present in the first voice input;
determine first visual content associated with the request; and
cause the first visual content to be sent to a first display device for presentation, wherein the first display device is associated with a user account.
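
By way of illustration only, the sequence of operations recited above might be sketched as follows. This is a minimal sketch under stated assumptions, not the claimed implementation: the names `AudioPlayer`, `DisplayDevice`, `analyze`, `lookup_visual_content`, and `handle_voice_input`, the trigger word "computer", and the plain-text stand-in for voice input are all hypothetical. The claim does not state what happens when the trigger word or the request is absent; the sketch simply returns without sending any content.

```python
from dataclasses import dataclass, field
from typing import List, Optional

TRIGGER_WORD = "computer"  # hypothetical trigger word, not taken from the claim


@dataclass
class AudioPlayer:
    """Stand-in for whatever component plays the audio content."""
    playing: bool = False

    def pause(self) -> None:
        self.playing = False


@dataclass
class DisplayDevice:
    """Stand-in for the first display device, tied to a user account."""
    account_id: str
    shown: List[str] = field(default_factory=list)

    def present(self, content: str) -> None:
        self.shown.append(content)


def analyze(voice_input: str) -> Optional[str]:
    """Return the request only if both a trigger word and a request are present.

    Assumption: the voice input has already been transcribed to text, and the
    request is whatever follows the trigger word.
    """
    words = voice_input.lower().split()
    if TRIGGER_WORD not in words:
        return None
    request = " ".join(words[words.index(TRIGGER_WORD) + 1:])
    return request or None


def lookup_visual_content(request: str) -> str:
    """Hypothetical mapping from a request to associated visual content."""
    return f"card:{request}"


def handle_voice_input(voice_input: str, player: AudioPlayer,
                       display: DisplayDevice) -> bool:
    """Run the recited steps in order; return True if content was sent."""
    # Determine that audio content is playing when the input is received,
    # and cause playback to be paused.
    if player.playing:
        player.pause()
    # Cause analysis of the input for both a trigger word and a request.
    request = analyze(voice_input)
    if request is None:
        return False  # behavior here is an assumption; the claim is silent
    # Determine the visual content and send it to the display device.
    display.present(lookup_visual_content(request))
    return True
```

For example, calling `handle_voice_input("Computer show weather", AudioPlayer(playing=True), DisplayDevice("acct-1"))` pauses the hypothetical player and appends one item to the display's `shown` list, whereas the same call without the trigger word sends nothing.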