| CPC G10L 15/187 (2013.01) [G06F 16/632 (2019.01); G06F 16/683 (2019.01); G06F 16/686 (2019.01); G06N 5/02 (2013.01); G10L 15/1822 (2013.01); G10L 2015/025 (2013.01)] | 20 Claims |

|
1. A method comprising:
maintaining, by a media delivery service, a database of media items available for delivery, via a network, to a plurality of devices subscribed to the media delivery service;
determining that a media item will become available for delivery at a first time;
accessing metadata of the media item, the metadata comprising a text term;
generating a first phonetic representation of the text term pronounced as a word;
generating a second phonetic representation of the text term by concatenating a phonetic representation of each letter in the text term;
tracking a plurality of voice queries, received by the media delivery service, from the plurality of devices subscribed to the media delivery service, wherein the tracking is performed over a predefined period of time prior to the first time when the media item will become available for delivery to determine:
a first number of a first subset of the plurality of voice queries that matched the first phonetic representation over the predefined period of time; and
a second number of a second subset of the plurality of voice queries that matched the second phonetic representation over the predefined period of time;
after the predefined period of time, based at least in part on comparing the first number to the second number, selecting one of the first phonetic representation or the second phonetic representation;
modifying the database of media items to associate the media item with the selected phonetic representation; and
outputting an identifier of the media item based on a subsequent voice query of a device of the plurality of devices matching the selected phonetic representation.
|