US 12,315,501 B2
	Systems and methods for phonetic-based natural language understanding
Ajay Kumar Mishra, Karnataka (IN); and Jeffry Copps Robert Jose, Tamil Nadu (IN)
Assigned to ADEIA GUIDES INC., San Jose, CA (US)
Filed by Adeia Guides Inc., San Jose, CA (US)
Filed on Jan. 26, 2024, as Appl. No. 18/423,556.
Application 18/423,556 is a continuation of application No. 17/363,651, filed on Jun. 30, 2021, granted, now 11,922,931.
Prior Publication US 2024/0249718 A1, Jul. 25, 2024
Int. Cl. G06F 17/00 (2019.01); G06F 16/632 (2019.01); G06F 16/68 (2019.01); G06F 16/683 (2019.01); G06N 5/02 (2023.01); G10L 15/18 (2013.01); G10L 15/187 (2013.01); G10L 15/02 (2006.01)

CPC G10L 15/187 (2013.01) [G06F 16/632 (2019.01); G06F 16/683 (2019.01); G06F 16/686 (2019.01); G06N 5/02 (2013.01); G10L 15/1822 (2013.01); G10L 2015/025 (2013.01)]

20 Claims

1. A method comprising:

maintaining, by a media delivery service, a database of media items available for delivery, via a network, to a plurality of devices subscribed to the media delivery service;

determining that a media item will become available for delivery at a first time;

accessing metadata of the media item, the metadata comprising a text term;

generating a first phonetic representation of the text term pronounced as a word;

generating a second phonetic representation of the text term by concatenating a phonetic representation of each letter in the text term;

tracking a plurality of voice queries, received by the media delivery service, from the plurality of devices subscribed to the media delivery service, wherein the tracking is performed over a predefined period of time prior to the first time when the media item will become available for delivery to determine:

a first number of a first subset of the plurality of voice queries that matched the first phonetic representation over the predefined period of time; and

a second number of a second subset of the plurality of voice queries that matched the second phonetic representation over the predefined period of time;

after the predefined period of time, based at least in part on comparing the first number to the second number, selecting one of the first phonetic representation or the second phonetic representation;

modifying the database of media items to associate the media item with the selected phonetic representation; and

outputting an identifier of the media item based on a subsequent voice query of a device of the plurality of devices matching the selected phonetic representation.