US 11,990,121 B2
Distilling to a target device based on observed query patterns
Matthew Sharifi, Kilchberg (CH); and Victor Carbune, Zürich (CH)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Dec. 15, 2021, as Appl. No. 17/644,427.
Claims priority of provisional application 63/262,465, filed on Oct. 13, 2021.
Prior Publication US 2023/0111618 A1, Apr. 13, 2023
Int. Cl. G10L 15/065 (2013.01); G10L 15/01 (2013.01); G10L 15/06 (2013.01); G10L 15/18 (2013.01); G10L 15/26 (2006.01); G10L 15/30 (2013.01)
CPC G10L 15/065 (2013.01) [G10L 15/01 (2013.01); G10L 15/063 (2013.01); G10L 15/18 (2013.01); G10L 15/26 (2013.01); G10L 15/30 (2013.01)] 32 Claims
OG exemplary drawing
 
17. A system comprising:
data processing hardware; and
memory hardware in communication with the data processing hardware and storing instructions that when executed on the data processing hardware cause the data processing hardware to perform the operations comprising:
receiving, from a plurality of client devices each associated with a respective user, user queries directed toward a cloud-based assistant service executing on the data processing hardware;
for each received user query directed toward the cloud-based assistant service:
extracting one or more attributes from the user query; and
logging the user query into one or more of a plurality of category buckets based on the one or more attributes extracted from the user query;
determining when at least one of the plurality of category buckets includes a threshold number of the user queries logged into the at least one category bucket; and
when the at least one of the plurality of category buckets includes the threshold number of the user queries, generating a distilled model of the cloud-based assistant service, the distilled model of the cloud-based assistant service configured to execute on one or more target client devices of the plurality of client devices.