US 12,002,452 B2
Background audio identification for speech disambiguation
Jason Sanders, New York, NY (US); Gabriel Taubman, Brooklyn, NY (US); and John J. Lee, Long Island City, NY (US)
Assigned to Google LLC, Mountain View, CA (US)
Filed by Google LLC, Mountain View, CA (US)
Filed on Dec. 21, 2022, as Appl. No. 18/069,663.
Application 18/069,663 is a continuation of application No. 17/101,946, filed on Nov. 23, 2020, granted, now 11,557,280.
Application 17/101,946 is a continuation of application No. 16/249,211, filed on Jan. 16, 2019, granted, now 10,872,600, issued on Dec. 22, 2020.
Application 16/249,211 is a continuation of application No. 15/622,341, filed on Jun. 14, 2017, granted, now 10,224,024, issued on Mar. 5, 2019.
Application 15/622,341 is a continuation of application No. 14/825,648, filed on Aug. 13, 2015, granted, now 9,812,123, issued on Nov. 7, 2017.
Application 14/825,648 is a continuation of application No. 13/804,986, filed on Mar. 14, 2013, granted, now 9,123,388, issued on Sep. 1, 2015.
Application 16/249,211 is a continuation of application No. 14/825,648, filed on Aug. 13, 2015, granted, now 9,812,123, issued on Nov. 7, 2017.
Claims priority of provisional application 61/778,570, filed on Mar. 13, 2013.
Claims priority of provisional application 61/654,407, filed on Jun. 1, 2012.
Claims priority of provisional application 61/654,518, filed on Jun. 1, 2012.
Claims priority of provisional application 61/654,387, filed on Jun. 1, 2012.
Prior Publication US 2023/0125170 A1, Apr. 27, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G10L 15/22 (2006.01); G06F 16/683 (2019.01); G10L 15/08 (2006.01); G10L 15/18 (2013.01); G10L 15/26 (2006.01); G10L 21/0272 (2013.01); G10L 25/48 (2013.01); H04M 3/493 (2006.01); G10L 21/0208 (2013.01)

CPC G10L 15/08 (2013.01) [G06F 16/685 (2019.01); G10L 15/1815 (2013.01); G10L 15/22 (2013.01); G10L 15/26 (2013.01); G10L 21/0272 (2013.01); G10L 25/48 (2013.01); H04M 3/4936 (2013.01); G10L 2015/225 (2013.01); G10L 21/0208 (2013.01); H04M 2201/40 (2013.01); H04M 2203/352 (2013.01)]

20 Claims

1. A computer-implemented method executed on data processing hardware that causes the data processing hardware to perform operations comprising:

receiving first audio data and second audio data captured by a computing device associated with a user;

processing the first audio data to identify an entity associated with the first audio data;

retrieving a set of terms related to the identified entity;

influencing, using the retrieved set of terms related to the identified entity, a speech recognition language model; and

generating, using the influenced speech recognition language model, a transcription of the second audio data.
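
The operations recited in claim 1 can be illustrated with a small sketch. This is a toy under stated assumptions, not the patented implementation: the fingerprint lookup, the entity-to-terms table, the unigram language model, the multiplicative boost, and all names below (`identify_entity`, `retrieve_terms`, `influence_language_model`, `transcribe`) are hypothetical and do not appear in the claim. The first audio data is represented by a precomputed fingerprint string, and the second audio data by a list of candidate transcriptions with acoustic log-scores.

```python
import math

# Hypothetical stand-ins for an audio-recognition service and a
# knowledge base of related terms; neither is specified by the claim.
FINGERPRINT_TO_ENTITY = {"fp-001": "Example Movie"}
ENTITY_TERMS = {"Example Movie": {"actor", "director", "sequel"}}

# Toy unigram language model (probabilities sum to 1.0).
BASE_UNIGRAMS = {
    "who": 0.2, "is": 0.2, "the": 0.2,
    "actor": 0.05, "factor": 0.15,
    "in": 0.1, "this": 0.1,
}


def identify_entity(first_audio_fingerprint):
    """Map the first audio data (here, a fingerprint) to an entity, or None."""
    return FINGERPRINT_TO_ENTITY.get(first_audio_fingerprint)


def retrieve_terms(entity):
    """Return the set of terms related to the entity (empty if unknown)."""
    return ENTITY_TERMS.get(entity, set())


def influence_language_model(base, terms, boost=5.0):
    """Multiply the probability of each related term by `boost`, then renormalize."""
    boosted = {w: p * (boost if w in terms else 1.0) for w, p in base.items()}
    total = sum(boosted.values())
    return {w: p / total for w, p in boosted.items()}


def lm_log_score(text, lm, floor=1e-6):
    """Sum of unigram log-probabilities; unseen words get a small floor."""
    return sum(math.log(lm.get(w, floor)) for w in text.split())


def transcribe(candidates, lm):
    """Pick the candidate with the best acoustic + language-model log-score."""
    return max(candidates, key=lambda c: c[1] + lm_log_score(c[0], lm))[0]


def run_pipeline(first_audio_fingerprint, candidates):
    """Chain the claimed operations: identify, retrieve, influence, transcribe."""
    entity = identify_entity(first_audio_fingerprint)
    terms = retrieve_terms(entity)
    lm = influence_language_model(BASE_UNIGRAMS, terms)
    return transcribe(candidates, lm)


# Candidate transcriptions of the second audio data: (text, acoustic log-score).
CANDIDATES = [
    ("who is the factor in this", -1.0),
    ("who is the actor in this", -1.2),
]
```

With these toy numbers, `transcribe(CANDIDATES, BASE_UNIGRAMS)` selects "who is the factor in this", while `run_pipeline("fp-001", CANDIDATES)` selects "who is the actor in this", because the boosted term outweighs the small acoustic preference for the competing hypothesis. A production recognizer would typically bias an n-gram or neural language model rather than unigrams; the multiplicative boost here is only one simple way to realize the "influencing" step.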