US 12,070,323 B2
	System and method for generating diagnostic health information using deep learning and sound understanding
Katherine Chou, Palo Alto, CA (US); Michael Dwight Howell, Palo Alto, CA (US); Kasumi Widner, San Carlos, CA (US); Ryan Rifkin, Oakland, CA (US); Henry George Wei, Larchmont, NY (US); Daniel Ellis, New York, NY (US); Alvin Rajkomar, Mountain View, CA (US); Aren Jansen, Palo Alto, CA (US); David Michael Parish, Buffalo, NY (US); and Michael Philip Brenner, Somerville, MA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Appl. No. 17/045,318
Filed by Google LLC, Mountain View, CA (US)
PCT Filed May 4, 2018, PCT No. PCT/US2018/031064 § 371(c)(1), (2) Date Oct. 5, 2020, PCT Pub. No. WO2019/194843, PCT Pub. Date Oct. 10, 2019.
Claims priority of provisional application 62/653,238, filed on Apr. 5, 2018.
Prior Publication US 2021/0361227 A1, Nov. 25, 2021
Int. Cl. A61B 5/00 (2006.01); G06N 20/00 (2019.01); G10L 25/63 (2013.01); G10L 25/66 (2013.01)

CPC A61B 5/4803 (2013.01) [A61B 5/7264 (2013.01); A61B 5/7275 (2013.01); G06N 20/00 (2019.01); G10L 25/63 (2013.01); G10L 25/66 (2013.01)]

15 Claims

1. A computing system, comprising:

at least one processor;

a machine-learned health model comprising:

a sound model, wherein the sound model is trained to receive data descriptive of a patient audio recording and, in response to receipt of the patient audio recording, output sound description data, wherein the sound model comprises a machine-learned audio classification model, and wherein the sound description data comprises an embedding provided by a hidden layer of the machine-learned audio classification model; and

a diagnostic model, wherein the diagnostic model is trained to receive the sound description data, and in response to receipt of the sound description data, output a diagnostic score; and

at least one tangible, non-transitory computer-readable medium that stores instructions that, when executed by the at least one processor, cause the at least one processor to perform operations, the operations comprising:

obtaining the patient audio recording;

inputting data descriptive of the patient audio recording into the sound model;

receiving, as an output of the sound model, the sound description data;

inputting the sound description data into the diagnostic model; and

receiving, as an output of the diagnostic model, the diagnostic score.