US 12,190,904 B2
	Anomaly detection apparatus, probability distribution learning apparatus, autoencoder learning apparatus, data transformation apparatus, and program
Masataka Yamaguchi, Tokyo (JP); Yuma Koizumi, Tokyo (JP); and Noboru Harada, Tokyo (JP)
Assigned to NIPPON TELEGRAPH AND TELEPHONE CORPORATION, Tokyo (JP)
Appl. No. 17/266,240
Filed by NIPPON TELEGRAPH AND TELEPHONE CORPORATION, Tokyo (JP)
PCT Filed Jul. 4, 2019, PCT No. PCT/JP2019/026556 § 371(c)(1), (2) Date Feb. 5, 2021, PCT Pub. No. WO2020/031570, PCT Pub. Date Feb. 13, 2020.
Claims priority of application No. 2018-151412 (JP), filed on Aug. 10, 2018; and application No. 2018-209416 (JP), filed on Nov. 7, 2018.
Prior Publication US 2021/0327456 A1, Oct. 21, 2021
Int. Cl. G10L 25/51 (2013.01); G06N 3/045 (2023.01); G06N 3/088 (2023.01); G10L 25/30 (2013.01)

CPC G10L 25/51 (2013.01) [G06N 3/045 (2023.01); G06N 3/088 (2013.01); G10L 25/30 (2013.01)]

19 Claims

7. A probability distribution learning apparatus comprising a processor configured to execute operations comprising:

receiving normal sounds for learning, wherein the normal sounds for learning are normal sounds emitted from one or more pieces of equipment that are different from anomaly detection target equipment; and

learning a first probability distribution, wherein the first probability distribution indicates distribution of normal sound emitted from one or more pieces of equipment that are different from the anomaly detection target equipment, from the normal sounds for learning,

wherein a variable x of the first probability distribution q₁(x;θ) is a variable indicating input data generated from the normal sound emitted from the one or more pieces of equipment different from the anomaly detection target equipment,

the variable x is expressed as x=f_K(f_K-1( . . . (f₁(z₀)) . . . )) using transformations f_i(i=1, . . . , K, K is an integer of 1 or greater, and inverse transformations fil exist for the transformations f_i) and a latent variable z₀,

q₀(z₀) is set as a probability distribution of the latent variable z₀,

a probability density q₁(x;θ) of the input data x is calculated using a probability density q₀(z₀) of the latent variable z₀=f_i⁻¹(f₂⁻¹( . . . (f_K⁻¹(x)) . . . )) corresponding to the input data x, and

at least one inverse transformation of the transformations f_i(i=1, . . . , K) is adaptive batch normalization; and

transmitting the first probability distribution to an application configured to train anomaly detection neural network using the first probability distribution as anomaly training data.