US 12,008,464 B2
	Neural network based face detection and landmark localization
Haoxiang Li, San Jose, CA (US); Zhe Lin, Fremont, CA (US); Jonathan Brandt, Santa Cruz, CA (US); and Xiaohui Shen, San Jose, CA (US)
Assigned to ADOBE INC., San Jose, CA (US)
Filed by ADOBE INC., San Jose, CA (US)
Filed on Nov. 16, 2017, as Appl. No. 15/815,635.
Prior Publication US 2019/0147224 A1, May 16, 2019
Int. Cl. G06N 3/08 (2023.01); G06F 3/04812 (2022.01); G06F 18/2413 (2023.01); G06N 3/045 (2023.01); G06T 15/04 (2011.01); G06T 15/20 (2011.01); G06V 10/44 (2022.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 40/16 (2022.01)

CPC G06N 3/08 (2013.01) [G06F 3/04812 (2013.01); G06F 18/24143 (2023.01); G06N 3/045 (2023.01); G06T 15/04 (2013.01); G06T 15/205 (2013.01); G06V 10/454 (2022.01); G06V 10/764 (2022.01); G06V 10/82 (2022.01); G06V 40/165 (2022.01); G06V 40/171 (2022.01)]

18 Claims

1. A method comprising:

jointly predicting, by a neural network, (1) a scale change or offset vector representing an adjustment to an initial bounding box of a face in an input image, the adjustment to the initial bounding box defining an adjusted bounding box of the face and (2) initial facial landmark locations of the face by outputting a representation of both the scale change or offset vector for the initial bounding box and the initial facial landmark locations from a common fully-connected layer of the neural network, each of the initial facial landmark locations corresponding to a two-dimensional point in the input image;

generating refined facial landmark locations in the input image from the initial facial landmark locations; and

causing presentation of a representation of the input image using the refined facial landmark locations.