US 12,412,098 B2
Apparatus and method for neural architecture searching with target data
Sungjoo Yoo, Seoul (KR)
Assigned to Samsung Electronics Co., Ltd., Suwon-si (KR); and SNU R&DB FOUNDATION, Seoul (KR)
Filed by Samsung Electronics Co., Ltd., Suwon-si (KR); and SNU R&DB FOUNDATION, Seoul (KR)
Filed on Jul. 23, 2021, as Appl. No. 17/384,050.
Claims priority of application No. 10-2021-0028924 (KR), filed on Mar. 4, 2021; and application No. 10-2021-0034493 (KR), filed on Mar. 17, 2021.
Prior Publication US 2022/0284302 A1, Sep. 8, 2022
Int. Cl. G06N 3/088 (2023.01); G06N 3/045 (2023.01)
CPC G06N 3/088 (2013.01) [G06N 3/045 (2023.01)] 34 Claims
OG exemplary drawing
 
1. A processor-implemented method, the method comprising:
obtaining target data;
sampling a trained first neural network into a plurality of second neural networks,
wherein the sampling comprises randomly extracting architecture parameters for a plurality of sub-networks of the second neural network;
training each of the second neural networks based on a portion of the target data, wherein the training of each of the second neural networks comprises training an architecture parameter of each of the second neural networks using respective outputs obtained by inputting the portion of the target data to each of the second neural networks;
selecting a second neural network satisfying a predetermined condition among the trained second neural networks for performing an inference operation; and
performing the inference operation on the target data using the selected second neural network to output an operation result based on the target data.