| CPC G06N 3/063 (2013.01) [G06F 9/5027 (2013.01); G06F 18/217 (2023.01); G06F 18/29 (2023.01); G06N 3/08 (2013.01); G06N 20/00 (2019.01)] | 24 Claims |

|
1. A processor-implemented method with model optimization, comprising:
determining a graph representing operations to be performed in a target model related to input data input into the target model;
determining, by using a first machine learning (ML) model, separate from the target model, the first ML model receiving the input data, an attribute of the input data based on the inputted input data;
determining a predicted performance of the target model based on a behavior pattern of hardware that executes the target model;
optimizing the operations performed in the target model based on the graph, the attribute of the input data, and the predicted performance of the target model; and
executing the target model, including inputting the input data to the target model, by the hardware according to the optimized operations.
|