CPC G06N 3/063 (2013.01) [G06F 9/30014 (2013.01); G06F 9/30025 (2013.01); G06F 9/30043 (2013.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/084 (2013.01)] | 20 Claims |
1. An apparatus comprising:
a processor to:
expose embedded cast operations in at least one of a load instruction or a store instruction of a stream of instructions;
determine, for each layer of a multi-layer deep learning neural network (DNN), a target precision level for the cast operations at each layer and data types of a plurality of different data types for the cast operations at each layer, wherein the target precision level for the cast operations at each layer is determined from the plurality of different data types that are used to represent various weights in different layers of the multi-layer DNN, and wherein high precision floating point data is utilized for a first subset of the different layers, low precision floating point data is utilized for a second subset of the different layers, and integer data is utilized for a third subset of the different layers; and
load the cast operations at the target precision level and the data types determined for the cast operations at each layer of the multi-layer DNN.
|