CPC G06N 3/063 (2013.01) [G06F 17/16 (2013.01); G06F 18/217 (2023.01); G06F 18/29 (2023.01); G06N 3/08 (2013.01)] | 18 Claims |
1. A method implemented by a system of one or more processors, the method comprising:
obtaining neural network model information comprising a plurality of decision points associated with a neural network, wherein one or more first decision points are associated with a layout of the neural network;
determining, based on the platform neural network model information, constraints associated with adapting the neural network model information to a hardware platform, wherein a first constraint is associated with a processing resource of the hardware platform, wherein a second constraint is associated with a performance metric, and wherein the constraints are enforced on a per-layer or per-tensor basis; and
generating a candidate configuration for the neural network via execution of a satisfiability solver based on the constraints, wherein the candidate configuration assigns values to the plurality of decision points; and
responsive to updated constraints corresponding to the candidate configuration as a negation, generating an updated candidate configuration, such that the neural network is configured to the hardware platform.
|