| CPC G06F 21/105 (2013.01) [G06F 8/65 (2013.01); G06F 9/45558 (2013.01); G06F 9/5027 (2013.01); G06F 21/602 (2013.01); G06N 3/04 (2013.01); G06N 3/08 (2013.01); G06T 1/20 (2013.01); G06T 1/60 (2013.01); H04L 9/085 (2013.01); H04L 9/0891 (2013.01); H04L 9/3242 (2013.01); H04L 63/061 (2013.01); G06F 2009/45587 (2013.01)] | 18 Claims |

|
7. At least one non-transitory computer readable storage medium comprising a set of instructions, which when executed by a computing system, cause the computing system to:
partition a neural network inference model into a plurality of slices including a first slice and a second slice;
allocate different sets of resources to the plurality of slices based on one or more of a performance condition or a security condition, wherein the different sets of resources include a first set of resources and a second set of resources; and
coordinate, via a shared key, an execution of the plurality of slices by the different sets of resources by
a protection of a shared memory with the shared key;
an instruction to the first set of resources to store temporary values associated with the first slice to the shared memory; and
an instruction to the second set of resources to execute the second slice based on the temporary values to obtain an inference result.
|