US 11,915,154 B2
Managing artificial intelligence model partitions for edge computing environment
Jinpeng Liu, Shanghai (CN); Jin Li, Shanghai (CN); Zhen Jia, Shanghai (CN); and Christopher S. MacLellan, Uxbridge, MA (US)
Assigned to EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed by EMC IP Holding Company LLC, Hopkinton, MA (US)
Filed on Jul. 10, 2020, as Appl. No. 16/925,864.
Prior Publication US 2022/0012607 A1, Jan. 13, 2022
Int. Cl. G06N 20/00 (2019.01); G06F 9/48 (2006.01); G06N 5/04 (2023.01); G06F 9/50 (2006.01)
CPC G06N 5/04 (2013.01) [G06F 9/4881 (2013.01); G06F 9/5077 (2013.01); G06N 20/00 (2019.01)] 20 Claims
OG exemplary drawing
 
1. An apparatus, comprising:
a processor; and
a memory storing computer program instructions, the processor executing the computer program instructions in the memory to control the apparatus to:
obtain an intermediate representation of an artificial intelligence model;
obtain a computation graph generated based on the intermediate representation;
partition the computation graph into a set of partitions;
schedule the set of partitions for respective execution on a set of computing devices in an edge computing environment; and
cause deployment of the set of partitions respectively to the set of computing devices for execution in the edge computing environment;
wherein the obtaining, partitioning, scheduling, and causing deployment operations are performed in accordance with a model parallelism manager layer in coordination with an application manager layer; and
wherein the application manager layer is interposed between the model parallelism manager layer and the edge computing environment.