| CPC G06N 3/063 (2013.01) [G06F 9/30007 (2013.01); G06F 9/545 (2013.01)] | 20 Claims |

|
1. A compiler-implemented method of generating executable instructions for a computing system, comprising:
receiving a first set of instructions including a kernel of a first operator and a kernel of a second operator, the kernel of the first operator including instructions of the first operator and write instructions to a virtual data node, the kernel of the second operator including instructions of the second operator and read instructions to the virtual data node;
determining a mapping between the write instructions and read instructions based on common tensor addresses shared by the write instructions and the read instructions;
determining, based on the mapping between the write instructions and the read instructions, instructions of data transfer operations between the first operator and the second operator, wherein the instructions of the data transfer operations are determined such that a duplication of a same write instruction from the write instructions is reduced; and
generating a second set of instructions representing a fused operator of the first operator and the second operator, the second set of instructions being executable by the computing system and including the instructions of the first operator, the instructions of the second operator, and the instructions of the data transfer operations.
|
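
The steps recited in claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the `Instr` record, the `map_writes_to_reads` and `fuse` helpers, and the integer tensor addresses are all hypothetical names and simplifications introduced here for illustration. The sketch assumes each kernel is a flat list of instructions, matches writes to reads by equal tensor address, and emits one data transfer per write even when several reads consume that write.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Instr:
    """Hypothetical instruction record (not from the patent)."""
    kind: str                   # "compute", "write", "read", or "transfer"
    name: str
    addr: Optional[int] = None  # tensor address in the virtual data node


def map_writes_to_reads(first_kernel, second_kernel):
    """Pair each write of the first kernel with the reads of the second
    kernel that share its tensor address."""
    reads_by_addr = defaultdict(list)
    for ins in second_kernel:
        if ins.kind == "read":
            reads_by_addr[ins.addr].append(ins)
    return [(w, reads_by_addr.get(w.addr, []))
            for w in first_kernel if w.kind == "write"]


def fuse(first_kernel, second_kernel):
    """Build a fused instruction list: first operator, transfers, second
    operator. Each write yields at most one transfer, so a write consumed
    by several reads is not duplicated."""
    mapping = map_writes_to_reads(first_kernel, second_kernel)
    transfers = [
        Instr("transfer", f"{w.name}->{'+'.join(r.name for r in rs)}", w.addr)
        for w, rs in mapping if rs
    ]
    first_ops = [i for i in first_kernel if i.kind == "compute"]
    second_ops = [i for i in second_kernel if i.kind == "compute"]
    return first_ops + transfers + second_ops


# Assumed example: write w0 (address 0) feeds two reads, w1 feeds one.
first = [Instr("compute", "conv"), Instr("write", "w0", 0), Instr("write", "w1", 1)]
second = [Instr("read", "r0", 0), Instr("read", "r1", 0),
          Instr("read", "r2", 1), Instr("compute", "relu")]
fused = fuse(first, second)
print([i.name for i in fused])
```

In this example the three reads are served by two transfers, because the two reads at address 0 share the single write `w0`. A naive scheme that issued one transfer per read would repeat that write.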