| CPC G06F 15/825 (2013.01) [G06F 9/3867 (2013.01)] | 17 Claims |

|
1. A system for reducing latency and increasing throughput in reconfigurable dataflow processors, the system comprising:
a host computer comprising a graph optimization module configured to conduct a method comprising:
receiving a compute graph for execution on a reconfigurable dataflow computing system, the compute graph comprising a node specifying an operation on a tensor;
splitting the node into multiple nodes that each specify the operation on a distinctive portion of the tensor to produce a first modified compute graph; and
a reconfigurable dataflow processor (RDP) configured to execute the first modified compute graph.
|