| CPC G06F 9/5027 (2013.01) [G06F 9/5083 (2013.01); G06F 13/4022 (2013.01); H04L 45/24 (2013.01); H04L 45/74 (2013.01); G06F 2209/505 (2013.01)] | 20 Claims |

|
1. A method comprising:
providing a network fabric including: (i) a plurality of graphical processing unit (GPU) clusters, the plurality of GPU clusters including at least a first GPU cluster operating at a first speed and a second GPU cluster operating at a second speed that is different than the first speed, and (ii) a plurality of blocks, wherein each block includes: (a) one or more racks that host a GPU cluster, and (b) a plurality of switches arranged in a hierarchical structure that communicatively couple the block to other blocks included in the network fabric; and
responsive to receiving a request to execute a workload, allocating one or more GPUs from the plurality of GPU clusters to execute the workload.
|