US 12,033,063 B2
	Scheduling configuration for deep learning networks
Eran Ben-Avi, Haifa (IL); Neta Zmora, Tzur Moshe (IL); Guy Jacob, Netanya (IL); Lev Faivishevsky, Kfar Saba (IL); Jeremie Dreyfuss, Tel-Aviv (IL); Tomer Bar-On, Petah Tikva (IL); Jacob Subag, Kiryat Haim (IL); Yaniv Fais, Tel-Aviv (IL); Shira Hirsch, Jerusalem (IL); Orly Weisel, Elazar (IL); Zigi Walter, Haifa (IL); and Yarden Oren, Jerusalem (IL)
Assigned to Intel Corporation, Santa Clara, CA (US)
Filed by Intel Corporation, Santa Clara, CA (US)
Filed on Feb. 24, 2023, as Appl. No. 18/174,275.
Application 18/174,275 is a continuation of application No. 15/499,900, filed on Apr. 28, 2017, granted, now 11,599,777.
Prior Publication US 2023/0281435 A1, Sep. 7, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06N 3/06 (2006.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/063 (2023.01); G06N 3/084 (2023.01)

CPC G06N 3/063 (2013.01) [G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/084 (2013.01)]

20 Claims

1. An apparatus comprising:

a cluster of distinct interconnected graphics processors, each graphics processor including a plurality of processing resources; and

processing circuitry to schedule operations to the cluster of distinct interconnected graphics processors, the processing circuitry configured to:

determine a traversal strategy for a deep learning neural network, the traversal strategy to be implemented via dispatch components of the graphics processors in the cluster of distinct interconnected graphics processors; and

convey the traversal strategy to the dispatch components of the graphics processors, the graphics processors configured to:

receive the traversal strategy and data for the deep learning neural network;

traverse a solution space of the deep learning neural network to score a plurality of solutions to schedule deep learning network execution on the plurality of processing resources of a graphics processor of the cluster of distinct interconnected graphics processors;

select a solution from the plurality of solutions to implement the deep learning network based on scores associated with the plurality of solutions; and

implement a workload schedule to assign tasks to the plurality of processing resources, wherein the workload schedule specifies a batch of grouped operations, the operations of the batch of grouped operations determined via a machine learning model based on historical data associated with the cluster of distinct interconnected graphics processors.