| CPC G06F 3/0656 (2013.01) [G06F 3/0604 (2013.01); G06F 3/0673 (2013.01); G06F 12/1027 (2013.01); G06F 2212/68 (2013.01)] | 13 Claims |

|
1. An apparatus comprising:
a first die having a plurality of cores, each of the plurality of cores comprising:
a scheduler to schedule tasks for execution on an execution circuit;
the execution circuit to execute the tasks;
and a local memory interface circuit to access a local portion of a dynamic random access memory (DRAM);
a second die comprising the DRAM, the DRAM comprising a plurality of local portions, wherein at least some of the plurality of cores are directly coupled to a corresponding local portion of the DRAM by a stacking of the first die and the second die; wherein each of the plurality of cores further comprises an interface circuit to couple the core to a first nearest neighbor core and a second nearest neighbor core, wherein the interface circuit is to store a first processed data in the corresponding local portion of the DRAM, and to send the first processed data to the local memory portion of the first nearest neighbor core, wherein the first nearest neighbor core is to further process the first processed data.
|