| CPC G06F 13/1605 (2013.01) [G06F 9/3004 (2013.01); G06F 9/3887 (2013.01); G06F 9/3888 (2023.08); G06F 9/38885 (2023.08); G06F 9/5016 (2013.01); G06T 1/20 (2013.01); G06T 1/60 (2013.01)] | 20 Claims |

|
1. An apparatus comprising:
a processing resource of a chiplet; and
an L1 cache communicably coupled to the processing resource and comprising synchronization hardware circuit to:
allocate a first thread group as a member of a super thread group (STG) comprising a collection of thread groups running on the chiplet;
receive from a second thread group within the STG, a request to communicate with the first thread group identified by a thread group identifier (ID) within the STG;
access a routing table to determine a location of the first thread group based on the thread group ID; and
route the request to the determined location of the first thread group using communication links between L1 caches of the chiplet.
|
|
10. A method comprising:
allocating, by a synchronization hardware circuit of an L1 cache of a graphics processor, a first thread group as a member of a super thread group (STG) comprising a collection of thread groups running on the graphics processor;
receiving, by the synchronization hardware circuit, from a second thread group within the STG, a request to communicate with the first thread group identified by a thread group identifier (ID) within the STG;
accessing, by the synchronization hardware circuit, a routing table to determine a location of the first thread group based on the thread group ID; and
routing, by the synchronization hardware circuit, the request to the determined location of the first thread group using communication links between L1 caches of the graphics processor.
|
|
16. A non-transitory computer-readable medium having instructions stored thereon, which when executed by one or more processors, cause the one or more processors to:
allocate, by a synchronization hardware circuit of an L1 cache of the one or more processors, a first thread group as a member of a super thread group (STG) comprising a collection of thread groups running on the one or more processors;
receive, by the synchronization hardware circuit, from a second thread group within the STG, a request to communicate with the first thread group identified by a thread group identifier (ID) within the STG;
access, by the synchronization hardware circuit, a routing table to determine a location of the first thread group based on the thread group ID; and
route, by the synchronization hardware circuit, the request to the determined location of the first thread group using communication links between L1 caches of the one or more processors.
|