CPC G06F 9/3887 (2013.01) [G06F 9/3001 (2013.01); G06F 9/30014 (2013.01); G06F 9/30036 (2013.01); G06F 9/30094 (2013.01); G06F 9/30109 (2013.01); G06F 9/30112 (2013.01); G06F 9/3016 (2013.01); G06F 9/3851 (2013.01); G06F 9/3891 (2013.01); G06F 9/50 (2013.01); G06F 13/4068 (2013.01); G06F 13/4282 (2013.01); G06F 15/80 (2013.01); G06N 3/00 (2013.01); G06N 3/044 (2023.01); G06N 3/045 (2023.01); G06N 3/063 (2013.01); G06N 3/084 (2013.01); G06N 20/00 (2019.01); G06N 20/10 (2019.01); G06T 1/20 (2013.01); G06F 2213/0026 (2013.01)] | 24 Claims |
1. A graphics processing unit (GPU) comprising:
a processing cluster comprising a plurality of multiprocessors interconnected via a data crossbar, the plurality of multiprocessors configured to distribute processed data among the plurality of multiprocessors directly via the data crossbar, from a first multiprocessor of the plurality of multiprocessors to a second multiprocessor of the plurality of multiprocessors, wherein a multiprocessor of the plurality of multiprocessors comprises:
an instruction cache to store a first instruction and a second instruction, the first instruction to cause the multiprocessor to perform a floating-point operation and the second instruction to cause the multiprocessor to perform an integer operation; and
a plurality of general-purpose graphics compute units having a single instruction, multiple thread architecture, the plurality of general-purpose graphics compute units including a first general-purpose graphics compute unit to execute the first instruction concurrently with execution of the second instruction by a second general-purpose graphics compute unit.
|