US 11,853,734 B2
Compiler-initiated tile replacement to enable hardware acceleration resources
Gregory P. Rodgers, Austin, TX (US); and Joseph L. Greathouse, Austin, TX (US)
Assigned to Advanced Micro Devices, Inc., Santa Clara, CA (US)
Filed by ADVANCED MICRO DEVICES, INC., Santa Clara, CA (US)
Filed on May 10, 2022, as Appl. No. 17/740,828.
Application 17/740,828 is a continuation of application No. 16/832,275, filed on Mar. 27, 2020, granted, now 11,347,486.
Prior Publication US 2022/0269492 A1, Aug. 25, 2022
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 8/41 (2018.01); G06F 17/16 (2006.01)
CPC G06F 8/4435 (2013.01) [G06F 17/16 (2013.01)]
20 Claims
 
1. A method comprising:
in response to receiving an indication that source code to be compiled at a processing system includes at least one function that performs a memory access to a tile of data, determining whether at least a portion of the source code representing the at least one function that performs the memory access to the tile of data is replaceable by a tensor operation executable at a special-purpose hardware accelerator of the processing system; and
generating code that replaces the at least a portion of the source code representing the at least one function that performs the memory access to the tile of data with the tensor operation in response to determining that the at least a portion of source code representing the at least one function that performs the memory access to the tile of data is replaceable by the tensor operation.
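
The two steps of claim 1 can be illustrated with a minimal sketch. This is not the patented implementation: the toy `TileFunction` record, the `SUPPORTED_TILE_SHAPES` set, the `find_tensor_replacement` and `generate_code` helpers, and the `accel_tensor_matmul_*` call names are all hypothetical, introduced only to show a "determine whether replaceable, then generate replacement code" flow. The sketch assumes that a function is replaceable when it is a matrix multiply over a tile shape the accelerator supports.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical: tile shapes that the accelerator's tensor unit is assumed to support.
SUPPORTED_TILE_SHAPES = {(16, 16), (32, 32)}


@dataclass
class TileFunction:
    """Toy stand-in for a source-level function that accesses a tile of data."""
    name: str
    kind: str                    # e.g. "matmul"; assumed to come from the indication
    tile_shape: Tuple[int, int]  # (rows, cols) of the tile being accessed
    body: str                    # original general-purpose code, kept as text


def find_tensor_replacement(fn: TileFunction) -> Optional[str]:
    """Step 1: decide whether fn is replaceable by a tensor operation.

    Returns the (hypothetical) tensor-operation call, or None if not replaceable.
    """
    if fn.kind == "matmul" and fn.tile_shape in SUPPORTED_TILE_SHAPES:
        rows, cols = fn.tile_shape
        return f"accel_tensor_matmul_{rows}x{cols}(a, b, c)"
    return None


def generate_code(functions: List[TileFunction]) -> List[str]:
    """Step 2: emit the tensor operation where replaceable, else the original body."""
    generated = []
    for fn in functions:
        replacement = find_tensor_replacement(fn)
        generated.append(replacement if replacement is not None else fn.body)
    return generated


# Functions flagged as tile accesses; being in this list models the "indication".
flagged = [
    TileFunction("mm_tile", "matmul", (16, 16), "scalar_loop_16x16(a, b, c)"),
    TileFunction("odd_tile", "matmul", (5, 7), "scalar_loop_5x7(a, b, c)"),
]
generated = generate_code(flagged)
```

In this sketch the first function is rewritten to the tensor call because its tile shape is supported, while the second keeps its original body, mirroring the conditional replacement recited in the claim.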