CPC G06F 9/30036 (2013.01) [G06F 7/485 (2013.01); G06F 7/4876 (2013.01); G06F 7/762 (2013.01); G06F 9/3001 (2013.01); G06F 9/30032 (2013.01); G06F 9/30043 (2013.01); G06F 9/30109 (2013.01); G06F 9/30112 (2013.01); G06F 9/30134 (2013.01); G06F 9/30145 (2013.01); G06F 9/30149 (2013.01); G06F 9/3016 (2013.01); G06F 9/30185 (2013.01); G06F 9/30196 (2013.01); G06F 9/3818 (2013.01); G06F 9/3836 (2013.01); G06F 17/16 (2013.01); G06F 2212/454 (2013.01)] | 17 Claims |
1. A processor comprising:
decode circuitry to decode an instruction having fields for an opcode, a source matrix operand identifier of a single two-dimensional tile register in a matrix operations accelerator of the processor, and a destination matrix operand identifier; and
execution circuitry to execute the decoded instruction to cause the matrix operations accelerator to transpose each row of elements of the identified source matrix operand into a corresponding column of the identified destination matrix operand and zero any remaining columns of the identified destination matrix operand and unconfigured rows of the identified destination matrix operand.
|