US 12,217,056 B2
Load/store unit for a tensor engine and methods for loading or storing a tensor
Philip Winterbottom, San Jose, CA (US)
Assigned to Celestial AI Inc., Santa Clara, CA (US)
Filed by Celestial AI Inc., Santa Clara, CA (US)
Filed on Jan. 25, 2024, as Appl. No. 18/423,210.
Claims priority of provisional application 63/441,689, filed on Jan. 27, 2023.
Prior Publication US 2024/0403046 A1, Dec. 5, 2024
Int. Cl. G06F 9/30 (2018.01)
CPC G06F 9/30036 (2013.01) [G06F 9/30065 (2013.01); G06F 9/3013 (2013.01)]
20 Claims
1. A method for processing a tensor, comprising:
obtaining a first register for a number of items in the tensor;
obtaining one or more second registers for a number of items in a first and a second axis of the tensor;
obtaining a stride in the first and the second axis;
obtaining a next item in the tensor using the stride in the first axis and a first offset register, when the first register indicates the tensor has additional items to process and the second registers indicate the next item resides in the first axis;
obtaining a next item in the tensor using the stride in the first axis and the second axis, the first offset register, and a second offset register, when the first register indicates the tensor has additional items to process, and the second registers indicate the next item resides in the second axis of the tensor;
modifying the first register and one or more of the second registers; and
modifying at least one of the first and the second offset registers.
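
The steps of claim 1 can be read as a two-axis strided address walk driven by count registers and offset registers. The following Python sketch is one illustrative interpretation only, not the patented implementation: the function name `walk_tensor`, the `base` address, the variable names, and the choice to reset the first offset when moving along the second axis are all assumptions introduced here for illustration.

```python
def walk_tensor(base, n0, n1, stride0, stride1):
    """Return the addresses visited for an n0 x n1 tensor (hypothetical model).

    remaining : models the "first register" (items left in the tensor)
    left0/left1 : model the "second registers" (items left per axis)
    off0/off1 : model the first and second offset registers
    """
    remaining = n0 * n1
    left0, left1 = n0, n1
    off0, off1 = 0, 0
    addresses = []

    while remaining > 0:              # first register: more items to process
        addresses.append(base + off0 + off1)

        # Modify the first register and the per-axis count register.
        remaining -= 1
        left0 -= 1

        if left0 > 0:
            # Next item resides in the first axis: advance by stride0 only.
            off0 += stride0
        else:
            # Next item resides in the second axis: rewind the first offset
            # (assumed behavior) and advance the second offset by stride1.
            off0 = 0
            off1 += stride1
            left0 = n0
            left1 -= 1

    return addresses


if __name__ == "__main__":
    # 3 items along the first axis, 2 along the second,
    # element stride 1 and row stride 4.
    print(walk_tensor(0, 3, 2, 1, 4))
```

Under these assumptions, a row stride larger than the row length (4 versus 3 in the usage example) skips padding between rows, which is the kind of layout a stride-based traversal is meant to handle without computing a full index product per item.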