US 12,450,056 B2
Efficient data layout and alignment for wide-vector accelerator systems
Shubham Jain, Elmsford, NY (US); Geoffrey Burr, Cupertino, CA (US); and Yasuteru Kohda, Yamato (JP)
Assigned to International Business Machines Corporation, Armonk, NY (US)
Filed by International Business Machines Corporation, Armonk, NY (US)
Filed on Mar. 22, 2022, as Appl. No. 17/701,308.
Prior Publication US 2023/0305841 A1, Sep. 28, 2023
Int. Cl. G06F 9/00 (2018.01); G06F 9/30 (2018.01); G06F 9/38 (2018.01)
CPC G06F 9/30038 (2023.08) [G06F 9/30032 (2013.01); G06F 9/3877 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method for processing artificial intelligence (AI) workloads, the method comprising:
logically dividing a data vector into a hierarchy of segments and sub-segments with each of the segments comprising more than one of the sub-segments, wherein the data vector is a row from memory, each of the sub-segments comprises words, and each of the words comprises data-bits;
physically mapping the data-bits such that the words belonging to a same given one of the sub-segments are mapped contiguously across all of the segments;
pulling the row from the memory; and
performing alignment operations on the segments, the sub-segments, or a combination thereof to create an aligned data-vector.