| CPC G06F 13/28 (2013.01) [G06F 1/04 (2013.01)] | 20 Claims |

|
1. A direct memory access (DMA) system, comprising:
a DMA hardware unit comprising:
a request generator configured to generate multiple memory addresses for tensor elements of a multi-dimensional tensor in parallel and, for each memory address, a respective request for a memory system to perform a memory operation for the multi-dimensional tensor; and
a progress tracker comprising:
a response reorder unit configured to maintain, for each tensor element, a status of whether a memory operation for the tensor element has been performed; and
a synchronization unit configured to provide, to a processor core, multiple partial updates that each specify an overall status of memory operations performed on the tensor elements of the multi-dimensional tensor,
wherein the progress tracker is configured to determine that a number of received responses associated with the respective requests from the memory system is greater than or equal to a threshold value, and in response to determining that the number of received responses is greater than or equal to the threshold value, cause the synchronization unit to provide a partial update to the processor core.
|