US 11,868,809 B2
Hardware assisted fine-grained data movement
Muhammad Amber Hassaan, Austin, TX (US); Anirudh Mohan Kaushik, Austin, TX (US); Sooraj Puthoor, Austin, TX (US); Gokul Subramanian Ravi, Austin, TX (US); Bradford Beckmann, Bellevue, WA (US); and Ashwin Aji, Santa Clara, CA (US)
Assigned to Advanced Micro Devices, Inc., Santa Clara, CA (US)
Filed by ADVANCED MICRO DEVICES, INC., Santa Clara, CA (US)
Filed on Jan. 11, 2023, as Appl. No. 18/095,704.
Application 18/095,704 is a division of application No. 16/824,601, filed on Mar. 19, 2020, granted, now Pat. No. 11,734,059.
Prior Publication US 2023/0229494 A1, Jul. 20, 2023
Int. Cl. G06F 9/46 (2006.01); G06F 9/48 (2006.01); G06F 9/52 (2006.01); G06F 16/901 (2019.01)
CPC G06F 9/4881 (2013.01) [G06F 9/52 (2013.01); G06F 16/9024 (2019.01); G06F 2209/486 (2013.01)]
20 Claims
 
1. A method implemented in a processor, comprising:
performing a task dependency assessment of a task dependency graph representative of a plurality of tasks and task data requirements that correspond to each task of the plurality of tasks;
based on the task dependency assessment, generating an asynchronous schedule for a task, the schedule providing for transfer of proxy objects required for the task to a graphics processing unit memory during execution of a preceding task; and
transferring data associated with the proxy objects to the graphics processing unit memory based on the schedule.
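
The three steps of claim 1 can be illustrated with a minimal sketch. It is not the patented implementation: the function names `build_schedule` and `run`, the dictionary-based graph encoding, and the use of a plain Python dictionary to stand in for graphics processing unit memory are all assumptions made for illustration. The transfers are simulated sequentially; a real system would presumably issue them as asynchronous copies that overlap with the preceding task.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+


def build_schedule(deps, needs):
    """Assess the task dependency graph and plan data movement.

    deps:  task -> set of tasks it depends on (hypothetical encoding).
    needs: task -> list of proxy object names the task requires.
    Returns a list of (task, overlap_with, to_move) entries, where
    overlap_with is the preceding task during which the transfer of
    to_move is meant to happen (None for the first task).
    """
    order = list(TopologicalSorter(deps).static_order())
    planned = set()  # proxy objects already planned for transfer
    schedule = []
    for i, task in enumerate(order):
        # Only move proxy objects that no earlier task has brought over.
        to_move = [p for p in needs.get(task, []) if p not in planned]
        planned.update(to_move)
        overlap_with = order[i - 1] if i > 0 else None
        schedule.append((task, overlap_with, to_move))
    return schedule


def run(schedule, host_data):
    """Sequentially simulate the schedule.

    gpu_memory is a dict standing in for GPU memory (an assumption).
    Each transfer is logged with the task it is meant to overlap.
    """
    gpu_memory = {}
    log = []
    for task, overlap_with, to_move in schedule:
        for name in to_move:
            gpu_memory[name] = host_data[name]
            log.append(("transfer", name, overlap_with))
        log.append(("execute", task))
    return gpu_memory, log


if __name__ == "__main__":
    # Hypothetical three-task chain: A, then B, then C.
    deps = {"A": set(), "B": {"A"}, "C": {"B"}}
    needs = {"A": ["x"], "B": ["x", "y"], "C": ["z"]}
    plan = build_schedule(deps, needs)
    memory, events = run(plan, {"x": 1, "y": 2, "z": 3})
    for event in events:
        print(event)
```

In this sketch, task B reuses proxy object `x` that was already moved for task A, so only `y` is scheduled to move while A executes. Planning each transfer against the preceding task is one simple way to model the overlap the claim describes; other placements in the schedule are equally consistent with the claim language.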