CPC G06T 1/20 (2013.01) [G06F 9/46 (2013.01); G06N 3/045 (2023.01); G06N 3/063 (2013.01); G06N 3/08 (2013.01); G06N 3/044 (2023.01); G06N 3/084 (2013.01)] | 18 Claims |
1. An apparatus comprising:
a processing system including a graphics processor, the graphics processor including a plurality of processing resources, the plurality of processing resources configured to be partitioned into a plurality of physical resource slices, wherein the processing system has a capability to limit usage of the plurality of processing resources by a plurality of contexts, the graphics processor includes a plurality of memory partitions, the plurality of physical resource slices includes a plurality of compute resource partitions, and the plurality of compute resource partitions is associated with a plurality of memory access paths to the plurality of memory partitions; and
circuitry configured to:
receive specification of a limitation on usage of the plurality of processing resources by respective contexts of the plurality of contexts;
schedule workloads associated with the plurality of contexts to the plurality of physical resource slices according to the limitation on usage specified for the respective contexts of the plurality of contexts;
limit execution of workloads for respective contexts of the plurality of contexts to a specified subset of the plurality of processing resources according to the physical resource slices associated with the respective contexts of the plurality of contexts; and
monitor, during execution of a workload associated with a context of the plurality of contexts, a utilization percentage of the specified subset of the plurality of processing resources to which the context is limited.
|