CPC G06T 1/20 (2013.01) | 19 Claims |
1. A method for running a service, comprising:
determining a target deployment manner of a graphics processing unit (GPU) according to performance data of each service in a service set, wherein the deployment manner comprises: dividing the GPU into sub-GPUs of a respective size and determining a service configured to be run by each sub-GPU; and
switching, for the service in the service set, running of the service from a sub-GPU indicated by a current deployment manner to a sub-GPU indicated by the target deployment manner, the switching comprising:
executing a switching step for each service:
determining, for the service, a first sub-GPU to be newly created and a second sub-GPU to be deleted by comparing the current deployment manner with the target deployment manner of the GPU;
pairing the first sub-GPU with the second sub-GPU for the service and obtaining at least one sub-GPU pair on the premise that a target performance of the service is satisfied;
creating the first sub-GPU comprised in the sub-GPU pair, running the service on the created first sub-GPU and then deleting the second sub-GPU comprised in the sub-GPU pair for each sub-GPU pair; and
creating a first sub-GPU, running the service on the created first sub-GPU and then deleting a second sub-GPU, for a first sub-GPU and a second sub-GPU that are not paired for the service.
|