US 12,294,523 B2
Application instance deployment method, application instance scheduling method, and apparatus
Nannan Wang, Shenzhen (CN); Yang Xu, Shenzhen (CN); and Bo Yi, Shenzhen (CN)
Assigned to Huawei Cloud Computing Technologies Co., Ltd., Guizhou (CN)
Filed by HUAWEI CLOUD COMPUTING TECHNOLOGIES CO., LTD., Guizhou (CN)
Filed on Aug. 11, 2022, as Appl. No. 17/886,017.
Application 17/886,017 is a continuation of application No. PCT/CN2021/076193, filed on Feb. 9, 2021.
Claims priority of application No. 202010086421.6 (CN), filed on Feb. 11, 2020.
Prior Publication US 2022/0385586 A1, Dec. 1, 2022
Int. Cl. H04W 24/02 (2009.01); G06F 15/16 (2006.01); H04L 12/26 (2006.01); H04L 12/28 (2006.01); H04L 47/2416 (2022.01); H04L 47/32 (2022.01); H04L 65/40 (2022.01); H04W 64/00 (2009.01)
CPC H04L 47/2416 (2013.01) [H04L 47/32 (2013.01); H04L 65/40 (2013.01)] 18 Claims
OG exemplary drawing
 
1. An application instance deployment method, comprising:
receiving, by a global management platform, quality of service (QOS) requirement information from a first client, wherein the QoS requirement information comprises a first delay requirement, a second delay requirement, and a first quantity of connections, wherein the first delay requirement represents a maximum accepted delay, and the second delay requirement represents an expected delay, and the second delay requirement represents a smaller delay than the first delay requirement;
selecting, by the global management platform, a first available site that meets the first delay requirement, wherein the first available site is selected from among sites managed by the global management platform;
deploying, by the global management platform, one or more first application instances on the first available site, wherein a quantity of connections that can be established to the one or more first application instances is less than or equal to the first quantity of connections;
determining a first total communication delay of accessing all the application instances by the first client;
determining a second total communication delay of accessing all the application instances by the first client after a second application instance is moved;
comparing the first total communication delay and the second total communication delay: and
in response to determining that the second total communication delay is smaller than the first total communication delay, changing a deployment location of the second application instance according to a QoS information table.