US 12,425,342 B2
Layer 4 load aware load balancing
Zhiyuan Yao, Paris (FR); Yoann Louis Simon Desmouceaux, Paris (FR); Pierre Pfister, Roquefort-les-Pins (FR); and William Mark Townsley, San Francisco, CA (US)
Assigned to Cisco Technology, Inc.
Filed by Cisco Technology, Inc., San Jose, CA (US)
Filed on May 18, 2022, as Appl. No. 17/747,421.
Prior Publication US 2023/0403235 A1, Dec. 14, 2023
Int. Cl. H04L 47/125 (2022.01); H04L 43/067 (2022.01); H04L 43/0852 (2022.01); H04L 43/20 (2022.01); H04L 47/2441 (2022.01); H04L 47/52 (2022.01); H04L 67/1008 (2022.01)
CPC H04L 47/125 (2013.01) [H04L 43/067 (2013.01); H04L 43/0852 (2013.01); H04L 47/2441 (2013.01); H04L 47/522 (2013.01); H04L 67/1008 (2013.01); H04L 43/20 (2022.05)]
20 Claims
1. A method comprising:
inferring, by a computing device, server processing speed data associated with a plurality of servers from flow duration data associated with a plurality of flows associated with the plurality of servers, wherein inferring the server processing speed data comprises using a Softmax normalization function on the flow duration data associated with the plurality of flows associated with the plurality of servers;
obtaining a plurality of queue lengths respectively associated with the plurality of servers;
determining a Shortest Expected Delay (SED) score for each of the plurality of servers from the inferred server processing speed data and the plurality of queue lengths; and
assigning a flow to the one of the plurality of servers having a lowest SED score.
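The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation, and the claim does not fix the exact formulas. The sketch assumes that the Softmax is applied to the negated mean flow duration of each server (so that shorter flows imply a faster server) and that the SED score takes the conventional form (queue length + 1) / processing speed. The names infer_speeds, sed_scores, and assign_flow are hypothetical.

```python
import math
from typing import Dict, List


def infer_speeds(flow_durations: Dict[str, List[float]]) -> Dict[str, float]:
    """Infer relative server processing speeds from observed flow durations.

    Assumption: Softmax is taken over the negated mean duration per server,
    so a server with shorter flows receives a larger share. Every server
    must have at least one observed flow duration.
    """
    logits = {s: -sum(d) / len(d) for s, d in flow_durations.items()}
    top = max(logits.values())  # subtract the maximum for numerical stability
    exps = {s: math.exp(v - top) for s, v in logits.items()}
    total = sum(exps.values())
    return {s: e / total for s, e in exps.items()}


def sed_scores(speeds: Dict[str, float],
               queue_lengths: Dict[str, int]) -> Dict[str, float]:
    """Assumed SED form: (queue length + 1) / inferred processing speed."""
    return {s: (queue_lengths[s] + 1) / speeds[s] for s in speeds}


def assign_flow(flow_durations: Dict[str, List[float]],
                queue_lengths: Dict[str, int]) -> str:
    """Return the server with the lowest SED score."""
    scores = sed_scores(infer_speeds(flow_durations), queue_lengths)
    return min(scores, key=scores.get)


if __name__ == "__main__":
    durations = {"a": [0.1, 0.1], "b": [1.0, 1.0]}  # seconds, illustrative values
    queues = {"a": 2, "b": 2}
    print(assign_flow(durations, queues))
```

With equal queue lengths the sketch selects the server whose flows finish sooner. A sufficiently long queue on that server shifts the choice to the slower one, which is the trade-off the SED score is meant to capture.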