US 11,057,480 B1
Methods and architecture for load-correcting requests for serverless functions
Ian James Wells, San Jose, CA (US); and Kyle Andrew Donald Mestery, Woodbury, MN (US)
Assigned to Cisco Technology, Inc., San Jose, CA (US)
Filed by Cisco Technology, Inc., San Jose, CA (US)
Filed on Apr. 10, 2020, as Appl. No. 16/846,111.
Int. Cl. H04L 29/08 (2006.01); H04L 12/24 (2006.01)
CPC H04L 67/16 (2013.01) [H04L 41/12 (2013.01); H04L 67/1019 (2013.01); H04L 67/28 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method for reducing latency of serverless computing in a cloud computing network using one to one-of-many routing, comprising:
at a server node of the cloud computing network suitable for running a serverless function, preloading an identity of an alternate node of the cloud computing network for running the serverless function, the identity of the alternate node stored in a memory as an identified alternate node;
receiving, at the server node and from a client node, a request to run the serverless function;
at the server node, determining that the server node is not currently running the serverless function or determining that the server node is currently overloaded; and
at the server node, based on determining that the serverless function is not currently running at the server node or based on determining that the server node is currently overloaded, redirecting the request to the identified alternate node while refraining from assessing whether the identified alternate node is currently running the serverless function.