US 11,706,111 B1
Anti-fragile network
John Michael Stivoric, Pittsburgh, PA (US); David Andre, San Francisco, CA (US); Ryan Butterfoss, San Francisco, CA (US); Rebecca Radkoff, San Francisco, CA (US); Salil Vijaykumar Pradhan, San Jose, CA (US); Grace Taixi Brentano, Redwood City, CA (US); and Lam Thanh Nguyen, Mountain View, CA (US)
Assigned to X Development LLC, Mountain View, CA (US)
Filed by X Development LLC, Mountain View, CA (US)
Filed on Apr. 29, 2022, as Appl. No. 17/732,957.
Claims priority of provisional application 63/182,443, filed on Apr. 30, 2021.
Int. Cl. H04L 43/065 (2022.01); H04L 41/0604 (2022.01); H04L 41/12 (2022.01); H04L 43/0817 (2022.01)
CPC H04L 43/065 (2013.01) [H04L 41/0627 (2013.01); H04L 41/12 (2013.01); H04L 43/0817 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A computer-implemented method comprising:
receiving, at one or more computing devices, parameter data from a network of nodes, the parameter data comprising attributes, policies, and action spaces for each node included in the network of nodes;
configuring, by the one or more computing devices, one or more interruptive events on one or more nodes included in the network of nodes;
determining, by the one or more computing devices, a first action of each node included in the network of nodes in response to the one or more interruptive events;
determining, by the one or more computing devices, a first performance metric, for each node, that corresponds to the first action, wherein the first performance metric is determined based on at least a first reward value associated with the first action;
performing a reinforcement learning process to train a machine learning model to generate actions to be performed by nodes that improve anti-fragility of the network, including continuously updating, by the one or more computing devices, the first action for each node in the reinforcement learning process until an anti-fragility metric of a final action satisfies a performance threshold representing the anti-fragility of the network, wherein updating the first action for a node in the reinforcement learning process comprises:
determining an updated action for the node based on the first performance metric of the first action, and
determining an updated performance metric, for the node, that corresponds to the updated action, wherein the updated performance metric is determined based on an updated reward value associated with the updated action, and wherein the updated reward value is larger than the first reward value; and
transmitting, by the one or more computing devices, the final action for each node to the network of nodes, whereby each node performs the final action in response to the one or more interruptive events.
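The loop recited in claim 1 (first actions, per-node performance metrics derived from rewards, iterative updates until a network-level anti-fragility metric satisfies a threshold, then transmission of final actions) can be sketched as follows. This is a minimal illustrative stand-in, not the patented method: the node names, action space, reward shape, and the averaging anti-fragility metric are all hypothetical assumptions, and a simple greedy improvement step substitutes for the trained machine learning model's policy update.

```python
import random

# Hypothetical sketch of the claimed loop. All names, the reward function,
# and the anti-fragility metric are illustrative assumptions.

random.seed(0)

NODES = ["node-a", "node-b", "node-c"]          # network of nodes
ACTION_SPACE = ["reroute", "throttle", "replicate", "shed-load"]
THRESHOLD = 0.9  # performance threshold representing network anti-fragility


def reward(node, action, event):
    """Hypothetical reward: how well `action` mitigates `event` at `node`."""
    base = {"reroute": 0.6, "throttle": 0.4, "replicate": 0.8, "shed-load": 0.3}
    return min(1.0, base[action] + random.uniform(0.0, 0.2))


def anti_fragility(metrics):
    """Aggregate per-node performance metrics into a network-level metric
    (a plain average here, purely for illustration)."""
    return sum(metrics.values()) / len(metrics)


event = "link-failure"  # a configured interruptive event

# Determine a first action and a first performance metric for each node.
actions = {n: random.choice(ACTION_SPACE) for n in NODES}
metrics = {n: reward(n, actions[n], event) for n in NODES}

# Update each node's action until the anti-fragility metric of the
# resulting actions satisfies the performance threshold. A greedy
# accept-if-better rule stands in for the reinforcement-learning update;
# an accepted update always has a larger reward value than the one it
# replaces, matching the claim's "updated reward value is larger" clause.
while anti_fragility(metrics) < THRESHOLD:
    for n in NODES:
        candidate = random.choice(ACTION_SPACE)
        candidate_metric = reward(n, candidate, event)
        if candidate_metric > metrics[n]:
            actions[n], metrics[n] = candidate, candidate_metric

# Final actions, as would be transmitted back to the network of nodes.
final_actions = actions
```

In a full reinforcement-learning implementation the greedy sampling step would be replaced by a learned policy (e.g. a Q-function or policy network) trained across many simulated interruptive events, but the control flow mirrors the claim: configure an event, score each node's action by its reward, and iterate until the network-level metric clears the threshold.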