US 12,445,358 B2
Techniques for artificial intelligence capabilities at a network device
Francesc Guim Bernat, Barcelona (ES); Suraj Prabhakaran, Aachen (DE); Kshitij A. Doshi, Tempe, AZ (US); Brinda Ganesh, Portland, OR (US); and Timothy Verrall, Pleasant Hill, CA (US)
Assigned to Intel Corporation, Santa Clara, CA (US)
Filed by Intel Corporation, Santa Clara, CA (US)
Filed on Oct. 2, 2023, as Appl. No. 18/375,934.
Application 18/375,934 is a continuation of application No. 16/235,462, filed on Dec. 28, 2018, granted, now 11,824,732.
Prior Publication US 2024/0080246 A1, Mar. 7, 2024
Int. Cl. H04L 41/16 (2022.01); G06N 3/04 (2023.01); G06N 5/04 (2023.01); H04L 41/0816 (2022.01); H04L 41/5009 (2022.01); H04L 41/5019 (2022.01); H04L 41/5051 (2022.01)
CPC H04L 41/16 (2013.01) [G06N 3/04 (2013.01); G06N 5/04 (2013.01); H04L 41/0816 (2013.01); H04L 41/5012 (2013.01); H04L 41/5019 (2013.01); H04L 41/5051 (2013.01)] 29 Claims
OG exemplary drawing
 
10. A switch device comprising:
an interface configured to couple with a plurality of ingress links and a plurality of egress links, wherein the switch device is configured to forward data from the plurality of ingress links to the plurality of egress links of the switch device;
an inference resource; and
circuitry configured to:
receive, through the interface, information associated with loading a neural network to the inference resource to support an artificial intelligence (AI) service for a network that includes the switch device;
cause the neural network to be loaded to the inference resource based on the information; and
receive, via an ingress link from among the plurality of ingress links, an AI service request, wherein if the AI service request cannot be fulfilled using the neural network loaded to the inference resource, the AI service request is to be forwarded, via an egress link from among the plurality of egress links, to a second inference resource located separate from the switch device;
wherein:
the information comprises registration data; and
the registration data comprises tenant identification data.