CPC G06F 9/505 (2013.01) [G06F 9/5072 (2013.01)] | 20 Claims |
1. A method comprising:
receiving monitoring information from each respective containerized edge data center unit of a plurality of containerized edge data center units, wherein the monitoring information includes host metrics information corresponding to one or more physical components of the respective containerized edge data center unit, and inference performance information associated with one or more machine learning (ML) or artificial intelligence (AI) inference workloads implemented by a high-performance computing (HPC) engine included within the respective containerized edge data center unit;
receiving respective status information corresponding to a plurality of connected edge assets, wherein each connected edge asset is associated with one or more containerized edge data center units of the plurality of containerized edge data center units, and wherein the plurality of containerized edge data center units and the plurality of connected edge assets are included in a fleet of edge devices;
displaying, using a remote fleet management graphical user interface (GUI), at least a portion of the monitoring information or the status information corresponding to a selected subset of the fleet of edge devices, wherein the selected subset is determined based on one or more user selection inputs to the remote fleet management GUI;
receiving, using the remote fleet management GUI, one or more user configuration inputs indicative of an updated configuration associated with at least one containerized edge data center unit of the selected subset of the fleet of edge devices; and
transmitting, from a cloud computing environment associated with the remote fleet management GUI, control information corresponding to the updated configuration, wherein the control information is generated based on the one or more user configuration inputs and comprises model finetuning information for the one or more ML or AI inference workloads, and wherein the control information is transmitted to the at least one containerized edge data center unit of the selected subset.
|
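For illustration only, the sequence of steps recited in claim 1 could be sketched as below. This is a minimal sketch under stated assumptions, not the patented implementation: every class, field, and method name (`FleetManager`, `MonitoringInfo`, `AssetStatus`, `outbox`, and so on) is hypothetical, the GUI display step is reduced to returning the selected data, and transmission from the cloud computing environment is stood in for by an in-memory list.

```python
from dataclasses import dataclass, field


@dataclass
class MonitoringInfo:
    """Hypothetical record of monitoring information from one unit."""
    unit_id: str
    host_metrics: dict            # metrics for the unit's physical components
    inference_performance: dict   # figures for ML/AI inference workloads


@dataclass
class AssetStatus:
    """Hypothetical status record for one connected edge asset."""
    asset_id: str
    unit_ids: list                # units this asset is associated with
    status: str


@dataclass
class FleetManager:
    """Hypothetical cloud-side manager; names are assumptions."""
    monitoring: dict = field(default_factory=dict)
    assets: dict = field(default_factory=dict)
    outbox: list = field(default_factory=list)  # stand-in for cloud-to-edge transport

    def receive_monitoring(self, info):
        # Step 1: receive monitoring information from a unit.
        self.monitoring[info.unit_id] = info

    def receive_status(self, status):
        # Step 2: receive status information for a connected edge asset.
        self.assets[status.asset_id] = status

    def select_subset(self, unit_ids):
        # Step 3: gather what a GUI would display for the user-selected subset.
        wanted = set(unit_ids)
        units = [m for uid, m in self.monitoring.items() if uid in wanted]
        assets = [a for a in self.assets.values() if wanted & set(a.unit_ids)]
        return units, assets

    def apply_configuration(self, unit_id, selected_unit_ids, config_input):
        # Steps 4-5: turn a user configuration input into control information
        # that includes model finetuning information, then queue it for the unit.
        if unit_id not in selected_unit_ids:
            raise ValueError("unit is not in the selected subset")
        control = {
            "unit_id": unit_id,
            "updated_configuration": config_input["configuration"],
            "model_finetuning": config_input["finetuning"],
        }
        self.outbox.append(control)
        return control


manager = FleetManager()
manager.receive_monitoring(MonitoringInfo("unit-a", {"temp_c": 41.0}, {"latency_ms": 12.5}))
manager.receive_monitoring(MonitoringInfo("unit-b", {"temp_c": 39.5}, {"latency_ms": 15.0}))
manager.receive_status(AssetStatus("camera-1", ["unit-a"], "online"))

selected = ["unit-a"]
shown_units, shown_assets = manager.select_subset(selected)
sent = manager.apply_configuration(
    "unit-a",
    selected,
    {"configuration": {"batch_size": 8}, "finetuning": {"dataset": "example-set"}},
)
```

The sample metric names and values (`temp_c`, `latency_ms`, `batch_size`, `example-set`) are invented placeholders; the claim does not specify which metrics or finetuning parameters are used.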