| CPC G06F 8/71 (2013.01) [G06F 8/65 (2013.01)] | 10 Claims |

|
1. A method for managing deep-learning model files in a Kubernetes system, comprising:
monitoring, by a network device functioning as a master node in the Kubernetes system and via a Kubernetes interface, a status of a target model management object;
determining, by the network device, a target inference application that is deployed on a worker node in the Kubernetes system and matches the target model management object based on a preset field of the target inference application, wherein the preset field of the target inference application identifies the target model management object being corresponding to a target model file of the target inference application;
based on the status of the target model management object, notifying, by the network device and via the Kubernetes interface, the target inference application to perform an operation on the target model file of the target inference application according to the status of the target model management object; and
performing, by the worker node in the Kubernetes system, the operation on the target model file of the target inference application according to the status of the target model management object.
|