US 12,118,378 B2
Application programming interface for spinning up machine learning inferencing server on demand
Yuliya L. Feldman, Campbell, CA (US); Alexandr Nikitin, El Sobrante, CA (US); Manoj Agarwal, Cupertino, CA (US); and Chirag Rajan, Burlingame, CA (US)
Assigned to Salesforce, Inc., San Francisco, CA (US)
Filed by Salesforce, Inc., San Francisco, CA (US)
Filed on Jun. 2, 2021, as Appl. No. 17/337,377.
Prior Publication US 2022/0391239 A1, Dec. 8, 2022
Int. Cl. G06F 9/455 (2018.01); G06F 11/36 (2006.01); G06F 16/955 (2019.01); G06N 20/00 (2019.01); H04L 67/133 (2022.01)
CPC G06F 9/45558 (2013.01) [G06F 11/3664 (2013.01); G06F 16/955 (2019.01); G06N 20/00 (2019.01); H04L 67/133 (2022.05); G06F 2009/45575 (2013.01); G06F 2009/45591 (2013.01); G06F 2009/45595 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A method by one or more electronic devices for spinning up a scoring container on demand, the method comprising:
receiving, from an orchestrator component via an application programming interface (API), a request to spin up the scoring container, wherein the scoring container is configured to provide scoring functionality;
spinning up the scoring container responsive to receiving the request to spin up the scoring container; and
providing, to the orchestrator component via the API, a response to the request to spin up the scoring container, wherein the response includes a job identifier (ID) of the scoring container and a uniform resource locator (URL) to use to submit scoring requests to the scoring container, wherein the URL includes the job ID.