US 11,734,937 B1
	Creating text classification machine learning models
Yahor Pushkin, Redmond, WA (US); Sravan Babu Bodapati, Bellevue, WA (US); Rishita Rajal Anubhai, Seattle, WA (US); Dimitrios Soulios, Seattle, WA (US); and Yaser Al-Onaizan, Cortlandt Manor, NY (US)
Assigned to Amazon Technologies, Inc., Seattle, WA (US)
Filed by Amazon Technologies, Inc., Seattle, WA (US)
Filed on Jan. 2, 2020, as Appl. No. 16/733,079.
Int. Cl. G06Q 10/10 (2023.01); G06Q 10/06 (2023.01); G06Q 30/06 (2023.01); G06Q 30/02 (2023.01); G06V 30/10 (2022.01); G06N 5/04 (2023.01); G06N 20/20 (2019.01); G06F 18/214 (2023.01)

CPC G06V 30/10 (2022.01) [G06F 18/2155 (2023.01); G06N 5/04 (2013.01); G06N 20/20 (2019.01)]

20 Claims

1. A computer-implemented method comprising:

receiving, at a machine learning service of a provider network from a computing device of a user located outside the provider network, a first plurality of unlabeled documents from a domain of the user, a second plurality of labeled documents from the domain of the user, and a request to create a text classifier for the domain;

performing a first training iteration of a language machine learning model on the first plurality of unlabeled documents from the domain of the user to pretrain the language machine learning model for prediction objectives for unlabeled data;

performing a second training iteration of the pretrained language machine learning model on the second plurality of labeled documents to further train the language machine learning model as the text classifier for the domain;

hosting the language machine learning model within the provider network in association with an endpoint;

receiving an inference request at the endpoint for an unlabeled document of the user;

generating, by the language machine learning model, an inference based on the inference request for the unlabeled document of the user; and

transmitting the inference to a client application or to a storage location.