US 12,271,829 B2
Method, electronic device, and computer program product for managing training data
Zijia Wang, WeiFang (CN); Jiacheng Ni, Shanghai (CN); and Zhen Jia, Shanghai (CN)
Assigned to Dell Products L.P., Round Rock, TX (US)
Filed by Dell Products L.P., Round Rock, TX (US)
Filed on Feb. 8, 2022, as Appl. No. 17/666,736.
Claims priority of application No. 202210073368.5 (CN), filed on Jan. 21, 2022.
Prior Publication US 2023/0237344 A1, Jul. 27, 2023
Int. Cl. G06F 16/00 (2019.01); G06F 16/25 (2019.01); G06N 5/022 (2023.01)
CPC G06N 5/022 (2013.01) [G06F 16/258 (2019.01)]
20 Claims
 
1. A method for managing training data, comprising:
storing, in response to a determination that new training data is collected by a sensor, the new training data into a collected data stream of a storage pool in a processor-based machine learning system;
storing, in response to a determination that the new training data and historical data stored in a full data stream of the storage pool are refined into refined training data utilizing a dataset distillation algorithm, the refined training data into a refined data stream of the storage pool;
storing the new training data into the full data stream of the storage pool; and
providing, in response to receiving a use request for training data from at least one of an edge device and a cloud, training data from at least portions of one or more of the collected data stream, the full data stream and the refined data stream.
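
The data flow recited in claim 1 can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `StoragePool`, `on_new_data`, `provide`, and `placeholder_distill` are hypothetical. The distillation step is a trivial evenly spaced subsample standing in for a real dataset distillation algorithm. Replacing the refined stream on each refinement, rather than appending to it, is also an assumption of this sketch.

```python
from typing import Callable, Dict, List, Optional, Sequence

Sample = float  # hypothetical sample type; real sensor data would be richer


def placeholder_distill(samples: Sequence[Sample], keep: int = 2) -> List[Sample]:
    """Stand-in for a dataset distillation algorithm (assumption).

    This only picks evenly spaced samples; it is NOT real distillation.
    """
    if len(samples) <= keep:
        return list(samples)
    step = len(samples) / keep
    return [samples[int(i * step)] for i in range(keep)]


class StoragePool:
    """Hypothetical storage pool holding the three data streams of claim 1."""

    def __init__(self, distill: Callable[[Sequence[Sample]], List[Sample]] = placeholder_distill):
        self.distill = distill
        self.streams: Dict[str, List[Sample]] = {
            "collected": [],  # newly collected sensor data
            "full": [],       # all historical data
            "refined": [],    # distilled data
        }

    def on_new_data(self, new_data: Sequence[Sample]) -> None:
        new = list(new_data)
        # Step 1: store the new data in the collected data stream.
        self.streams["collected"].extend(new)
        # Step 2: refine the new data together with the historical data
        # in the full stream, then store the result in the refined stream.
        # (Overwriting the refined stream is an assumption of this sketch.)
        self.streams["refined"] = self.distill(self.streams["full"] + new)
        # Step 3: store the new data in the full data stream.
        self.streams["full"].extend(new)

    def provide(self, names: Sequence[str], limit: Optional[int] = None) -> Dict[str, List[Sample]]:
        # Step 4: answer a use request (e.g. from an edge device or a cloud)
        # with at least portions of one or more streams.
        return {name: self.streams[name][:limit] for name in names}


pool = StoragePool()
pool.on_new_data([1.0, 2.0, 3.0, 4.0])
pool.on_new_data([5.0, 6.0])
response = pool.provide(["refined", "full"], limit=3)
```

In this sketch a requester chooses which streams to read and how much of each. For example, a resource-constrained edge device might request only the refined stream, while a cloud trainer might request the full stream. That division of use is an illustration, not something claim 1 requires.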