US 12,346,442 B2
System and method for computer security augmented data set algorithm training
Mantas Briliauskas, Vilnius (LT); and Aleksandr Ševčenko, Vilnius (LT)
Assigned to UAB 360 IT, Vilnius (LT)
Filed by UAB 360 IT, Vilnius (LT)
Filed on May 30, 2023, as Appl. No. 18/203,462.
Application 18/203,462 is a continuation in part of application No. 17/728,518, filed on Apr. 25, 2022, granted, now 11,663,334.
Prior Publication US 2023/0342466 A1, Oct. 26, 2023
This patent is subject to a terminal disclaimer.
Int. Cl. G06F 21/56 (2013.01)
CPC G06F 21/565 (2013.01) [G06F 21/563 (2013.01); G06F 2221/033 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system for data augmentation for use in a training of and application of an anti-malware (AM) machine learning model comprising:
a processor; and
a memory coupled to the processor, the memory having stored therein at least one of programs or instructions executable by the processor to configure the system to:
receive a first plurality of binary files each having a first binary structure, wherein the first plurality of binary files include one or more known files containing malicious content and one or more known files not containing malicious content;
alter a source code of each of the first plurality of binary files to produce a second plurality of binary files each having a second binary structure that is different from the first binary structure, and wherein each altered binary file is functionality similar to the corresponding file in the first plurality of binary files from which it was produced;
use the first and second plurality of binary files to train the AM machine learning model to distinguish between binary files containing malicious content and binary files not containing malicious content; and
apply the trained AM machine learning model to identify unknown binary files containing malicious content.