| CPC G06F 21/565 (2013.01) [G06F 21/563 (2013.01); G06F 2221/033 (2013.01)] | 20 Claims |

|
1. A system for data augmentation for use in a training of and application of an anti-malware (AM) machine learning model comprising:
a processor; and
a memory coupled to the processor, the memory having stored therein at least one of programs or instructions executable by the processor to configure the system to:
receive a first plurality of binary files each having a first binary structure, wherein the first plurality of binary files include one or more known files containing malicious content and one or more known files not containing malicious content;
alter a source code of each of the first plurality of binary files to produce a second plurality of binary files each having a second binary structure that is different from the first binary structure, and wherein each altered binary file is functionality similar to the corresponding file in the first plurality of binary files from which it was produced;
use the first and second plurality of binary files to train the AM machine learning model to distinguish between binary files containing malicious content and binary files not containing malicious content; and
apply the trained AM machine learning model to identify unknown binary files containing malicious content.
|