| CPC G06F 8/75 (2013.01) [G06F 21/563 (2013.01); G06F 40/30 (2020.01)] | 20 Claims |

|
1. A method comprising:
providing a plurality of source code samples to an artificial intelligence model (AIM) trained to describe source code based on performing semantic analysis on the source code;
producing, by a processing device using the AIM, a plurality of semantic descriptions that describe the plurality of source code samples;
converting the plurality of semantic descriptions into a plurality of semantic embeddings; and
creating a plurality of clusters from the plurality of semantic embeddings, wherein each one of the plurality of clusters corresponds to two or more of the plurality of source code samples.
|