| CPC G06F 16/3329 (2019.01) [G06F 40/284 (2020.01)] | 30 Claims |

|
1. A method for removing unauthorized information associations from a large language model (LLM) that is pre-trained on training data comprising unauthorized data (UD), the method comprising:
receiving at a synthetic generator module a list of one or more UD instance-UD association pairs between real UD instances and UD associations identified for the real UD instances in the training data;
generating by the synthetic generator module one or more synthetic UD instance-UD association pairs comprising a synthetic UD instance-UD association pair from each real UD instance-UD association pair of the one or more real UD instance-UD association pairs, the synthetic UD instance-UD association pair being configured to one of reduce or remove influence of the real UD instance-UD association pair from which the synthetic UD instance-UD association pair was generated on an output of the LLM; and
generating by a fine tuner module a fine-tuned LLM by iteratively fine-tuning the LLM based upon the one or more synthetic UD instance-UD association pairs.
|