CPC G05B 13/042 (2013.01) [B60W 40/064 (2013.01); B60W 40/08 (2013.01); B60W 40/105 (2013.01); G05B 13/021 (2013.01); G05B 13/024 (2013.01); G05B 13/0265 (2013.01); G05B 13/041 (2013.01); G05B 19/4065 (2013.01); G05B 19/41835 (2013.01); G05B 23/0229 (2013.01); G06F 18/2193 (2023.01); G06N 5/043 (2013.01); G06N 5/046 (2013.01); G06N 7/01 (2023.01); G06Q 10/06315 (2013.01); G06Q 10/06395 (2013.01); G06Q 30/0202 (2013.01); G05B 2219/36301 (2013.01); G06Q 10/087 (2013.01)] | 12 Claims |
1. A method for controlling an environment, the method comprising:
obtaining data specifying baseline probability distributions over possible settings for each of a plurality of controllable elements of an environment;
maintaining a causal model that identifies causal relationships between the possible settings and environment responses to the possible settings;
repeatedly performing the following:
selecting, by a control system, control settings for the environment based on the causal model and values for a particular internal parameter of the control system that are sampled from a range of possible values for the particular internal parameter;
selecting control settings for the environment based on the baseline probability distributions;
monitoring environment responses to (i) the control settings selected based on the causal model and (ii) the control settings selected based on the baseline probability distributions;
determining, for each of the possible values of the particular internal parameter, a measure of a difference between (i) a current system performance when the control settings are selected based on the causal model and on the possible value of the particular internal parameter and (ii) a baseline system performance measure for the control settings selected based on the baseline probability distributions; and
updating how frequently each of the possible values is sampled based on the measures of difference.
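For illustration only, the loop recited in claim 1 can be sketched as a small simulation. This is a minimal sketch under stated assumptions, not the claimed implementation. The toy environment (`true_effect`, `respond`), the choice of an exploration rate as the "particular internal parameter", the running-mean stand-in for the causal model, and the exponential weighting rule are all hypothetical choices that do not appear in the claim.

```python
import math
import random


def run_control_loop(n_iterations=2000, seed=0):
    """Toy version of the claimed loop; every name and constant here is hypothetical."""
    rng = random.Random(seed)
    n_settings = 3
    # Hidden effect of each setting of two controllable elements (assumed toy environment).
    true_effect = [[0.0, 0.5, 1.0], [1.0, 0.2, 0.0]]
    n_elements = len(true_effect)

    def respond(chosen):
        # Environment response: sum of the hidden effects plus measurement noise.
        return sum(true_effect[i][s] for i, s in enumerate(chosen)) + rng.gauss(0.0, 0.1)

    # Baseline probability distributions over possible settings (uniform, an assumption).
    baseline = [[1.0 / n_settings] * n_settings for _ in range(n_elements)]

    # Stand-in causal model: mean response observed for each (element, setting) pair.
    resp_sum = [[0.0] * n_settings for _ in range(n_elements)]
    resp_cnt = [[0] * n_settings for _ in range(n_elements)]

    def estimate(i, s):
        return resp_sum[i][s] / resp_cnt[i][s] if resp_cnt[i][s] else 0.0

    # Internal parameter: an exploration rate with three possible values (assumed).
    candidates = [0.05, 0.3, 0.9]
    weights = [1.0] * len(candidates)  # how frequently each value is sampled
    perf_sum = [0.0] * len(candidates)
    perf_cnt = [0] * len(candidates)
    base_sum, base_cnt = 0.0, 0

    for _ in range(n_iterations):
        # Sample a value of the internal parameter.
        k = rng.choices(range(len(candidates)), weights=weights)[0]
        eps = candidates[k]

        # Select control settings based on the causal model and the sampled value.
        model_choice = [
            rng.randrange(n_settings) if rng.random() < eps
            else max(range(n_settings), key=lambda s, i=i: estimate(i, s))
            for i in range(n_elements)
        ]
        # Select control settings based on the baseline distributions.
        base_choice = [
            rng.choices(range(n_settings), weights=baseline[i])[0]
            for i in range(n_elements)
        ]

        # Monitor environment responses to both selections.
        model_resp = respond(model_choice)
        base_resp = respond(base_choice)

        # Simplification: only the randomized baseline trials update the model.
        for i, s in enumerate(base_choice):
            resp_sum[i][s] += base_resp
            resp_cnt[i][s] += 1

        perf_sum[k] += model_resp
        perf_cnt[k] += 1
        base_sum += base_resp
        base_cnt += 1

        # Measure of difference per possible value: mean performance minus baseline mean.
        base_mean = base_sum / base_cnt
        diffs = [
            perf_sum[j] / perf_cnt[j] - base_mean if perf_cnt[j] else 0.0
            for j in range(len(candidates))
        ]
        # Update sampling frequencies from the measures of difference.
        weights = [math.exp(d / 0.5) for d in diffs]

    total = sum(weights)
    return dict(zip(candidates, (w / total for w in weights)))


if __name__ == "__main__":
    print(run_control_loop())
```

In this toy setup, values of the internal parameter whose model-based selections outperform the baseline by a wider margin end up being sampled more often. Any other monotone mapping from the difference measures to sampling frequencies would fit the same outline.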