US 11,965,666 B2
Control method for air conditioner, and device for air conditioner and storage medium
Jianming Tan, Guangdong (CN); Shaobin Li, Guangdong (CN); Dechao Song, Guangdong (CN); Dong Yue, Guangdong (CN); Chong Chen, Guangdong (CN); Xiaoyu Luo, Guangdong (CN); Jiabi Deng, Guangdong (CN); Pengfei Wang, Guangdong (CN); and Wenxuan Xiao, Guangdong (CN)
Assigned to Gree Electric Appliances, Inc. of Zhuhai, Zhuhai (CN)
Appl. No. 17/600,506
Filed by Gree Electric Appliances, Inc. of Zhuhai, Guangdong (CN)
PCT Filed Dec. 16, 2019, PCT No. PCT/CN2019/125505
§ 371(c)(1), (2) Date Sep. 30, 2021,
PCT Pub. No. WO2020/199648, PCT Pub. Date Oct. 8, 2020.
Claims priority of application No. 201910258756.9 (CN), filed on Apr. 1, 2019.
Prior Publication US 2022/0205666 A1, Jun. 30, 2022
Int. Cl. F24F 11/63 (2018.01); G05B 13/02 (2006.01); G05B 19/042 (2006.01)
CPC F24F 11/63 (2018.01) [G05B 13/027 (2013.01); G05B 19/042 (2013.01); G05B 2219/2614 (2013.01)]
15 Claims
 
1. A control method for an air conditioner, comprising:
constructing a first reward matrix according to multiple sets of target operating parameters of an air conditioner, wherein each of the multiple sets of the target operating parameters of the air conditioner at least comprises a target indoor environment temperature, a target outdoor environment temperature, a target setting temperature, a target intermediate temperature of an indoor evaporator, a target intermediate temperature of an outdoor condenser, a first target operating frequency of a compressor, a first target opening degree of an electronic expansion valve and a first target rotating speed of an external fan;
calculating a maximum expected benefit of performing a current action in a current state based on the first reward matrix and a Q-learning algorithm, wherein the current state is represented by a current indoor environment temperature and a current outdoor environment temperature, and the current action is represented by a current operating frequency of the compressor, a current opening degree of the electronic expansion valve and a current rotating speed of the external fan; and
acquiring target action parameters under the maximum expected benefit, and controlling operation of the air conditioner based on second target action parameters, wherein the second target action parameters at least comprise a second target operating frequency of the compressor, a second target opening degree of the electronic expansion valve and a second target rotating speed of the external fan.
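
The three steps recited in claim 1 can be illustrated with a minimal tabular Q-learning sketch. It is not the patented implementation: the claim does not say how rewards are assigned, how temperatures are discretised, or how the room responds to an action. The state grid, action tuples, 0/1 reward rule, `next_state` transition, learning rate, discount factor and sweep count below are therefore all hypothetical. For simplicity and determinism the sketch sweeps every state-action pair instead of exploring randomly.

```python
# Toy tabular Q-learning sketch. Every numeric value here is hypothetical.

# States: (indoor temperature in C, outdoor temperature in C), discretised.
STATES = [(30, 35), (28, 35), (26, 35)]
# Actions: (compressor frequency in Hz, expansion valve opening in steps,
# external fan speed in rpm).
ACTIONS = [(70, 300, 900), (50, 250, 800), (30, 200, 600)]
# Assumed target sets: the action recorded as the target for each state.
TARGETS = {STATES[0]: ACTIONS[0], STATES[1]: ACTIONS[1], STATES[2]: ACTIONS[2]}


def build_reward_matrix(states, actions, targets):
    """Assumed rule: reward 1.0 where the action matches the state's target, else 0.0."""
    return [[1.0 if targets[s] == a else 0.0 for a in actions] for s in states]


def next_state(s, a, reward):
    """Toy transition (assumption): a rewarded action cools the room by one step."""
    return min(s + 1, len(reward) - 1) if reward[s][a] > 0 else s


def train(reward, alpha=0.5, gamma=0.9, sweeps=300):
    """Apply the Q-learning update to every (state, action) pair, repeatedly."""
    n_states, n_actions = len(reward), len(reward[0])
    q = [[0.0] * n_actions for _ in range(n_states)]
    for _ in range(sweeps):
        for s in range(n_states):
            for a in range(n_actions):
                s2 = next_state(s, a, reward)
                # Immediate reward plus discounted best value of the next state.
                td_target = reward[s][a] + gamma * max(q[s2])
                q[s][a] += alpha * (td_target - q[s][a])
    return q


def greedy_actions(q):
    """For each state, pick the action with the largest learned Q value."""
    return {
        STATES[s]: ACTIONS[max(range(len(row)), key=row.__getitem__)]
        for s, row in enumerate(q)
    }


REWARD = build_reward_matrix(STATES, ACTIONS, TARGETS)
q_table = train(REWARD)
best = greedy_actions(q_table)
print(best)
```

In this toy setting the greedy action for each state plays the role of the second target action parameters (compressor frequency, valve opening, fan speed) that would be sent to the unit. A real controller would need a measured or simulated transition model in place of `next_state`.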