US 11,840,245 B2
Vehicle control data generation method, vehicle controller, vehicle control system, vehicle learning device, vehicle control data generation device, and memory medium
Yosuke Hashimoto, Nagakute (JP); Akihiro Katayama, Toyota (JP); Yuta Oshiro, Nagoya (JP); Kazuki Sugie, Toyota (JP); and Naoya Oka, Nagakute (JP)
Assigned to TOYOTA JIDOSHA KABUSHIKI KAISHA, Toyota (JP)
Filed by TOYOTA JIDOSHA KABUSHIKI KAISHA, Toyota (JP)
Filed on Dec. 21, 2020, as Appl. No. 17/128,822.
Claims priority of application No. 2020-002032 (JP), filed on Jan. 9, 2020.
Prior Publication US 2021/0213966 A1, Jul. 15, 2021
Int. Cl. B60W 50/06 (2006.01); B60W 40/12 (2012.01); B60W 50/00 (2006.01)
CPC B60W 50/06 (2013.01) [B60W 40/12 (2013.01); B60W 2050/0088 (2013.01); B60W 2510/06 (2013.01); B60W 2540/10 (2013.01)]
9 Claims
9. A non-transitory computer readable memory medium that stores a program that causes an execution device to execute a vehicle control data generation process, the generation process comprising:
obtaining, by the execution device with relationship defining data stored in a memory device, a preference variable and a state of a vehicle that is based on a detection value of a sensor, the preference variable indicating a relative preference of a user for two or more requested elements, the relationship defining data defining a relationship between the state of the vehicle and an action variable related to an operation of an electronic device in the vehicle;
operating, by the execution device with the relationship defining data stored in the memory device, the electronic device;
providing, by the execution device with the relationship defining data stored in the memory device, based on the obtained state of the vehicle, a greater reward when a characteristic of the vehicle meets a standard than when the characteristic of the vehicle does not meet the standard; and
updating, by the execution device with the relationship defining data stored in the memory device, the relationship defining data by inputting, to a predetermined update map, the obtained state of the vehicle, the value of the action variable used to operate the electronic device, and the reward corresponding to the operation of the electronic device, wherein
the update map outputs the updated relationship defining data so as to increase an expected return for the reward in a case where the electronic device is operated in accordance with the relationship defining data,
the two or more requested elements include at least two of three requested elements, the three requested elements including a requested element indicating a high acceleration response of the vehicle, a requested element indicating that at least one of vibration or noise of the vehicle is small, and a requested element indicating a high energy use efficiency, and
the providing the reward includes changing a reward that is provided when a characteristic of the vehicle is a predetermined characteristic in a case where the value of the preference variable is a second value such that the changed reward differs from the reward that is provided when the characteristic of the vehicle is the predetermined characteristic in a case where the value of the preference variable is a first value.
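
The generation process recited in claim 9 can be illustrated with a minimal sketch. The claim does not name a specific learning algorithm, so the sketch assumes tabular Q-learning as one possible form of the "update map" and a table of action values as one possible form of the "relationship defining data". All identifiers (PREF_RESPONSE, PREF_EFFICIENCY, ACTIONS, reward, select_action, update_map), the reward magnitudes, and the learning parameters are hypothetical and are chosen only for illustration.

```python
import random
from collections import defaultdict

# Hypothetical preference values; the claim only requires a first and a second value.
PREF_RESPONSE = 0    # "first value": user favors acceleration response (assumption)
PREF_EFFICIENCY = 1  # "second value": user favors energy use efficiency (assumption)

# Hypothetical discretized values of the action variable.
ACTIONS = (0, 1, 2)


def reward(meets_standard: bool, preference: int) -> float:
    """Give a greater reward when the characteristic meets the standard.

    For the same characteristic, the reward differs between the first and
    second preference values. The magnitudes are illustrative assumptions.
    """
    if not meets_standard:
        return -1.0
    return 10.0 if preference == PREF_RESPONSE else 5.0


def select_action(q, state, epsilon, rng):
    """Choose the action variable from the relationship defining data.

    Epsilon-greedy exploration is an assumption, not part of the claim.
    """
    if rng.random() < epsilon:
        return rng.choice(ACTIONS)
    return max(ACTIONS, key=lambda a: q[(state, a)])


def update_map(q, state, action, r, next_state, alpha=0.1, gamma=0.9):
    """Assumed update map: one Q-learning step on the action-value table.

    Takes the state, the action value used, and the reward, and returns
    the updated relationship defining data.
    """
    best_next = max(q[(next_state, a)] for a in ACTIONS)
    q[(state, action)] += alpha * (r + gamma * best_next - q[(state, action)])
    return q


# Usage: one learning step with hypothetical state labels.
q_table = defaultdict(float)
r = reward(meets_standard=True, preference=PREF_RESPONSE)
q_table = update_map(q_table, "s0", 1, r, "s1")
greedy = select_action(q_table, "s0", epsilon=0.0, rng=random.Random(0))
```

In this sketch the preference variable affects only the reward, so the same update step yields different relationship defining data for users with different preferences.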