CPC G06F 3/0659 (2013.01) [G06F 3/0611 (2013.01); G06F 3/0673 (2013.01); G06N 20/00 (2019.01)] | 18 Claims |
1. A data storage device, comprising:
a non-volatile memory (NVM) device; and
a controller coupled to the NVM device, wherein the controller is configured to:
receive Q table data from the NVM device;
execute a Q-learning algorithm to create updated Q table data;
write the updated Q table data to a first Q table or a second Q table in the NVM device, wherein:
all actions associated with a state of one or more states of the first Q table and the second Q table are generated on a same wordline of the NVM device; and
on the same wordline, each action of the state of the first Q table and each action of the state of the second Q table are alternating; and
read the same wordline of the NVM device, wherein reading the same wordline senses all actions associated with the state of the one or more states of the first Q table and the second Q table.
|