| CPC G10L 13/08 (2013.01) [G06F 40/40 (2020.01); G10L 13/047 (2013.01); G10L 2013/083 (2013.01)] | 20 Claims |

|
1. A method comprising:
receiving an input including one or more tokens;
generating, using one or more rule-based algorithms, a set of one or more plain text representations corresponding to the one or more tokens;
determining, using the one or more rule-based algorithms and for at least one individual plain text representation of the set of one or more plain text representations, one or more weights;
determining, from the set of one or more plain text representations and based at least in part on comparing the one or more weights to one or more thresholds, a subset of one or more plain text representations; and
selecting, using a trained language model, a plain text representation from the subset of one or more plain text representations.
|