US 12,222,911 B1
String data error detection and repair
Mukul Singh, Delhi (IN); José Pablo Cambronero Sánchez, New Haven, CT (US); Sumit Gulwani, Sammamish, WA (US); Vu Minh Le, Redmond, WA (US); Carina Suzana Negreanu, Cambridge (GB); and Gust Ben Anneloes Verbruggen, Keerbergen (BE)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Sep. 28, 2023, as Appl. No. 18/374,131.
Int. Cl. G06F 16/215 (2019.01); G06F 16/23 (2019.01); G06F 40/18 (2020.01); G06F 40/30 (2020.01); G06N 3/0455 (2023.01)
CPC G06F 16/215 (2019.01) [G06F 16/2365 (2019.01); G06F 40/18 (2020.01); G06F 40/30 (2020.01); G06N 3/0455 (2023.01)] 20 Claims
OG exemplary drawing
 
1. A string error mitigation method performed by a computing system, the method comprising:
learning a regular-expression-based pattern that characterizes at least a predefined fraction of a set of string data values, the learning comprising utilizing an inductive synthesis framework;
identifying, and masking out with a mask value, a substring based on a predefined set of semantic data types;
deriving one or more edits to repair a string which contains the mask value, the deriving comprising extracting a minimal sequence of edits through dynamic programming; and
replacing the mask value by a replacement substring which conforms to the regular-expression-based pattern.