US 11,693,839 B2
Parser for schema-free data exchange format
Yinan Li, Redmond, WA (US); Nikolaos Romanos Katsipoulakis, Pittsburgh, PA (US); Badrish Chandramouli, Redmond, WA (US); Jonathan D Goldstein, Woodinville, WA (US); and Donald Kossmann, Kirkland, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Sep. 10, 2020, as Appl. No. 17/17,513.
Application 17/017,513 is a division of application No. 15/582,313, filed on Apr. 28, 2017, granted, now 10,817,490.
Prior Publication US 2021/0004361 A1, Jan. 7, 2021
Int. Cl. G06F 16/00 (2019.01); G06F 16/22 (2019.01); G06F 16/25 (2019.01); G06F 16/23 (2019.01); G06F 16/2455 (2019.01); G06F 16/81 (2019.01); G06F 40/221 (2020.01)
CPC G06F 16/2237 (2019.01) [G06F 16/2365 (2019.01); G06F 16/2455 (2019.01); G06F 16/258 (2019.01); G06F 16/81 (2019.01); G06F 40/221 (2020.01)] 20 Claims
OG exemplary drawing
 
1. A computer implemented method of generating a structural index identifying locations of fields in a dataset having a schema-free data exchange format having multiple fields of data at different physical positions in the dataset, the method comprising:
detecting key/value structural character separators of data fields in the dataset;
detecting object begin and object end characters in the dataset;
detecting when the structural character separators occur within dataset strings and do not define a structure of the dataset;
creating a bitmap corresponding to structural character separators of the data fields that define the structure of the dataset, wherein the bitmap includes a bit for each character in the dataset and identifies physical locations of such data fields from logical locations of such data fields;
executing a query on the dataset using the bitmap to identify data from data fields responsive to the query.