US 12,130,832 B2
Extensible data platform with database domain extensions
Mehmet Kadri Umay, Redmond, WA (US); Imran Siddique, Bellevue, WA (US); Nayana Singh Patel, Mercer Island, WA (US); and Jyothsna Devi Bijjam, Redmond, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Sep. 18, 2023, as Appl. No. 18/468,857.
Application 18/468,857 is a continuation of application No. 17/351,969, filed on Jun. 18, 2021, granted, now 11,768,849.
Claims priority of provisional application 63/161,289, filed on Mar. 15, 2021.
Prior Publication US 2024/0004896 A1, Jan. 4, 2024
Int. Cl. G06F 16/25 (2019.01); G06F 16/16 (2019.01); G06F 16/188 (2019.01); G06F 16/21 (2019.01); G06F 16/23 (2019.01); G06F 16/18 (2019.01)
CPC G06F 16/258 (2019.01) [G06F 16/164 (2019.01); G06F 16/196 (2019.01); G06F 16/211 (2019.01); G06F 16/2365 (2019.01); G06F 16/2379 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A computing system comprising:
one or more server computing devices including one or more processors configured to execute instructions to implement:
a domain extensibility module that provides software development tools for building domain extensions for a database platform of the computing system; and
a data ingestion module that provides software development tools for defining a metadata schema for extracting metadata from data files stored on the database platform, and generating a metadata extraction pipeline to extract metadata based on the defined metadata schema, wherein
the one or more processors are configured to:
receive a set of data having a legacy file format from a domain-specific data platform, the domain-specific data platform being configured to aggregate data detected by one or more sensors operating in a domain associated with the domain-specific data platform;
define a target metadata schema that includes one or more metadata fields that will be populated during a data ingestion process;
define a target domain extension that defines one or more new file formats different from the legacy file format for storing the received set of data after performing the data ingestion process;
ingest the received set of data using a metadata extraction pipeline to generate metadata files based on the target metadata schema;
store the ingested set of data and the generated metadata files in the defined one or more new file formats based on the target domain extension; and
provide a network accessible endpoint for accessing the ingested set of data and the metadata files,
the domain extensions define a data type for data to be stored on the database platform, and storage and infrastructure components for the database platform for storing that defined data type, and
the database platform provides functionality for enabling connectivity between the database platform and legacy applications via file system mounting.