US 12,450,252 B2
Extensible data platform with database domain extensions
Mehmet Kadri Umay, Redmond, WA (US); Imran Siddique, Bellevue, WA (US); Nayana Singh Patel, Mercer Island, WA (US); and Jyothsna Devi Bijjam, Redmond, WA (US)
Assigned to Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed by Microsoft Technology Licensing, LLC, Redmond, WA (US)
Filed on Oct. 17, 2024, as Appl. No. 18/919,163.
Application 18/919,163 is a continuation of application No. 18/468,857, filed on Sep. 18, 2023, granted, now 12,130,832.
Application 18/468,857 is a continuation of application No. 17/351,969, filed on Jun. 18, 2021, granted, now 11,768,849, issued on Sep. 26, 2023.
Claims priority of provisional application 63/161,289, filed on Mar. 15, 2021.
Prior Publication US 2025/0045294 A1, Feb. 6, 2025
Int. Cl. G06F 16/25 (2019.01); G06F 16/16 (2019.01); G06F 16/188 (2019.01); G06F 16/21 (2019.01); G06F 16/23 (2019.01)
CPC G06F 16/258 (2019.01) [G06F 16/164 (2019.01); G06F 16/196 (2019.01); G06F 16/211 (2019.01); G06F 16/2365 (2019.01); G06F 16/2379 (2019.01)]
20 Claims
1. A computing system comprising:
one or more server computing devices including one or more processors configured to execute instructions to implement:
a domain extensibility module that provides software development tools for building domain extensions for a database platform of the computing system;
a data ingestion module that provides software development tools for defining a metadata schema for extracting metadata from data files stored on the database platform, and generating a metadata extraction pipeline to extract metadata based on the defined metadata schema; and
a machine learning model module that provides software development tools for integrating one or more machine learning models with the computing system, wherein
the one or more processors are configured to:
receive a set of data from a domain-specific data platform, the domain-specific data platform being configured to aggregate data detected by one or more sensors operating in a domain associated with the domain-specific data platform;
define a target metadata schema that includes one or more metadata fields that will be populated during a data ingestion process;
define a target domain extension that defines one or more data types for storing the received set of data after performing the data ingestion process;
ingest the received set of data using a metadata extraction pipeline to generate metadata files based on the target metadata schema;
store the ingested set of data and the generated metadata files in the defined one or more new file formats based on the target domain extension; and
provide a network accessible endpoint for accessing the ingested set of data and the metadata files,
the domain extensions define a data type for data to be stored on the database platform, and storage and infrastructure components for the database platform for storing that defined data type,
the database platform provides functionality for enabling connectivity between the database platform and legacy applications via file system mounting, and
the machine learning model module enables the extraction of metadata for the ingested set of data.
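
For illustration only, the ingestion steps recited in claim 1 (define a target metadata schema, define a target domain extension, ingest the received data through a metadata extraction pipeline, and store the data together with the generated metadata) can be sketched as follows. This is a minimal sketch, not the patented implementation: every name here (`MetadataSchema`, `DomainExtension`, `Store`, `ingest`, `fetch`) is hypothetical, storage is a plain in-memory list, and the network accessible endpoint and the machine learning model module are left out, with `fetch` standing in for a lookup that such an endpoint might serve.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List, Tuple

Record = Dict[str, Any]


@dataclass
class MetadataSchema:
    """Hypothetical target metadata schema: field name -> extractor function."""
    fields: Dict[str, Callable[[Record], Any]]


@dataclass
class DomainExtension:
    """Hypothetical domain extension: a named data type and the keys it requires."""
    data_type: str
    required_keys: List[str]

    def validate(self, record: Record) -> bool:
        # A record fits the data type only if every required key is present.
        return all(key in record for key in self.required_keys)


@dataclass
class Store:
    """In-memory stand-in (an assumption) for the database platform's storage."""
    records: List[Record] = field(default_factory=list)
    metadata: List[Record] = field(default_factory=list)


def ingest(records: List[Record], schema: MetadataSchema,
           extension: DomainExtension, store: Store) -> int:
    """Extract metadata for each valid record, store both, return the count stored."""
    accepted = 0
    for record in records:
        if not extension.validate(record):
            continue  # skip records that do not match the domain data type
        # Populate each schema field by running its extractor on the record.
        meta = {name: extract(record) for name, extract in schema.fields.items()}
        meta["data_type"] = extension.data_type
        store.records.append(record)
        store.metadata.append(meta)
        accepted += 1
    return accepted


def fetch(store: Store, index: int) -> Tuple[Record, Record]:
    """Return a stored record with its metadata (stand-in for endpoint access)."""
    return store.records[index], store.metadata[index]
```

In this sketch a sensor reading that lacks a required key is skipped rather than stored; the claim does not say how nonconforming data is handled, so that behavior is an assumption made to keep the example short.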