US 12,380,090 B2
Managing data risk using automated dependency discovery
Gavin Manning, Mountain View, CA (US); and Paul Vanlint, Mountain View, CA (US)
Assigned to GOOGLE LLC, Mountain View, CA (US)
Filed by GOOGLE LLC, Mountain View, CA (US)
Filed on Mar. 14, 2024, as Appl. No. 18/605,313.
Claims priority of provisional application 63/452,161, filed on Mar. 14, 2023.
Prior Publication US 2024/0311366 A1, Sep. 19, 2024
Int. Cl. G06F 7/00 (2006.01); G06F 16/23 (2019.01)
CPC G06F 16/2365 (2019.01) 20 Claims
OG exemplary drawing
 
1. A method of risk assessment for a target dataset including a plurality of records, the method implemented in a computing system and comprising:
receiving, by one or more processors and from a data source, information related to a plurality of datasets including the target dataset;
automatically determining, by the one or more processors, dependencies on the target dataset using logs indicative of read and write operations related to the target dataset;
generating, by the one or more processors and using the determined dependencies, a dependency graph indicative of the read and write operations, within the plurality of datasets, on the target dataset, wherein each operation of the read and write operations represents an edge within the dependency graph;
determining, by the one or more processors and using the generated dependency graph, a level of risk associated with the target dataset, wherein the level of risk is based on a risk of corruption (ROC) and an impact of corruption (IOC) for the target dataset; and
providing, by the one or more processors, an indication of the determined level of risk.