US 11,954,567 B2
Probability distribution assessment for classifying subterranean formations using machine learning
Jiazuo Zhang, Reading (GB); and Graham Baines, Abingdon (GB)
Assigned to Landmark Graphics Corporation, Houston, TX (US)
Appl. No. 16/963,313
Filed by Landmark Graphics Corporation, Houston, TX (US)
PCT Filed Feb. 20, 2020, PCT No. PCT/US2020/019062
§ 371(c)(1), (2) Date Jul. 20, 2020,
PCT Pub. No. WO2021/040791, PCT Pub. Date Mar. 4, 2021.
Claims priority of provisional application 62/891,023, filed on Aug. 23, 2019.
Prior Publication US 2022/0004919 A1, Jan. 6, 2022
Int. Cl. G06F 18/22 (2023.01); G01V 20/00 (2024.01); G06F 18/214 (2023.01); G06F 18/2431 (2023.01); G06N 20/00 (2019.01)
CPC G06N 20/00 (2019.01) [G01V 20/00 (2024.01); G06F 18/214 (2023.01); G06F 18/22 (2023.01); G06F 18/2431 (2023.01); G01V 2210/66 (2013.01)] 20 Claims
OG exemplary drawing
 
1. A system comprising:
a processor; and
a non-transitory memory device comprising instructions that are executable by the processor to cause the processor to perform operations comprising:
receiving geological or geophysical data collected for hydrocarbon exploration from a plurality of subterranean formations;
generating, using the geological or geophysical data, a plurality of training data sets used to train machine-learning models, each training data set of the plurality of training data sets including a portion of the geological or geophysical data and one or more training classes, each training class being defined by a probability distribution and representing a classification of the portion of the geological or geophysical data;
receiving a test data set including geological or geophysical data of a subterranean formation;
selecting a training data set from the plurality of training data sets by comparing the test data set with the probability distribution of each training class of the one or more training classes of the training data set;
determining a trained machine-learning model trained using the selected training data set;
generating an output representing a recommendation to execute the trained machine-learning model to classify the test data set; and
displaying, based on a result of executing the trained machine-learning model on the test data set, an interface presenting a classification of the test data set that is usable for hydrocarbon exploration.