US 12,072,897 B2
Similarity searching across digital standards
Divyesh Gaur, Pittsburgh, PA (US); Suman Ghosh, Monroeville, PA (US); Jean De Dieu Iradukunda, Pittsburgh, PA (US); Audra Ziegenfuss, Baden, PA (US); and Uxue Zurutuza Dorronsoro, Pittsburgh, PA (US)
Assigned to SAE INTERNATIONAL, Warrendale, PA (US)
Filed by SAE International, Warrendale, PA (US); COGNISTIC LLC, Pittsburgh, PA (US); and Sigma Resources LLC, Pittsburgh, PA (US)
Filed on Feb. 23, 2021, as Appl. No. 17/183,048.
Prior Publication US 2022/0269685 A1, Aug. 25, 2022
Int. Cl. G06F 16/2457 (2019.01)
CPC G06F 16/24578 (2019.01) 20 Claims
OG exemplary drawing
 
1. A method for identifying similar objects by performing document attribute comparisons, the method comprising:
providing a digital standard system comprising a user interface, at least one machine-learning model, and at least one data store comprising a plurality of digital standards, wherein each of the plurality of digital standards comprises a digital format of an underlying standard formatted, by the digital standard system using the at least one machine-learning model to classify conceptual units extracted from the underlying standard format and store the classified conceptual units into the at least one data store, into a data structure and data model describing interactions and relationships of aspects within the underlying standard as identified from the classified conceptual units, wherein the digital standard system provides digital functionality allowing a user to interact with and view the plurality of digital standards, the digital functionality being facilitated via the digital format of the underlying standard, wherein the data structure and the data model provide a consistent format for the plurality of digital standards and identifies a data type for attributes of objects represented by the plurality of digital standards, wherein the data type corresponds to a format of an attribute within a digital standard, wherein the plurality of digital standards have disparate data types across similar attributes;
displaying, on a display device, the user interface of the digital standard system and allowing user interaction with the digital standard system via the user interface;
receiving, from an input device and within an input field of the user interface of the digital standard system, a reference document, wherein the reference document corresponds to an object and comprises a plurality of attributes of the object, each of the attributes having a data type;
receiving, from a user providing input to at least one input area within the user interface, a request for a similarity comparison based upon the reference document, wherein the request provides an indication of similarity attributes corresponding to attributes of the object to be compared with attributes of other objects during the similarity comparison;
comparing, using the digital standard system, the reference document to each digital standard of a group of digital standards of the plurality of digital standards that correspond to other objects having a same type as the reference document and located within the at least one data store, wherein the comparing comprises making the data types for the similarity attributes the same across the reference document and the group of digital standards by substituting a data type of each of the similarity attributes received from the user with a defined data type and performing a comparison of the similarity attributes having the substitute data types, wherein the comparison performed is based upon the substitute data type of a given similarity attribute and is facilitated via the digital format of the underlying standard;
generating, using the digital standard system, a document similarity score for each of the digital standards within the group based upon the comparing, wherein the document similarity score is generated based upon an aggregation of similarity attribute scores calculated for each of the similarity attributes and being determined from the comparison; and
displaying, to the user within the user interface, at least one of the digital standards within the group with an indication of a similarity of the at least one of the digital standards within the group to the reference document, the indication of similarity being based upon a document similarity score corresponding to the at least one of the digital standards within the group, wherein the at least one of the digital standards displayed is different than the reference document and is a replacement for the object corresponding to the reference document.