| CPC G06F 8/70 (2013.01) [G06F 16/367 (2019.01); G06N 5/022 (2013.01)] | 14 Claims |

|
1. A computer-implemented method, comprising:
constructing, by one or more processors, an NLP-based search engine on a transformers-based architecture to handle sequential data, wherein
the transformers-based architecture comprises a multi-head self-attention mechanism combined with an encoder-decoder structure that considers longer-distanced context around a word and enables parallelization techniques for Graphics Processing Unit (GPU) computing, and
the sequential data comprises at least one of natural language or tasks including translation or text summarization;
receiving, by the one or more processors, a plurality of Software Development Life Cycle (SDLC) artifacts of an SDLC from a plurality of heterogeneous data sources, wherein
the plurality of heterogeneous data sources comprises unstructured documents including word and pdf files, semi-structured documents including JSON and XML files, and structured databases including relational databases and
the plurality of SDLC artifacts comprises at least requirements, test cases, and defect logs;
correlating and clustering, by the one or more processors, the plurality of SDLC artifacts to generate a knowledge fabric, wherein the correlating and clustering comprises:
extracting, by the one or more processors, semantic and contextual data from the plurality of SDLC artifacts based on Natural Language Processing (NLP) and deep text analytics, wherein
the extracting comprises named entity extraction and information extraction for use in building knowledge organization systems or ontologies, and
the NLP includes using the NLP-based search engine on the transformers-based architecture; and
transforming, by the one or more processors, the extracted semantic and contextual data to one or more knowledge graphs to build the knowledge organization systems or the ontologies, wherein the one or more knowledge graphs comprise Resource Description Framework (RDF) triples, semantic web graphs, or labeled property graphs that are queryable by graph query languages;
deriving, by the one or more processors, one or more actionable items based on the knowledge fabric, wherein the one or more actionable items include change impact analysis for new change requests to the requirements, defect hotspot identification, root cause analysis, and contextual search results utilized to improve overall process efficiency of the SDLC;
automatically generating a defect prediction based on the knowledge fabric;
processing data that identifies a file impacted by a new requirement or a defect;
identifying a correction to the defect;
identifying, based on the knowledge fabric, time-series data from one or more application performance management tools;
combining the time-series data with Chaos engineering; and
highlighting performance issues and bottlenecks in one or more infrastructure resources during the SDLC, wherein the one or more infrastructure resources is at least one of CPU utilization, memory usage, network latency, or input/output (I/O) constraints.
|