Diego Mandelli

AI/ML for Equipment Reliability

The past two decades have seen the emergence of advanced prognostic and health management computational tools for anomaly detection, diagnostic, and prognostic purposes, which are helping system engineers and plant operators monitor the performance of several assets and optimize plant resources (personnel, time, and money). In the same direction, nuclear power plants (NPPs) are now in the process of digitizing operation and maintenance activities to track trends and events at the system or plant level (e.g., plant planned shutdown or system taken out of service) and, more importantly, observed abnormal conditions. In this respect, NPPs generate large amounts of equipment reliability (ER) data, which record the historic performance of a large number of components and assets. As a drawback, engineers and operators are now facing the challenge of processing the amount of ER data being continuously generated, which is not only extremely large but also appears in both textual and numeric forms. This project directly tackles this challenge by providing computational methods to assist system engineers and operators with the means to extract knowledge from ER data. Our approach deviates substantially from state-of-practice methods where the main focus is almost exclusively on numeric data analysis. Instead, we employ a model-based system engineering (MBSE) model approach to represent systems and assets to capture their architectural and functional (i.e., cause-effect) relations. Textual and numeric ER data elements are processed by identifying first which elements of the developed MBSE elements they are referring to. For numeric ER data, this task is fairly easy provided there are system design documents that indicate a precise association between asset and monitored time series. On the other hand, this task is more challenging when dealing with textual data; we employ technical language processing (TLP) methods to “extract knowledge” from textual elements. Filtering abnormal behaviors can then be performed on numeric data through anomaly detections methods and textual elements (by understanding their semantic nature). Such abnormal instances that are associated with a specific MBSE element are then stored in a relational database. This database takes the form of a graph where the main skeleton is the actual system MBSE model and abnormal instances are “linked” to the skeleton. At this point, both numeric and textual data elements are integrated and put into context. From here, graph-based analysis methods can be employed to perform “machine reasoning,” including identifying abnormal patterns and the root cause behind them. The presented methods have been developed in the digital analytics, causal knowledge acquisition and reasoning (DACKAR) repository.

DACKAR Official website

DACKAR GitHub repository

MBSE system and equipment representation

Matrix profile for anomaly detection

Knowledge extractioon from textual data

Event causality

System architectura and data knowledge graph