Fault Detection
   HOME

TheInfoList



OR:

Fault detection, isolation, and recovery (FDIR) is a subfield of
control engineering Control engineering, also known as control systems engineering and, in some European countries, automation engineering, is an engineering discipline that deals with control systems, applying control theory to design equipment and systems with d ...
which concerns itself with monitoring a system, identifying when a fault has occurred, and pinpointing the type of fault and its location. Two approaches can be distinguished: A direct pattern recognition of sensor readings that indicate a fault and an analysis of the discrepancy between the sensor readings and expected values, derived from some model. In the latter case, it is typical that a fault is said to be detected if the discrepancy or ''residual'' goes above a certain threshold. It is then the task of fault isolation to categorize the type of fault and its location in the machinery. Fault detection and isolation (FDI) techniques can be broadly classified into two categories. These include model-based FDI and signal processing based FDI.


Model-based FDI

In model-based FDI techniques some model of the system is used to decide about the occurrence of fault. The system model may be
mathematical Mathematics is a field of study that discovers and organizes methods, Mathematical theory, theories and theorems that are developed and Mathematical proof, proved for the needs of empirical sciences and mathematics itself. There are many ar ...
or knowledge based. Some of the model-based FDI techniques include observer-based approach, parity-space approach, and parameter identification based methods. There is another trend of model-based FDI schemes, which is called set-membership methods. These methods guarantee the detection of fault under certain conditions. The main difference is that instead of finding the most likely model, these techniques omit the models, which are not compatible with data. The example shown in the figure on the right illustrates a model-based FDI technique for an aircraft elevator reactive controller through the use of a truth table and a state chart. The truth table defines how the controller reacts to detected faults, and the state chart defines how the controller switches between the different modes of operation (passive, active, standby, off, and isolated) of each actuator. For example, if a fault is detected in hydraulic system 1, then the truth table sends an event to the state chart that the left inner actuator should be turned off. One of the benefits of this model-based FDI technique is that this reactive controller can also be connected to a continuous-time model of the actuator hydraulics, allowing the study of switching transients.


Signal processing based FDI

In signal processing based FDI, some mathematical or statistical operations are performed on the measurements, or some neural network is trained using measurements to extract the information about the fault. A good example of signal processing based FDI is time domain reflectometry where a signal is sent down a cable or electrical line and the reflected signal is compared mathematically to original signal to identify faults. Spread Spectrum Time Domain Reflectometry, for instance, involves sending down a spread spectrum signal down a wire line to detect wire faults. Several clustering methods have also been proposed to identify the novel fault and segment a given signal into normal and faulty segments.


Machine fault diagnosis

Machine fault diagnosis is a field of
mechanical engineering Mechanical engineering is the study of physical machines and mechanism (engineering), mechanisms that may involve force and movement. It is an engineering branch that combines engineering physics and engineering mathematics, mathematics principl ...
concerned with finding faults arising in machines. A particularly well developed part of it applies specifically to rotating machinery, one of the most common types encountered. To identify the most probable faults leading to failure, many methods are used for data collection, including
vibration Vibration () is a mechanical phenomenon whereby oscillations occur about an equilibrium point. Vibration may be deterministic if the oscillations can be characterised precisely (e.g. the periodic motion of a pendulum), or random if the os ...
monitoring,
thermal imaging Infrared thermography (IRT), thermal video or thermal imaging, is a process where a thermal camera captures and creates an image of an object by using infrared radiation emitted from the object in a process, which are examples of infrared im ...
, oil particle analysis, etc. Then these data are processed utilizing methods like spectral analysis,
wavelet analysis A wavelet is a wave-like oscillation with an amplitude that begins at zero, increases or decreases, and then returns to zero one or more times. Wavelets are termed a "brief oscillation". A taxonomy of wavelets has been established, based on the n ...
, wavelet transform, short term Fourier transform, Gabor Expansion, Wigner-Ville distribution (WVD), cepstrum, bispectrum, correlation method, high resolution spectral analysis, waveform analysis (in the time domain, because spectral analysis usually concerns only frequency distribution and not phase information) and others. The results of this analysis are used in a root cause failure analysis in order to determine the original cause of the fault. For example, if a bearing fault is diagnosed, then it is likely that the bearing was not itself damaged at installation, but rather as the consequence of another installation error (e.g., misalignment) which then led to bearing damage. Diagnosing the bearing's damaged state is not enough for precision maintenance purposes. The root cause needs to be identified and remedied. If this is not done, the replacement bearing will soon wear out for the same reason and the machine will suffer more damage, remaining dangerous. Of course, the cause may also be visible as a result of the spectral analysis undertaken at the data-collection stage, but this may not always be the case. The most common technique for detecting faults is the time-frequency analysis technique. For a rotating machine, the rotational speed of the machine (often known as the
RPM Revolutions per minute (abbreviated rpm, RPM, rev/min, r/min, or r⋅min−1) is a unit of rotational speed (or rotational frequency) for rotating machines. One revolution per minute is equivalent to hertz. Standards ISO 80000-3:2019 def ...
), is not a constant, especially not during the start-up and shutdown stages of the machine. Even if the machine is running in the steady state, the rotational speed will vary around a steady-state mean value, and this variation depends on load and other factors. Since sound and vibration signals obtained from a rotating machine are strongly related to its rotational speed, it can be said that they are time-variant signals in nature. These time-variant features carry the machine fault signatures. Consequently, how these features are extracted and interpreted is important to research and industrial applications. The most common method used in signal analysis is the FFT, or Fourier transform. The Fourier transform and its inverse counterpart offer two perspectives to study a signal: via the time domain or via the frequency domain. The FFT-based spectrum of a time signal shows us the existence of its frequency contents. By studying these and their magnitude or phase relations, we can obtain various types of information, such as
harmonics In physics, acoustics, and telecommunications, a harmonic is a sinusoidal wave with a frequency that is a positive integer multiple of the ''fundamental frequency'' of a periodic signal. The fundamental frequency is also called the ''1st harm ...
, sidebands, beat frequency, bearing fault frequency and so on. However, the FFT is only suitable for signals whose frequency contents do not change over time; however, as mentioned above, the frequency contents of the sound and vibration signals obtained from a rotating machine are very much time-dependent. For this reason, FFT-based spectra are unable to detect how the frequency contents develop over time. To be more specific, if the
RPM Revolutions per minute (abbreviated rpm, RPM, rev/min, r/min, or r⋅min−1) is a unit of rotational speed (or rotational frequency) for rotating machines. One revolution per minute is equivalent to hertz. Standards ISO 80000-3:2019 def ...
of a machine is increasing or decreasing during its startup or shutdown period, its bandwidth in the FFT spectrum will become much wider than it would be simply for the steady state. Hence, in such a case, the harmonics are not so distinguishable in the spectrum. The time frequency approach for machine fault diagnosis can be divided into two broad categories: linear methods and the quadratic methods. The difference is that linear transforms can be inverted to construct the time signal, thus, they are more suitable for signal processing, such as noise reduction and time-varying filtering. Although the quadratic method describes the energy distribution of a signal in the joint time frequency domain, which is useful for analysis, classification, and detection of signal features, phase information is lost in the quadratic time-frequency representation; also, the time histories cannot be reconstructed with this method. The short-term Fourier transform ( STFT) and the
Gabor transform The Gabor transform, named after Dennis Gabor, is a special case of the short-time Fourier transform. It is used to determine the Sine wave, sinusoidal frequency and phase (waves), phase content of local sections of a signal as it changes over time ...
are two algorithms commonly used as linear time-frequency methods. If we consider linear time-frequency analysis to be the evolution of the conventional FFT, then quadratic time frequency analysis would be the power spectrum counterpart. Quadratic algorithms include the Gabor spectrogram, Cohen's class and the adaptive spectrogram. The main advantage of time frequency analysis is discovering the patterns of frequency changes, which usually represent the nature of the signal. As long as this pattern is identified the machine fault associated with this pattern can be identified. Another important use of time frequency analysis is the ability to filter out a particular frequency component using a time-varying filter.


Robust fault diagnosis

In practice, model uncertainties and measurement noise can complicate fault detection and isolation. As a result, using fault diagnostics to meet industrial needs in a cost-effective way, and to reduce maintenance costs without requiring more investments than the cost of what is to be avoided in the first place, requires an effective scheme of applying them. This is the subject of
maintenance, repair and operations The technical meaning of maintenance involves functional checks, servicing, repairing or replacing of necessary devices, equipment, machinery, building infrastructure and supporting utilities in industrial, business, and residential installa ...
; the different strategies include: *
Condition-based maintenance Predictive maintenance techniques are designed to help determine the condition of in-service equipment in order to estimate when maintenance should be performed. This approach claims more cost savings over routine or time-based preventive maint ...
* Planned preventive maintenance *
Preventive maintenance The technical meaning of maintenance involves functional checks, servicing, repairing or replacing of necessary devices, equipment, machinery, building infrastructure and supporting utilities in industrial, business, and residential installa ...
*
Corrective maintenance Corrective maintenance is a maintenance task performed to identify, isolate, and rectify a fault so that the failed equipment, machine, or system can be restored to an operational condition within the tolerances or limits established for in-serv ...
(does not use diagnostics) * Integrated vehicle health management


Fault detection and diagnosis using artificial intelligence


Machine learning techniques for fault detection and diagnosis

In fault detection and diagnosis, mathematical classification models which in fact belong to
supervised learning In machine learning, supervised learning (SL) is a paradigm where a Statistical model, model is trained using input objects (e.g. a vector of predictor variables) and desired output values (also known as a ''supervisory signal''), which are often ...
methods, are trained on the
training set In machine learning, a common task is the study and construction of algorithms that can learn from and make predictions on data. Such algorithms function by making data-driven predictions or decisions, through building a mathematical model from ...
of a labeled
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
to accurately identify the redundancies, faults and anomalous samples. During the past decades, there are different
classification Classification is the activity of assigning objects to some pre-existing classes or categories. This is distinct from the task of establishing the classes themselves (for example through cluster analysis). Examples include diagnostic tests, identif ...
and preprocessing models that have been developed and proposed in this research area. ''K''-nearest-neighbors algorithm (''k''NN) is one of the oldest techniques which has been used to solve fault detection and diagnosis problems. Despite the simple logic that this instance-based algorithm has, there are some problems with large dimensionality and processing time when it is used on large
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s. Since ''k''NN is not able to automatically extract the features to overcome the
curse of dimensionality The curse of dimensionality refers to various phenomena that arise when analyzing and organizing data in high-dimensional spaces that do not occur in low-dimensional settings such as the three-dimensional physical space of everyday experience. T ...
, so often some data preprocessing techniques like
Principal component analysis Principal component analysis (PCA) is a linear dimensionality reduction technique with applications in exploratory data analysis, visualization and data preprocessing. The data is linearly transformed onto a new coordinate system such that th ...
(PCA),
Linear discriminant analysis Linear discriminant analysis (LDA), normal discriminant analysis (NDA), canonical variates analysis (CVA), or discriminant function analysis is a generalization of Fisher's linear discriminant, a method used in statistics and other fields, to fi ...
(LDA) or
Canonical correlation analysis In statistics, canonical-correlation analysis (CCA), also called canonical variates analysis, is a way of inferring information from cross-covariance matrices. If we have two vectors ''X'' = (''X''1, ..., ''X'n'') and ''Y'' ...
(CCA) accompany it to reach a better performance. In many industrial cases, the effectiveness of ''k''NN has been compared with other methods, specially with more complex classification models such as
Support Vector Machine In machine learning, support vector machines (SVMs, also support vector networks) are supervised max-margin models with associated learning algorithms that analyze data for classification and regression analysis. Developed at AT&T Bell Laborato ...
s (SVMs), which is widely used in this field. Thanks to their appropriate nonlinear mapping using
kernel methods In machine learning, kernel machines are a class of algorithms for pattern analysis, whose best known member is the support-vector machine (SVM). These methods involve using linear classifiers to solve nonlinear problems. The general task of pa ...
, SVMs have an impressive performance in generalization, even with small training data. However, general SVMs do not have automatic feature extraction themselves and just like ''k''NN, are often coupled with a data pre-processing technique. Another drawback of SVMs is that their performance is highly sensitive to the initial parameters, particularly to the
kernel method In machine learning, kernel machines are a class of algorithms for pattern analysis, whose best known member is the support-vector machine (SVM). These methods involve using linear classifiers to solve nonlinear problems. The general task of patt ...
s, so in each signal
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
, a parameter tuning process is required to be conducted first. Therefore, the low speed of the training phase is a limitation of SVMs when it comes to its usage in fault detection and diagnosis cases.
Artificial Neural Networks In machine learning, a neural network (also artificial neural network or neural net, abbreviated ANN or NN) is a computational model inspired by the structure and functions of biological neural networks. A neural network consists of connected ...
(ANNs) are among the most mature and widely used mathematical classification algorithms in fault detection and diagnosis. ANNs are well-known for their efficient self-learning capabilities of the complex relations (which generally exist inherently in fault detection and diagnosis problems) and are easy to operate. Another advantage of ANNs is that they perform automatic feature extraction by allocating negligible weights to the irrelevant features, helping the system to avoid dealing with another feature extractor. However, ANNs tend to over-fit the training set, which will have consequences of having poor validation accuracy on the validation set. Hence, often, some regularization terms and prior knowledge are added to the ANN model to avoid over-fitting and achieve higher performance. Moreover, properly determining the size of the hidden layer needs an exhaustive parameter tuning, to avoid poor approximation and generalization capabilities. In general, different SVMs and ANNs models (i.e. Back-Propagation Neural Networks and Multi-Layer Perceptron) have shown successful performances in the fault detection and diagnosis in industries such as
gearbox A transmission (also called a gearbox) is a mechanical device invented by Louis Renault (who founded Renault) which uses a gear set—two or more gears working together—to change the speed, direction of rotation, or torque multiplication/r ...
,
machinery A machine is a physical system that uses power to apply forces and control movement to perform an action. The term is commonly applied to artificial devices, such as those employing engines or motors, but also to natural biological macromolec ...
parts (i.e.
mechanical bearing A ball bearing A bearing is a machine element that constrains relative motion to only the desired motion and reduces friction between moving parts. The design of the bearing may, for example, provide for free linear movement of the moving pa ...
s),
compressor A compressor is a mechanical device that increases the pressure of a gas by reducing its volume. An air compressor is a specific type of gas compressor. Many compressors can be staged, that is, the gas is compressed several times in steps o ...
s,
wind Wind is the natural movement of atmosphere of Earth, air or other gases relative to a planetary surface, planet's surface. Winds occur on a range of scales, from thunderstorm flows lasting tens of minutes, to local breezes generated by heatin ...
and
gas turbine A gas turbine or gas turbine engine is a type of Internal combustion engine#Continuous combustion, continuous flow internal combustion engine. The main parts common to all gas turbine engines form the power-producing part (known as the gas gene ...
s and steel plates.


Deep learning techniques for fault detection and diagnosis

With the research advances in ANNs and the advent of
deep learning Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...
algorithms using deep and complex layers, novel classification models have been developed to cope with fault detection and diagnosis. Most of the shallow learning models extract a few feature values from signals, causing a dimensionality reduction from the original
signal A signal is both the process and the result of transmission of data over some media accomplished by embedding some variation. Signals are important in multiple subject fields including signal processing, information theory and biology. In ...
. By using
Convolutional neural network A convolutional neural network (CNN) is a type of feedforward neural network that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different ty ...
s, the
continuous wavelet transform In mathematics, the continuous wavelet transform (CWT) is a formal (i.e., non-numerical) tool that provides an overcomplete representation of a signal by letting the translation and scale parameter of the wavelets vary continuously. Definition ...
scalogram can be directly classified to normal and faulty classes. Such a technique avoids omitting any important fault message and results in a better performance of fault detection and diagnosis. In addition, by transforming signals to image constructions, 2D
Convolutional neural network A convolutional neural network (CNN) is a type of feedforward neural network that learns features via filter (or kernel) optimization. This type of deep learning network has been applied to process and make predictions from many different ty ...
s can be implemented to identify faulty signals from vibration image features. Deep belief networks,
Restricted Boltzmann machine A restricted Boltzmann machine (RBM) (also called a restricted Sherrington–Kirkpatrick model with external field or restricted stochastic Ising–Lenz–Little model) is a generative stochastic artificial neural network that can learn a prob ...
s and
Autoencoder An autoencoder is a type of artificial neural network used to learn efficient codings of unlabeled data (unsupervised learning). An autoencoder learns two functions: an encoding function that transforms the input data, and a decoding function ...
s are other
deep neural networks Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...
architectures which have been successfully used in this field of research. In comparison to traditional machine learning, due to their deep architecture,
deep learning Deep learning is a subset of machine learning that focuses on utilizing multilayered neural networks to perform tasks such as classification, regression, and representation learning. The field takes inspiration from biological neuroscience a ...
models are able to learn more complex structures from
dataset A data set (or dataset) is a collection of data. In the case of tabular data, a data set corresponds to one or more database tables, where every column of a table represents a particular variable, and each row corresponds to a given record o ...
s, however, they need larger samples and longer processing time to achieve higher accuracy.


Fault recovery

Fault Recovery in FDIR is the action taken after a failure has been detected and isolated to return the system to a stable state. Some examples of fault recoveries are: * Switch-off of a faulty equipment * Switch-over from a faulty equipment to a redundant equipment * Change of state of the complete system into a Safe Mode with limited functionalities


See also

*
Control reconfiguration Control reconfiguration is an active approach in control theory to achieve Fault-Tolerant Control, fault-tolerant control for dynamic systems. It is used when severe Fault (technology), faults, such as actuator or sensor outages, cause a break-up o ...
*
Control theory Control theory is a field of control engineering and applied mathematics that deals with the control system, control of dynamical systems in engineered processes and machines. The objective is to develop a model or algorithm governing the applic ...
*
Failure mode and effects analysis Failure is the social concept of not meeting a desirable or intended Goal, objective, and is usually viewed as the opposite of success. The criteria for failure depends on context, and may be relative to a particular observer or belief system ...
*
Fault-tolerant system Fault tolerance is the ability of a system to maintain proper operation despite failures or faults in one or more of its components. This capability is essential for high-availability, mission-critical, or even life-critical systems. Fault to ...
*
Predictive maintenance Predictive maintenance techniques are designed to help determine the condition of in-service equipment in order to estimate when maintenance should be performed. This approach claims more cost savings over routine or time-based preventive maint ...
* Spread-spectrum time-domain reflectometry *
System identification The field of system identification uses statistical methods to build mathematical models of dynamical systems from measured data. System identification also includes the optimal design#System identification and stochastic approximation, optimal de ...


References

{{DEFAULTSORT:Fault Detection And Isolation Control theory Systems engineering Reliability engineering