Fault diagnosis

TOPIC AREA

What Is Fault Diagnosis?

Fault diagnosis is the process of detecting the presence of an abnormal condition in a system, isolating its location, and identifying its nature and severity. In engineered systems ranging from power transformers to aircraft engines, faults that go undetected can escalate into failures with serious safety and economic consequences. Effective fault diagnosis combines sensing, signal processing, and reasoning to enable operators and automated controllers to intervene before minor degradation becomes catastrophic failure.

Condition Monitoring and Signal Analysis

Condition monitoring is the continuous or periodic acquisition of measurements from a system to track its health over time. Relevant signals include vibration, temperature, pressure, current, voltage, acoustic emission, and chemical composition, depending on the system. Deviations from baseline behavior indicate the possible onset of a fault.

Vibration analysis is among the most widely used monitoring approaches for rotating machinery. Characteristic fault frequencies associated with bearing defects, gear tooth damage, and shaft imbalance appear as spectral peaks that can be tracked with accelerometers and fast Fourier transform analysis. Envelope analysis further isolates impulsive fault signatures from background noise.

Dissolved gas analysis (DGA) is the standard method for diagnosing internal faults in oil-filled power transformers. When insulation degrades or arcing occurs, gases such as hydrogen, methane, acetylene, and ethylene dissolve in the transformer oil in ratios characteristic of specific fault types. IEC and IEEE standards for DGA interpretation define diagnostic ratios and gas concentration thresholds used by utilities worldwide to schedule maintenance and avoid unplanned outages.

Model-Based Diagnosis

Model-based fault diagnosis compares the actual behavior of a system with the predicted behavior of a mathematical model. The difference, called a residual, is nominally zero under healthy operation and becomes nonzero when a fault occurs. Observers, parity equations, and parameter estimation techniques generate residuals from available measurements.

A threshold on the residual triggers a fault alarm, while the pattern of which residuals are excited identifies the fault location through a diagnostic logic layer called a decision tree or fault signature matrix. The challenge is designing residuals that are sensitive to faults but insensitive to disturbances and model uncertainty, a property called fault detectability with disturbance decoupling.

Analytical redundancy methods documented in the IEEE Control Systems literature provide a rigorous framework for constructing such residuals for linear and nonlinear dynamical systems. Sliding mode observers and unknown input observers are particularly useful when disturbances cannot be fully modeled.

Data-Driven Fault Detection

When accurate physical models are unavailable or too complex to derive, data-driven methods learn fault signatures directly from historical operational data. Machine learning classifiers, including support vector machines, random forests, and deep convolutional neural networks, are trained on labeled examples of healthy and faulty operation to build diagnostic models.

Principal component analysis and autoencoders provide unsupervised alternatives that detect anomalies as deviations from the learned manifold of normal operation without requiring labeled fault examples. Recurrent neural networks and transformer architectures capture temporal dependencies in time-series sensor data, improving detection of faults that evolve gradually over time.

Fault location is a specialized task in power systems where the objective is to identify the segment of a transmission or distribution line on which a fault has occurred. Traveling wave methods and impedance-based algorithms analyze transient waveforms captured at substation relays to pinpoint fault distance within meters. NIST metrology resources for sensor calibration are relevant to ensuring the measurement accuracy that underpins reliable fault detection.

Research on deep learning for industrial fault diagnosis surveys neural architectures applied to bearing, gearbox, and motor faults, documenting benchmark performance across public datasets.

Applications

  • Power transformer maintenance programs use dissolved gas analysis to identify winding insulation breakdown and arcing before catastrophic failure.
  • Wind turbine operators monitor gearbox and bearing vibration spectra to schedule component replacement during planned maintenance windows.
  • Aviation maintenance uses onboard health monitoring systems to detect engine anomalies and trigger ground inspections before flight.
  • Railway infrastructure monitoring analyzes axle-box accelerations to identify track irregularities and wheel defects.
  • Chemical process plants apply model-based residual generation to detect sensor drift, valve sticking, and heat exchanger fouling.
  • Semiconductor manufacturing uses statistical process control and machine learning to detect deposition and etch process faults from in-situ sensor data.