当前位置:首页 期刊杂志

Application of deep autoencoder model for structural condition monitoring

时间:2024-07-28

PATHIRAGE Chathurdara Sri Nadith,LI Jun,LI Ling,*,HAO Hong,and LIU Wanquan

1.Department of Computing,Curtin University,Perth WA 6102,Australia;

2.Centre for Infrastructural Monitoring and Protection,Curtin University,Perth WA 6102,Australia

Abstract:Damage detection in structures is performed via vibration based structural identification.Modal information,such as frequencies and mode shapes,are widely used for structural damage detection to indicate the health conditions of civil structures.The deep learning algorithm that works on a multiple layer neural network model termed as deep autoencoder is proposed to learn the relationship between the modal information and structural stiffness parameters.This is achieved via dimension reduction of the modal information feature and a non-linear regression against the structural stiffness parameters.Numerical tests on a symmetrical steel frame model are conducted to generate the data for the training and validation,and to demonstrate the efficiency of the proposed approach for vibration based structural damage detection.

Keywords:auto encoder,non-linear regression,deep auto encoder model,damage identification,vibration,structural health monitoring.

1.Introduction

Structural health monitoring(SHM)aims to assess the structural performance and evaluate the safety conditions of civil infrastructure under operational conditions.Civil structures continuously accumulate damage during their service life due to material deterioration,cyclic loading,and environmental conditions.By analyzing the measurements from various sensors installed on structures,SHM techniques detect and track the possible anomalies that could potentially produce more damage and finally lead to catastrophic structural failures with a huge loss.Measured data from the structures are widely used to detect not only the existence and location of possible damage,but also the severity of the damage.

Vibration measurements are usually used for structural condition monitoring.It is based on the fact that when structures have damages or condition changes,the structural physical properties(i.e.,stiffness,mass and damping matrices)will change and therefore vibration properties(i.e.,modal parameters)will also change.The changes in the structural vibration characteristics,such as natural frequencies,mode shapes and other modal information,can be used to identify the location and severity of damages.

Neural network techniques have been applied for vibration based damage identification in civil engineering.Yun et al.[1]presented using the neural networks technique to estimate structural joint damage from modal data.It was found that the noise injection learning with a realistic noise level for each input component is effective to better under stand the noise effect.Bakhary et al.[2]proposed a statistical approach to take into account the effect of uncertainties in developing an artificial neural network model for structural damage detection.Artificial neural networks are generally applicable to problems where a significant database of labelled information is available,but difficult to specify an explicit algorithm.

Back propagation(BP)neural network is one of the most popular shallow learning algorithms for training the neural network based on an optimization method such as the gradient descent method.BP based training has been foundto be effective,providing:i)initial weights are close enough to a good solution;ii)computers are fast enough;iii)data sets are big enough.However,it is difficult to optimize the weights in the networks with multiple hidden layers.Theoretical results such as those shown in[3-5],and reviewed and discussed by[6]demonstrate that deep learning(DL)architectures are favourable in learning this kind of complicated functions that can represent high level abstractions in computer vision[7-10],natural language processing[11],information retrieval[12]and robotics[13].DL based neural networks significantly outperform shallow learning based counterparts in many applications,and often match or overwhelm the state-of-the-art ap-proaches.

Autoencoders[14]are simple learning models which transform inputs into outputs with the least possible amount of distortion.They have been used in the“deep architecture”approaches[6,14,15],where autoencoders are stacked and trained from bottom up in an unsupervised manner,followed by a supervised learning phase to train the top layer and fine-tune the entire architecture.The unsupervised learning algorithm is used to pre-train each layer,to capture the main variations in its input.This unsupervised pre-training phase sets the stage for a final training phase where the deep architecture is fine-tuned with respect to a supervised training criterion with gradient-based optimization.These deep architectures have been shown to lead to state-of-the-art results in a number of challenging classification tasks[7,8,16]and regression problems[9,10].

Deep autoencoder(DAE)models[15,17]that fall in this category allow effective feature learning through hierarchical non-linear mappings via the multiple hidden layers of the model.Inspired by the work of[9,10]where the image processing has been performed with DAE,we propose a DAE model for vibration based structural condition monitoring and identification.

2.DAE model for structural health monitoring

This section discusses a DAE model that is proposed to perform the dimensionality reduction and the relationship learning.In the proposed approach,the fact that the natural frequencies and their corresponding mode shapes of a structure relate with the physical properties,such as stiffness,is explored.The modal information,such as frequencies and mode shapes,are used as the input to the proposed DAE model and the output will be the structural elemental stiffness parameters.A new feature is formed with all the frequencies along with their mode shapes to be fed into the DAE.Since a high dimensional feature may contain unnecessary information due to the redundant data and measurement noise effect,learning a relationship from such a high dimensional feature(the measured frequencies and their associated mode shapes)to elementary stiffness parameters will be likely to be less accurate than that of using reduced dimensional features.It is therefore understandable that the problem can be better addressed in two steps.The first attempts to reduce the dimensionality of the feature preserving the required information,while the second is to learn the relationship between the feature with the reduced dimension and the stiffness parameters of the structure as shown in Fig.1.A DAE model that consists of two main components connected sequentially where each component is optimized on a specific objective with relevance to the final goal is proposed in this paper.Details of the two DAE components will be presented in the next section.

Fig.1 The proposed DAE model

3.DAE model for pattern recognition

3.1 Dimension reduction

Autoencoders are stacks that learn an under complete representation to form deep architecture.In the proposed autoencoders,the 1st hidden layer performs the feature fusion(both the frequencies and mode shapes from the structure in this study)based on non-linear dimensionality reduction,the 2nd hidden layer performs further compression on the learnt low dimensional feature observed in the 1st hidden layer;and the 3rd and 4th layer perform the corresponding decoding of the low dimensional feature observed in the 2nd hidden layer.Hence the low dimensional feature space observed from the 2nd hidden layer will preserve the information to perform the reconstruction of the original feature since the used error criteria are the mean squares error for the reconstruction of the original feature.Structural natural frequencies and their corresponding mode shapes are fed into the autoencoder as follows:

where N is the number of samples involved in the training,is the concatenated feature that contains frequencies and mode shape values,g1(·)and f1(·)are the decoder and encoder[17]functions respectively.

The 2nd hidden layer further reduces the dimension of the low dimensional features obtained from the 1st hidden layer via the following cost function:

where N is the number of samples,is the low dimensional representation that is established in the layer 1 for the τth sample,g2(·)and f2(·)are the decoder and encoder functions that are utilized in the 2nd layer respectively.Pre-training is carried out to obtain the parameters using the full batch gradient descent algorithm.

3.2 Relationship learning

This model has three layers,out of which the first two layers are pre-trained as described in the above section.A relationship learning layer after the 2nd hidden layer is introduced.This will facilitate the relationship learning process by being able to utilize the low dimensional feature learnt at the 2nd hidden layer which is indeed a better feature representation than the input itself.Furthermore,the dimensionality reduction process will be supervised with the stiffness information,thus keeping the necessary information on the concatenated feature to predict the elemental stiffness parameters in the output.The 3rd hidden layer(the relationship learning layer) utilizes the output from the 2nd hidden layer to learn the mapping towards the stiffness elements that are presented at the output layer.Hence the observed latent representation in the 2nd hidden layer is utilized to perform a non-linear regression on stiffness elements via the third layer.The following cost function is optimized during this process:

where N is the number of samples,is the low dimensional representation that is produced at the 2nd layer for the τth sample,is the corresponding expected vector of structural stiffness parameters for the τth sample.g3(·)andf3(·)are the decoder and the encoder functions that are utilized in the 3rd layer respectively.Pre-training is also carried out with the full batch gradient descent algorithm as mentioned above.Once the optimal parameters are observed,the whole network is fine-tuned again to optimize all layers(L)jointly as below:

4.Verification

Numerical verifications on an experimental testing model are conducted to demonstrate the accuracy and efficiency of the proposed approach.A 7-storey steel frame structure is taken as an example.Seventy elements are defined in the structure,hence 70 elemental stiffness parameters are used to describe the health conditions of the structure.The stiffness parameters 0 and 1 denote the complete failure and intact states of the element respectively.A non-zero value would indicate that there is a stiffness reduction in the associated structural element. The data generation is performed assuming that stiffness reduction can occur in each of the 70 elements individually.The experimental model,data generation,pre-processing and the performance evaluation of the proposed DAE model are presented in the following sections.

4.1 Experimental model

A seven-storey steel plane frame is fabricated in the laboratory and the dimensions of the frame are described in Table 1.

Table 1 Dimensions of the experimental model

The initial Young’s modulus is taken as 210 GPa for all members.The connections between column and beam elements are continuously welded at the top and bottom of the beam section.Two pairs of mass blocks with approximately 4 kg weight each,are fixed at the quarter and three quarter length of the beam in each storey to simulate the mass from the floor of a building structure.The bottoms of the two columns of the frame are welded onto a thick and solid steel plate fixed to the ground.

Fig.2 shows the finite element model of the steel frame structure.It consists of 65 nodes and 70 planar frame elements.The weights of steel blocks are added at the corresponding nodes of the finite element model as concentrated masses.Each node has three degrees of freedom(DOFs)(two translational displacements x,y and a rotational displacement θ),and the system has 195 DOFs in total.The translational and rotational restraints at the supports,which are Nodes 1 and 65,are represented initially by a large stiffness of 3×109N/m and 3×109N·m/rad,respectively.

Fig.2 Finite element model of the frame structure

Finite element model updating of the initial finite element model is conducted to minimize the discrepancies between the analytical finite element model and the experimental model in the laboratory.The difference between the frequencies and mode shapes obtained from the analytical finite element model and the experimental measurements is minimized.

Measured natural frequencies and mode shapes of the first seven modes are shown in Fig.3.Only the mode shape values at the 14 beam-column joints are shown.The first-order modal sensitivity-based updating method[18]is used.It should be noted that the first seven measured frequencies and their associated 7×14 mode shape values on the beam-column joints are used in the updating procedure and the 70 elastic modulus values and six support stiffness values are required to be updated.The first seven measured and analytical frequencies,and the modal assurance values(MAC)associated with these seven modes are included in the objective function for model updating.The detailed model updating process can be found in[19,20].A very good updated model has been achieved to match the analytical and measured modal information.This updated finite element model is taken as the reference model for generating the training and validation data.

Fig.3 The first seven measured frequencies and mode shapes of the frame structure

4.2 Data generation

Modal analysis is performed using the reference model to generate input and output data to train the proposed DAE model.The first seven frequencies and the corresponding mode shapes at 14 beam-column joints are obtained.The elemental stiffness parameters are normalized to the range between 0 and 1,where 1 denotes the intact state and 0 denotes the completely damaged state as mentioned above.For example,if the stiffness parameter of a specific element is equal to 0.9,it means 10%stiffness reduction is introduced in this element.Twenty-one hundred data sets are generated from the baseline model.For each element,the stiffness parameter varies from 1,0.99,0.98,...,to 0.7 while keeping other elements undamaged.Therefore,30 data sets are generated for the scenario when a local damage is introduced in a specific element.With 70 elements in the finite element model,totally 2 100 data sets including the first seven frequencies and mode shapes as the input and the elemental stiffness parameters as the output are obtained for the training and validation.These modal information vectors are used as the input to the DAE model and the associated structural elemental stiffness parameters are defined as the output.

Fig.4 ADAE model with decoder to reconstruct the original input from encoded feature

4.3 Data pre-processing

Since the concatenated feature cτcontains both the frequencies and mode shapes that are measured in different scales,each feature is normalized separately so that all features lie on a common ground in the learning process.After the normalization process,both the frequencies and the mode shapes will be in the range from-0.5 to+0.5.This range is chosen since the operating range of the tanh(x)activation function falls to-1 to+1 and the corresponding inputs should be+∞and-∞respectively.The effectiveness of the proposed DAE model is evaluated based on the generated datasets,as described in the following sections.

4.4 Evaluation of the proposed model

The main objective of the proposed DAE model is to learn the relationship between the concatenated feature vector that is fed to the input and the output stiffness parameters.This is achieved via non-linear dimension reduction followed by a relationship learning phase as described above.Hence the evaluation of the model is performed separately on two components.First the effectiveness of the features learnt via dimensionality reduction component is assessed.Then the quality of learning the mapping as a whole is evaluated.The following two sections describe the evaluation process in detail.

4.4.1 Effective dimension reduction

The hidden feature obtained at the 2nd layer is decoded by introducing the decoder part to the proposed model as shown below.

Decoder facilitates the progressive dimensionality gain as opposed to the gradual dimension reduction via encoding layers.We choose the nodes for each hidden layer as shown in Fig.4.The pre-training for layer 1 and layer 2 is performed as described in Section 3.1.Once pre-trained weights are obtained(layer 1,layer 2 encoding weights),those encoding weights()are used to tie the weights[21]for the corresponding decoders as shown below:

Then the whole network is fine-tuned again to optimize all layers(L)jointly as below:

As shown in Table 2,it is conceivable that the finetuning stage indeed has a significant improvement to the global non-linearity involved in performing the effective dimension reduction.It is conceivable that the fine-tuning stage indeed has a significant improvement to the global non-linearity involved in performing the effective dimension reduction.The mean squares error is reduced to0.09%in the final training stage after fine tuning.The L2 regularization is used on the weights of the DAE model to ensure that it has less space to over- fit the training data.The testing error is as low as 0.77%indicating the accuracy of the proposed DAE model in performing the dimensionality reduction to represent the original input features.

Table 2 Evaluation results of the proposed model with the decoder for reconstruction of original feature

4.4.2 Effective relationship learning

Since the quality of the reduced dimensional feature is satisfactory for reconstructing the original feature as demonstrated above,the quality of the proposed DAE model for learning the relationship mapping is assessed in this section.The cost functions denoted in Section 3.2 in pretraining the layers and fine-tuning the whole network at the end are optimized.The effectiveness of the relationship learning process is evaluated with the MSE between the simulated stiffness parameters and the predicted outputs.The test samples are fed into the fine-tuned DAE model to generate the elemental stiffness parameters in the output.One hundred and five modal parameters including seven frequencies and their corresponding mode shapes on 14 beam-column joints are included in the input vector,and 70 stiffness parameters in the output vector.Ninety,seventy and seventy neural nodes are chosen for the first,the second and third hidden layers of the DAE model as shown in Fig.3.The predicted output stiffness values are compared against the expected stiffness parameters to observe how close the predicted outputs are to the true values.The errors that are observed in the different training phases of the DAE model are summarized in Table 3.

Table 3 Evaluation results of the proposed model

Different layers of DAE training have an effect on the global non-linearity involved in performing the effective dimensionality reduction while the error is further reduced in the latter training stages especially after fine-tuning as shown in Table 3.The validation error gradually decreases along with the depth of the network.It is also indicated that the error between the identified stiffness parameter values and the true values is 2.9%on the test dataset,demonstrating that the accuracy and efficiency of the proposed DAE model for structural damage identification are highly satisfactory.

4.4.3 Single damage identification

The model is firstly evaluated with single structural damage cases and an example from the test data set is shown in Fig.5.The identified stiffness reduction is very close to the actual value where the values of positive and negative false identifications are very small and close to zero while the identified locations of the damage are also clearly distinguishable.

Fig.5 An example of single damage identification

4.4.4 Multiple damage identification

Multiple structural damage identification is challenging,and needs more precision on the identification of accurate stiffness reductions at the exact stiffness elements compared to the single damage cases.The proposed model is applied for such cases and an example is shown in Fig.6.The identified stiffness reductions are very close to the actual values with very small false identifications.Hence the identified locations of the damage also show a very good level of accuracy.

Fig.6 An example of multiple damage identification

5.Conclusions

This paper proposes a method of applying DAE for structural condition monitoring and damage identification.The relationship that exists in the modal information,such as frequencies and mode shapes,to structural stiffness parameters is trained for SHM.The proposed DAE model is developed not only for dimensional reduction that preserves the necessary information but also for identifying the relationship between the input modal information and the output stiffness parameters.Effective dimension reduction and accurate relationship training are achieved with the proposed DAE model based on the deep learning architecture and the pre-training algorithm.The proposed model has produced excellent stiffness prediction and damage identification with some testing data on a steel frame structural model.The work can be extended to combine more vibration properties and consider noise effect towards a robust training thus improving the performance for structural health monitoring and damage identification.

免责声明

我们致力于保护作者版权,注重分享,被刊用文章因无法核实真实出处,未能及时与作者取得联系,或有版权异议的,请联系管理员,我们会立即处理! 部分文章是来自各大过期杂志,内容仅供学习参考,不准确地方联系删除处理!