Automated Machine Learning for Epileptic Seizure Detection Based on EEG Signals

时间：2024-07-28

Jian Liu,Yipeng Du,Xiang Wang,＊,Wuguang Yue and Jim Feng

1University of Science and Technology Beijing,Beijing,100083,China

2Hwa Create Co.,Ltd,Beijing,100193,China

3Amphenol Global Interconnect Systems,San Jose,CA 95131,US

Abstract:Epilepsy is a common neurological disease and severely affects the daily life of patients.The automatic detection and diagnosis system of epilepsy based on electroencephalogram(EEG)is of great significance to help patients with epilepsy return to normal life.With the development of deep learning technology and the increase in the amount of EEG data,the performance of deep learning based automatic detection algorithm for epilepsy EEG has gradually surpassed the traditional hand-crafted approaches.However,the neural architecture design for epilepsy EEG analysis is time-consuming and laborious,and the designed structure is difficult to adapt to the changing EEG collection environment,which limits the application of the epilepsy EEG automatic detection system.In this paper,we explore the possibility of Automated Machine Learning(AutoML)playing a role in the task of epilepsy EEG detection.We apply the neural architecture search(NAS)algorithm in the AutoKeras platform to design the model for epilepsy EEG analysis and utilize feature interpretability methods to ensure the reliability of the searched model.The experimental results show that the model obtained through NAS outperforms the baseline model in performance.The searched model improves classification accuracy,F1-score and Cohen’s kappa coefficient by 7.68%,7.82% and 9.60% respectively than the baseline model.Furthermore,NASbased model is capable of extracting EEG features related to seizures for classification.

Keywords:Deep learning;automated machine learning;EEG;seizure detection

1 Introduction

According to World Health Organization(WHO)statistics,epilepsy is the most common neurological disease second only to stroke[1]and 2.4 million people are diagnosed with epilepsy each year[2].Seizures are caused by an abnormal discharge of neurons in the brain[3,4].During the seizure,the patient will lose consciousness.Some parts of the body or the whole body will even twitch[5],which can last from a few seconds to a few minutes[6].Sudden unprovoked seizures can make patients unable to protect themselves in time and even cause fainting and life-threatening[7,8].Electroencephalography(EEG)has become the standard technology for epilepsy detection due to its high spatial and temporal resolution,non-invasiveness and low cost[9].It collects electrical signals during brain activity through electrodes placed in different positions of the brain,so that it can detect abnormal brain potentials such as spikes and spikes in patients during epileptic seizures in real time[6,10].In traditional clinical diagnosis,EEG-based seizure detection relies on well-trained professionals to complete.Since epileptic seizures are random and sudden,doctors and other hospital staff need to face EEG data recorded for several hours or even days,which will undoubtedly consume a lot of time and energy for professionals to complete this work.At the same time,the large patient population will also put pressure on them[11].Moreover,nearly three-quarters of epilepsy patients come from countries below the middle income,and the high cost of artificial seizure detection will make them prohibitive[12].

Considering the limitations of artificial epilepsy detection,many researchers begin to focus on the research of automatic epilepsy detection technology based on EEG[13-15].Feature extraction is the key to the automatic detection algorithms for epilepsy EEG and the current feature extraction methods can be divided into manual design features and automatic feature extraction.Manual methods utilize prior knowledge to model the characteristics of epilepsy EEG,which usually extract temporal and frequency domain features related to seizures as the basis for epilepsy detection.In the time domain analysis,the authors in reference[16]apply sample entropy and approximate entropy as epileptic features and designed a corresponding classifier.In the field of frequency domain analysis,reference[17]uses Fourier transform to convert EEG epilepsy features into frequency domain representation for automatic epilepsy detection.Similarly,discrete wavelet transform method to extract features of epilepsy EEG signals is employed in[18].In addition,the authors in article[19]extract time domain and frequency domain features simultaneously to realize an automatic epilepsy detection algorithm.The hand-crafted approaches combined with the experience of professionals are capable of boosting the performance of the automatic epilepsy detection system in certain scenarios,but they face many challenges in actual application scenarios.Firstly,the EEG signal of epilepsy is non-stationary,so the features extracted from the same patient at different times of epileptic seizures are quite different[20].It is difficult to extract information from the features of all patients with epilepsy EEG episodes using manual design methods,which will cause information loss in the feature extraction stage to a certain extent,and reduce the accuracy of the EEG-based automatic epilepsy detection system.Secondly,due to the low signal-to-noise ratio of the EEG,the methods of manually extracting features are easily affected by noise or artifacts,resulting in inaccurate feature extraction.Therefore,it is difficult for hand-crafted approaches to adapt to the real EEG acquisition scene,which is complex and changeable.

The method of automatically extracting epilepsy EEG features based on deep learning has gradually attracted attention.Deep learning technology can automatically learn the feature extraction mode of related tasks from the provided data through the deep structure of the neural network,which has achieved excellent results in multiple EEG analysis tasks including epilepsy EEG detection and exceeds the manual feature extraction method on performance[21].Discriminative models,representative models and hybrid models are the frequently applied architectures in deep learning based methods.In the area of discriminative models,convolutional neural networks(CNN)are capable of utilizing convolutional structures to extract local features of multiple channels of EEG at the same time and are widely used in automatic epilepsy detection algorithms[22,23].The authors in reference[24]designs a 13-layer convolutional neural network to automatically detect epileptic seizures.As for the representative models,deep belief networks(DBN)[25]and authoencoder(AE)[26]also have many applications in automatic epilepsy detection and have achieved good recognition accuracy.Hybrid models can apply the advantages of different deep learning models to improve the system performance.A combination of CNN and long short term memory(LSTM)is proposed in[27]to build an automatic epilepsy EEG detection system.Although these deep learning-based automatic detection methods for epilepsy EEG have achieved good results in their respective data sets and experimental environments,the collection environment and EEG equipment of different medical institutions are different.It will lead to differences in the signal-to-noise ratio of EEG signals and the number of acquisition channels for different data sets,so it is not clear whether these automatic detection algorithms can adapt to these changes and achieve qualified performance.Furthermore,the design of deep learning architecture for epilepsy EEG detection is time-consuming and difficult,which requires the cooperation of medical institutions and professionals with computer-related backgrounds.The high threshold of algorithm design limits the application of EEG-based automatic epilepsy detection systems in countries below the middle income.

Deep learning is playing an increasingly important role in medical-related fields[28-30],especially in brain research[31-36].With the development of computing power,Automated Machine Learning(AutoML)provides the possibility of using machine learning algorithms to solve problems for people without relevant knowledge[37].In the task of automatic detection for epilepsy EEG,Neural Architecture Search (NAS) of AutoML can quickly build a detection model that adapts to specific epilepsy EEG signals and users only need to provide the corresponding data set.It will undoubtedly save much time and cost of algorithm development and accelerate the deployment of the epilepsy EEG automatic detection system.NAS methods can be divided into three types,namely NAS based on deep reinforcement learning[38,39],NAS based on evolutionary algorithm[40,41],and NAS based on Bayesian[34].NAS has been attempted to be applied in EEG analysis,where the application areas include emotion recognition[42],motor imagery EEG[43]and state evaluation[44].Moreover,feature interpretability tools are very important for deep learning-based EEG analysis,as the tools can ensure the reliability of black-box models like CNNs.Analysis of model weights[45],model activations[46]and the correlation of input and output[47]are the frequently employed methods for the inspection of neural networks in EEG signals classification.

In this paper,we apply the Bayesian-based NAS algorithm of the open source platform AutoKeras[37]to implement the search of the convolutional neural network for the automatic detection of epilepsy EEG.We compare the searched model with EEGNet that can perform EEG analysis across EEG paradigms[48].Then deep learning feature interpretability approaches are utilized to verify the reliability of the searched model.The contributions of this work can be categorized as:

• We study the possibility of automatic machine learning in the field of automatic epilepsy EEG detection.The experimental results show that the searched model achieves an accuracy of 76.61%in the test set.

• We use the deep learning feature interpretability method to analyze the NAS-based model.The analysis results show that the model extracts the EEG features related to epileptic seizures,which ensures the reliability of automatic machine learning algorithms.

The organization of this paper is as follows.Section 2 introduces the dataset,NAS algorithm and experimental protocol.The classification results and the analysis of feature explainability are given in Section 3.Section 4 presents discussion and the conclusion is illustrated in Section 5.

2 Materials and Methods

2.1 Dataset Description

The experimental data we use comes from[49]and here is a brief description of dataset.This dataset contains five sets(represented as A-E),and each set has a total of 100 pieces of EEG data.The EEG signals of sets A and B are extracranial data collected from five healthy subjects.The difference is that the set A is recorded with eyes open and the signals in set B are closed eyes.Sets C,D and E are intracranial data acquired from five epilepsy patients,in which the epileptogenic zone of the patients has been diagnosed through resection.EEG in sets C and D are acquired when the epilepsy patient had no seizures,where the data collection location of set C is located in the epileptic area and that of set D is set in the hippocampal formation of the opposite hemisphere of the brain.The EEG recorded during the epileptic seizure is included in set E.All analog EEG signals are converted to digital signals at a sampling rate of 173.61 Hz and saved as single-channel data.Each EEG signal has been screened for artifacts to avoid the influence of noise and has passed the weak stationarity criterion.Then the original data is divided into EEG segments with a duration of 23.6 seconds[49].

2.2 Neural Architecture Search Method

We use AutoKeras(AK)to implement automatic neural architecture search,which is one of the most widely used open-source AutoML system[37].Network morphism,a technique that changes the structure of a neural network but maintains its function,is adopted to improve search efficiency in this system[50,51].Moreover,AK applies Bayesian optimization to choose the most prospective operation of network morphism each time,thus guiding through the neural architecture search space[37].The flowchart of the NAS algorithm is shown in Fig.1.

The purpose of the NAS algorithm is to search for the best performing model on a given epilepsy EEG dataset.Here,we define the epilepsy EEG dataset asED,which is divided into training setEDtrainand validation setEDval.At the same time,we define the search space asSE,and the NAS algorithm uses the cross-entropy loss functionCEto evaluate the performance of the searched model.Assuming thatω*is the best performing model searched for the NAS algorithm andω*can be expressed as,

whereθrepresents the parameters of the model.To implement the use of Bayesian optimization algorithms to guide exploration of the search space via morphing the neural architectures,theSEneeds to satisfy the traditional Gaussian process assumption.

In order to make the NAS space meet the hypothesis of Gaussian process,AK propose an editdistance kernel function for neural networks and it can be written as[37],

wheres1ands2represent two neural network architectures,αdenotes a function to map distance to the new space,MlaandMscare the distance for layer morphism and skip-connections morphism respectively,andβis the balancing factor ofMlaandMsc.AndMlacan be written as,

wherep(.)is the width of the layers andφl(.)represents the injective matching function of layers.

Similarly,Msccan be expressed as,

whereq(.)denotes the topology level of the layer where the skip connection starts,andr(.)denotes the number of layers between the start and end point of the skip connection.

Then AK choose upper-confidence bound(UCB)[52]as acquisition function and propose a novel approach combining simulated annealing[53]and A*search[54]to optimize UCB on tree-structured space[37].

The acquisition function UCB can be written as,

whereμ(.)represents the posterior mean of the performance of the current model structurew,σ(.)expresses the standard deviation ofw.ηis an equilibrium factor that controls the progress of the NAS algorithm in the search space to find equilibrium in exploration and exploitation.The Bayesian optimization algorithm works by evaluatingγ(w)for each newly generated structure to find the best structure with smallestγ(w)as the next model to morphism and train.In order to optimize UCB on tree-structured space,a novel approach combining simulated annealing and A* search is proposed.We assume that the current set of all trained models isωall,and the current optimal performance isCEmin.For all models,use network morphism to generate a new structure,defined asω′.

Algorithm 1 The simulated annealing algorithm applied in NAS Input:ωall,CEmin,tl,fac t ←1,B ←PriorityQueue()B.Push(ωall)while Bimages/BZ_166_1022_2529_1069_2581.png∅and t ＞tl do t ←t×fac,ω ←B.Pop()ω′=nm(ω)if e(CEmin-γ(ω′))/t ＞Rand()then B.Push(ω′)end if if CEmin ＞γ (ω′)then

Algorithm 1 Continued CEmin ←γ (ω′),ωcan =ω′hspace*{2pc}end if end while return the estimated optimal model ωcan

Moreover,AK introduce a graph-level network morphism to maintain the consistency of the intermediate tensor shape[37].

2.3 Experiment Protocol

In the preprocessing,we apply the Butterworth filter to filter all segments from 0.43 to 100 Hz.Then the data of each segments is divided into 23 epochs containing 178 sampling points,and the duration of each epoch is approximately 1.025 s.The epochs are z-normalized so that the mean is equal to zero and the standard deviation is equal to one,where the z-normalization is one of the common and useful preprocess procedures for time series classification[55].Finally,we randomly selected epochs according to the ratio of 3:1:1 to form the training set,test set,and validation set.The experimental procedure of epilepsy EEG detection algorithm is shown in the Fig.2.

When using AK to automatically search the neural network structure,we regarded the singlechannel epilepsy EEG signal as a special type of image signal,and configure the ImageClassifier in AK to customize the search space.Specifically,we set the block type parameter in CnnModule to regular convolutions,which means that AK is capable of finding the model with the best performance by searching the structure composed of raw convolution.We set different search times and conduct multiple experiments.It should be noted that in the NAS algorithm,the number of searches is equal to the number of generated neural networks.In the search process,only the training set and the validation set were used.After the search was completed,the searched best model was trained for 100 more training iterations.Then we saved the model with the smallest cross-entropy loss on the validation set and tested its performance on the test set.

EEGNet[48]was chosen as the baseline model in our experiment,which is a deep convolutional neural network that can extract EEG features and classify signals across EEG paradigms.The previous classification results showed that EEGNet achieved good results on several EEG classification tasks,not limited to a specific EEG paradigm.Depthwise convolutions and separable convolutions are employed to reduce the amount of model parameters to adapt to different sizes of EEG datasets in EEGNet[48].Moreover,depthwise convolutions are also used as spatial filters to extract the frequency-specific spatial features of EEG,and separable convolutions are applied for compression and correlation extraction of high-dimensional feature maps[48].

Three performance metrics are introduced to evaluate the classification results in our experiment,including classification accuracy,F1-score and Cohen’s kappa coefficient[56].The accuracy is the ratio of the number of samples correctly classified by the model to the total number of samples for a given test set.And it can be expressed as,

wheretpsandtnsis the true positives and true negatives,respectively.In addition,fpsandfnsdenotes the false positives and false negatives.

The calculation of the F1-score takes into account the Precision and Recall,which can reflect the performance of the algorithm to a certain extent,especially for unbalanced data.F1-score can be written as,

Cohen’s kappa coefficient is used to measure the degree of consistency between model classification and manual labeling.The larger the value,the better the model performance.The calculation of the Cohen’s kappa coefficient can be expressed as,

wherePois the empirical agreement probability of labels assigned to any sample,andPeis the expected agreement when labels are assigned randomly by two annotators.

We apply one-way analysis of variance to perform probability testing,modeling classification indicators as response variables and different approaches as factors.

2.4 Feature Explainability

Deep learning interpretability methods,as an important tool for testing the reliability of black box models such as deep convolutional neural networks,is very necessary for the application of algorithms in medical scenarios such as epilepsy detection.For the deep learning model obtained using the NAS method,since there is no prior knowledge and artificial experience involved in the structural design,it is difficult to ensure the reliability of neural networks only from the performance metrics on test set.Therefore,we use the feature explainability approaches to analyze the features extracted by the model,verifying that the neural networks extract features related to epilepsy EEG for classification instead of noise or artifacts.

We apply t-distributed stochastic neighbor embedding (t-SNE)[57]algorithm to reduce the dimensions of the high-dimensional features extracted by the searched model to three dimensions for visualization.The t-SNE algorithm is a machine learning algorithm widely used in nonlinear dimensionality reduction.Through dimensionality reduction and visualization of high-dimensional data,we can intuitively see the differences of extracted features between the classes,thus evaluating the model’s ability to extract task-related features.

You can guess how frightened she was! But the lion seemed in such pain that she was sorry for him, and drew nearer and nearer till she saw he had a large thorn in one foot

We employ the‘Gradient*Input’method[58]in the DeepExplain framework[59]to calculate the correlation between EEG input features and model decisions.This method is based on the gradient and the forward-backward iterations of the neural network to obtain the influence of each input sample on the neurons in the model decision layer,so as to determine which features will activate the neurons related to the right decision and which will interfere with the activation of the correct neurons in the model.We can visualize the main basis for the analysis of the searched model using this approach,and verify that the model extracts the relevant features of epilepsy EEG for classification.

3 Results

3.1 Classification Results

Fig.3 shows the performance of the convolutional neural network on the test set under different trial settings of the search algorithm,where the number of trials is the number of deep learning models tried by the NAS and error bars represent two standard errors of the mean.It can be clearly seen from the figure that the performance metrics (accuracy,F1-score and Cohen’s kappa coefficient) of the searched model reach the best when the search algorithm has the largest number of trials (150).And the classification accuracy of this searched model on the test set is 76.61%.When the number of models tried by the search algorithm is less than 100,the performance of the searched model is basically the same,and there is no obvious improvement.However,when the number of trials of the search algorithm is greater than 100,the performance of the final searched model on the test set boosts as the number of trials increases.

Through the experimental results,we can find that the performance of the optimal model grows faster after the NAS algorithm evaluation exceeds 100 trials,indicating that the Bayesian optimizer plays an important role in generating new neural network architectures.At the beginning of the NAS,the Bayesian optimizer is continuously trained by the performance of the model on the test set and the corresponding model structure to build the relationship between the model and the accuracy.In the later stage of the NAS algorithm,the trained Bayesian optimizer is capable of evaluating the accuracy of the neural network on the test set to find a model architecture with better performance,thus reducing the search time and improving the efficiency of the NAS.

3.2 Feature Explainability

We use the t-SNE algorithm to reduce the dimensionality of the high-dimensional feature vectors output by the global average pooling layer and visualize it in a 3-dimensional space.The visualization results are shown in the Fig.4,where the data is randomly selected from the test set,and the color represents the classes of the EEG feature.We can see that the features extracted by the convolutional neural network are clustered together according to the corresponding classes,reflecting the class difference of the high-dimensional features.The characteristics of the data collected from healthy subjects are the most similar,while the features of EEG acquired from patients with epilepsy are quite different.This indicates that trained model obtained by NAS has the feature extraction ability to classify epilepsy EEG.

Through the feature representation extracted by the model searched by NAS,we can see that the model is good at extracting the features of epilepsy EEG signals and classifying epilepsy EEG and signals with other labels.In addition,it can be clearly observed that it is difficult for the NAS model to extract distinguishable features from the EEG collected in normal people with eyes open and eyes closed,which means that the signals with these two labels have high similarity.Boosting the classification performance of the NAS model on these two labels is one of our follow-up works.

Fig.5 shows the relevance between EEG data with epilepsy class and model decisions calculated using the “Gradient * Input”[58]method in the Deep Explain framework[59].We randomly select high-confidence and low-confidence data in test set for analysis,where the time domain signal displayed is processed by z-normalization.It can be seen from the figure that regardless of the level of confidence,the significant changes in the amplitude of the input original EEG signal are highly correlated with the model decision relevance,which indicates that the model has extracted abnormal discharge features of epileptic EEG for seizure detection.Furthermore,we can see that the signal part with sudden amplitude changes has a high degree of consistency with the model for high-confidence data,which means that the model basically does not miss the extraction of the characteristics of each abnormal discharge.But for low-confidence data,the feature of abnormal discharge has a low degree of matching with the model decision-making correlation.The model misses the extraction of some epilepsy features in the original input EEG signal,which may easily cause classification errors.

4 Discussions

4.1 Performance Comparison

4.2 The Architect of Searched Model

Fig.7 and Tab.1 shows the architecture of the convolutional neural network with the best performance obtained by NAS.It can be seen from the figure that the feature extraction part of this model consists of two convolution-max-pooling blocks,where each convolution-max-pooling block contains two convolutional layers and one maximum pooling layer.The convolutional layers are used to extract the local features of epilepsy EEG signals,and the stacking of two convolutional layers are capable of making the extracted EEG features more high-dimensional.The maximum pooling layers are added to compress the dimensionality of the feature map,thus avoiding the model from overfitting due to excessive parameters and improving the model’s ability to resist noise.The feature extraction part of this model is similar to the design of DeepConvNet[60]and convolutional neural network in[61]for EEG analysis.Then the model extracts global EEG features through the global average pooling layer and employs dropout technology as a regularization measure.

Table 1:The summary of NAS-searched model architecture

The number of searched model parameters under different trial settings is shown in Tab.2.Since the AutoKeras platform applies network morphism,the model built by NAS gradually changes from a simple model to a complex one.With the increase in the number of trials,the amount of model parameters is increasing.It can be seen from the classification results that the corresponding feature extraction capabilities of searched model are also improving.This provides practical experience for the subsequent application of NAS to realize epilepsy EEG detection algorithms.A model with great performance requires sufficient number of trials for NAS.

Table 2:The number of trainable parameters for models under different trial settings

5 Conclusions

In this work,we explore the possibility of AutoML playing a role in the task of automatic epilepsy EEG detection.We employ the neural architecture search algorithm based on network morphism in the AutoKeras platform to realize the design of the automatic epilepsy EEG detection model.The experimental results show that the multiple performance metrics (accuracy,F1-score and Cohen’s kappa coefficient)of the model obtained by NAS are better than the baseline model on the test set.In addition,deep learning feature interpretability methods are applied to analyze the feature extraction of the model,ensuring the reliability of the algorithm.However,there are still some limitations in this work.First,the search efficiency of the NAS algorithm needs to be improved.Second,the model generated by NAS has a large number of model parameters,which limits the response speed of practical applications.In future works,we will further boost the search efficiency of the NAS algorithm and promote the NAS to optimize the amount of parameters in the generated neural networks.Moreover,we intend to improve the ability of the model to analyze EEG signals in different states,and deploy it to the actual epilepsy EEG detection system.

Funding Statement:This work is supported by Fundamental Research Funds for the Central Universities(Grant No.FRF-TP-19-006A3).

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.