999精品在线视频,手机成人午夜在线视频,久久不卡国产精品无码,中日无码在线观看,成人av手机在线观看,日韩精品亚洲一区中文字幕,亚洲av无码人妻,四虎国产在线观看 ?

Classification and Categorization of COVID-19 Outbreak in Pakistan

2021-12-10 11:57:34AmberAyoubKainaatMahboobAbdulRehmanJavedMuhammadRizwanThippaReddyGadekalluMustufaHaiderAbidiandMohammedAlkahtani
Computers Materials&Continua 2021年10期

Amber Ayoub,Kainaat Mahboob,Abdul Rehman Javed,Muhammad Rizwan,Thippa Reddy Gadekallu,Mustufa Haider Abidi and Mohammed Alkahtani

1Department of Computer Science,Kinnaird College for Women,Lahore,54000,Pakistan

2Department of Cyber Security,Air University,Islamabad,Pakistan

3School of Information Technology and Engineering,Vellore Institute of Technology,Tamil Nadu,India

4Raytheon Chair for Systems Engineering,Advanced Manufacturing Institute,King Saud University,Riyadh,11421,Saudi Arabia

5Industrial Engineering Department,College of Engineering,King Saud University,Riyadh,11421,Saudi Arabia

Abstract:Coronavirus is a potentially fatal disease that normally occurs in mammals and birds.Generally,in humans,the virus spreads through aerial droplets of any type of fluid secreted from the body of an infected person.Coronavirus is a family of viruses that is more lethal than other unpremeditated viruses.In December 2019,a new variant,i.e.,a novel coronavirus(COVID-19)developed in Wuhan province,China.Since January 23,2020,the number of infected individuals has increased rapidly,affecting the health and economies of many countries,including Pakistan.The objective of this research is to provide a system to classify and categorize the COVID-19 outbreak in Pakistan based on the data collected every day from different regions of Pakistan.This research also compares the performance of machine learning classifiers(i.e.,Decision Tree(DT),Naive Bayes(NB),Support Vector Machine,and Logistic Regression)on the COVID-19 dataset collected in Pakistan.According to the experimental results,DT and NB classifiers outperformed the other classifiers.In addition,the classified data is categorized by implementinga BayesianRegularization Artificial Neural Network(BRANN)classifier.The results demonstrate that the BRANN classifier outperforms state-of-the-art classifiers.

Keywords:COVID-19;pandemic;neural network;BRANN;machine learning

1 Introduction

The COVID-19 outbreak that appeared in Wuhan,China at the end of December 2019 was initially considered a pneumonia based on etiology.The virus soon spread worldwide at a rapid rate[1].On January 30,2020,the World Health Organization(WHO)declared the COVID-19 outbreak a Public Health Emergency of International Concern[2,3].This virus has affected people in more than 209 nations around the world.The overheads of the coronavirus outbreak are continually increasing.When this virus first started to spread,there were approximately 600 confirmed cases in China.Globally,the number of people who have died because of this virus has been increasing daily[4].The WHO determined that the most common symptoms of this virus are tiredness,fever,and dry cough[5].Most people with these symptoms can recover without extraordinary treatment or prescriptions.However,some patients have more severe symptoms,such as a runny nose,sore throat,nasal congestion,and general or severe pain.Typically,80%of people who became infected have severe symptoms[6].In the United Kingdom,the National Health Service(NHS)has reported cases with more severe side effects,including high fever and persistent cough.The NHS recommends that anybody with these sorts of symptoms should selfquarantine for 7 to 14 days[7].The infection spreads between individuals in close contact who are exposed to respiratory aerosol droplets that are emitted,primarily when an infected person coughs or sneezes,or shouts,sings,or talks.

For the most part,the droplets do not travel significant distances.Typically,they fall to the ground or onto immediate surfaces.Transmission may also occur through little droplets that can remain suspended in the air for longer periods of time[8].People may become infected by touching a contaminated surface and then touching their face[9].Outbreaks and rapid spread are highly expected,even before symptoms are noticeable,and from individuals who do not possess any symptoms of being infected by the virus,but they carry it[10].Fig.1 represents the worldwide spread of this coronavirus.It is believed that the virus did not spread in Pakistan the way it spread in other countries,like China,the USA,and Italy.Pakistan,with permeable borders,is sandwiched between two focal points of this coronavirus(China and Iran).

Figure 1:Worldwide spread of COVID-19

Recently,Pakistan has reinforced their precautions against COVID-19 by various strategies,such as detailing the use of national crisis readiness,compulsory thermal screenings at all entry points,observation of regional spread,contact tracing,and information assortment through various sources.Testing has been reinforced by bringing in Polymerase Chain Reaction units for SARS-COV-2 diagnostics[11].Assets have been deployed to setup quarantine centers in preparation of expected cases.Locations for these stations include a few urban areas,emergency clinics,and reconnaissance units that have been actuated to track the contacts of affirmed cases,as suggested by the WHO[10].The COVID-19 infection has spread to more than 213 nations,and as of April 17,2020,there were 1,995,983 confirmed cases and 131,037 deaths[12].

Pakistan revealed its initial two positive cases on February 26,2020.These cases were connected to travel to Iran[13].The number of positive cases across the nation rose to 7,025 on April 17th,2020:3,276 positive cases and 135 deaths in Punjab,2,008 cases in Sindh,993 cases in Khyber Pakhtunkhwa,303 cases in Balochistan,237 cases in Gilgit Baltistan,154 cases in Islamabad Capital Territory(ICT),and 46 cases in Azad Jammu Kashmir[14].

The number of positive cases is rising rapidly every day.In fact,in most countries,the number of cases is probably much higher than recorded,due to limited testing[14,15].Fig.2 shows the number of total coronavirus cases in Pakistan.The exponential increase in cases has driven the Government to force total and severe lockdowns in numerous urban areas[16].

Figure 2:COVID-19 cases in Pakistan

Fig.3 shows the total number of COVID-19 cases,the total number of deaths,and the total number of recovered cases in different regions of Pakistan.

Figure 3:Total recovered cases,deaths,and confirmed cases in Pakistan

1.1 Problem Statement

Deaths due to COVID-19 are increasing day by day in Pakistan.The nature of the COVID-19 outbreak differs in various countries.For example,in China,Iran,and France,COVID-19 outbreak is characterized by extremely high numbers and severe cases.The outbreak severity can be detected through an increase in the number of deaths.Thus,in this research,the nature of the outbreak is detected with the help of the COVID-19 dataset for the past few months in Pakistan collected by the Government.If the nature of the COVID-19 outbreak can be detected from the past months’ death rate,then with the help of standard operating procedure and precautionary measures,the death rate can be reduced in the coming months in Pakistan.For outbreak detection,the COVID-19 dataset is first classified with machine learning(ML)classifiers.Then the classified dataset is categorized into severe and normal COVID-19 outbreaks,using the Bayesian regularized artificial neural network(BRANN)classifier.

1.2 Motivation and Contribution

The COVID-19 death rate is high and is increasing day by day globally[17].This research is intended to classify and categorize the nature of the outbreak in Pakistan using machine learning classifiers.In this study,a dataset of COVID-19 patients from different regions(primarily populated regions)of Pakistan is preprocessed and then classified to understand the nature of the virus and its outbreak in Pakistan.Machine learning classifiers:Decision Tree(DT),Naive Bayes(NB),Support Vector Machine(SVM),and Logistic Regression(LR)are implemented,and results are compared based on performance measures(i.e.,accuracy,precision,and recall).The comparison of machine learning classifiers indicates that the DT and NB classifiers return 100% accuracy.Classified data is input to the BRANN to categorize the COVID-19 outbreak in Pakistan to determine if the nature of the outbreak will be normal or severe.

The remainder of this paper is organized as follows.Section 2 discusses the related work.Section 3 provides the proposed methodology to classify and categorize COVID patients.Section 4 provides the experimental analysis and results.Conclusions and suggestions for future work are presented in Section 5.

2 Literature Review

COVID-19 virus was initially discovered in December 2019 in the population of Wuhan,China.Later,it spread to other regions of China and other parts of the world[18].Various papers and studies have applied different techniques on COVID-19 datasets.In this section,several studies that investigate the application of machine learning algorithms on different diseases are discussed.

SVM and Mutual Information techniques have been applied to classify genes[19].In that study,the authors claimed that the SVM classifier achieved the best mean accuracy rate.In addition,the fuzzy KNN approach has been used on a Parkinson’s dataset to help generate a diagnostic system that will make better clinical diagnostic decisions[20].Here,researchers utilized different machine learning techniques to propose a novel method.They computed significant features by implementing machine learning techniques to improve the accuracy rate of predicting cardiovascular disease.Their prediction model gives 88.7% accuracy[21].In 2015,a combination of SVM and fuzzy logic was applied for the risk classification of diabetes.Fuzzy reasoning was used to predict the risk factors of(Type-II)diabetes,and an SVM was used to generate fuzzy rules from the Pima diabetes dataset[22].

Other researchers used the NB classifier to improve the accuracy of predicting heart disease[23].Different machine learning techniques,such as Artificial Neural Network(ANN),random forest(RF),and K-means clustering techniques were implemented to predict diabetes.The ANN technique provided the best accuracy rate(75.7%)in the prediction of diabetes[24].Some researchers also implemented machine learning techniques to predict hypertension outcomes based on medical data.In that study,the researchers evaluated four classifiers,i.e.,SVM,DT,RF,and XGBoost,to meet the desired accuracy level of the prediction system.XGBoost produced the best results among the four classifiers and provided a system accuracy of 94.36%.[25,26].

Other researchers used histopathological data patients who had a lung lobectomy to treat adenocarcinoma.For both “accidental” models,adjacent to malignancies,the lungs show edema and fundamental proteinaceous exudates as huge protein globules[27].The researchers documented vascular joins with blazing gatherings of fibrinoid content,multinucleated goliath cells,and pneumocyte hyperplasia.In addition,some researchers used the ANFIS model to estimate landslide susceptibility and to develop a model to predict landslides.The ANFIS model was used to train and validate the dataset[28].Different ML classifiers have been used to develop predictive models[29,30].In 2017,researchers proposed an SVM and fuzzy logic-based system automatically block pornographic content on the web.SVMs have also been used in statistical learning approaches to classify hypothesis test data and compute the error rate using the Gaussian-density function[31,32].

3 Proposed Methodology

Machine learning classifiers,DT,NB,LR,and SMV,are used to classify and categorize the COVID-19 outbreak in different regions of Pakistan.The proposed system is shown in Fig.4.

Figure 4:Proposed system for COVID-19 data classification and prediction

3.1 Dataset

The “Corona-Virus Pakistan Dataset 2020” was downloaded from Kaggle[33].The dataset contains 13 features that represent the lab tests of suspected,confirmed,and fatal COVID-19 cases per day in the most populated regions of Pakistan(Tab.1).The dataset features are listed in Tab.2.The dataset has 315 rows and 13 columns,i.e.,11089 data items.The dataset was checked for null and missing values of categorical features;none were found.The data distribution of categorical features,such as Date and Province,are shown in Fig.5.

Table 1:Selected regions of Pakistan in dataset

Table 2:Features of COVID-19 dataset

Figure 5:Data distribution of categorical features

3.2 Dataset Preprocessing

Preprocessing is necessary to avoid misclassified results and errors[34,35].Data preprocessing involved data preparation,data exploration,data distribution,and replacing categorical features.Preprocessing resulted in a clean dataset suitable for classification.This preprocessed dataset is fed to the machine learning classifiers to produce classified results[36].

4 Experimental Analysis and Results

For the classification of the dataset,Google Colab was used for python coding,and dataset categorization was implemented through MATLAB.The dataset was split into training(70%)and testing(30%)sets.The metrics used in this work are as follows.

4.1 Decision Tree Classifier

The COVID-19 dataset was classified using the DT ID3 classifier.The results are shown in Tab.3.As can be seen,this classifier achieved 100% accuracy,precision,and recall.The confusion matrix for the DT classifier is plotted in Fig.6a.

Table 3:Results achieved for decision tree classifier

4.2 Naive Bayes Classifier

The NB Classifier is implemented on the COVID-19 dataset because it is a continuous dataset.The NB classifier also achieved 100% accuracy(Tab.4).The confusion matrix for this classifier is shown in Fig.6b.

4.3 Logistic Regression Classifier

The LR classifier has been used successfully to predict various diseases[37,38].The testing data is predicted for the first 25 entries.The histogram of the predictions is shown in Fig.7.Figs.8a and 8b depict the confusion matrices for LR and SVM classifiers respectively.The Receiver Operating Characteristics(ROC)plot for the COVID19 dataset,based on true positive rate and false positive rate,is shown in Fig.9a.The LR ROC curve covers 91% of the area.The results obtained for LR are listed in Tab.5.

Figure 6:Confusion matrices for both classifiers(a)Decision tree classifier(b)Naive Bayesian classifier

Table 4:Results achieved for Naive Bayesian classifier

Figure 7:Histogram of predicted probabilities

4.4 Support Vector Machine Classifier

The linear SVM classifier achieved precision of 98%.The ROC curve for multiclass SVM is depicted in Fig.9b.It shows that the ROC curve for class-1 covers 100% of the area,while class-2 covers 88% of the area.Tab.6 lists the SVM results using formulas(1–3).

Figure 8:Confusion matrices for(a)LR and(b)SVM classifiers

Figure 9:ROC Curve for(a)LR and(b)SVM classifiers

Table 5:Results achieved for logistic regression classifier

DT and NB classifiers yielded 100% accuracy for this dataset.Tab.7 shows the results for the DT,NB,LR,and SVM classifiers.The classified dataset is input to an ANN(Section 4.5)for data categorization.

Table 6:Results achieved for SVM classifier

Table 7:Comparison of classification results

4.5 Artificial Neural Network

In the Artificial Neural Network training classifier,Bayesian regularization is used to categorize the search space into two classes:normal outbreak and severe outbreak.This classifier is used to categorize the nature of the COVID-19 outbreak in Pakistan based on data collected from various regions.Fig.10 shows the COVID-19 dataset simulation architecture.

Figure 10:COVID-19 dataset simulation architecture

Algorithm 1:Algorithm for Classification 1. Provide the Input Parameters 2. Data Preprocessing 3. Checking of Conditional Probability 4. While(error-ratethreshold-value 7.Back Propagation 8.Weight setting 9.End If 10. End While 11. Neural Network’s Bayesian regularization 12. Classification Results 13. Classification of the Outbreak Nature

The output is labeled 0 or and 1,where 0 represents a normal outbreak and 1 represents a severe outbreak.The output is labeled based on input parameter values.Tab.8 shows the classified,important ranking features of the dataset as inputs selected for the neural network.The Error Histogram and Regression values are given in Tab.9.

Table 8:Selected inputs of COVID-19 dataset

In Tab.9,from the 852 dataset entries,596 instances are selected for training,128 are selected for validation,and 128 are selected for testing.Furthermore,50 hidden neurons with one epoch are used for the neural network.The confusion matrix results demonstrated that the actual class predicts the predicted class with 99.88% accuracy.This indicates that the BRANN classifier predicts the results accurately for this dataset.Tab.9 shows that the BRANN classifier correctly categorized 128 data items for the validation and testing process.

Table 9:Bayesian regularization results

Figure 11:Error histogram of Bayesian regularization ANN algorithm

Figure 12:Bayesian regularization regression plot

From Fig.11,it is evident BRANN has 0 errors.This indicates that the neural network fits the data perfectly.Fig.12 shows how accurately a neural network determines the function for regression to analyze the dataset.The actual network details are shown in comparison with the target output.How accurately a model fits the data is represented through this colored line shown in the Fig.12.This line should closely intersect the real output from the left to the right corner of the regression plot.The above figure shows that the COVID-19 dataset closely fits in the BRANN model.

Fig.13 shows the training state of the BRANN(gradient,mu,parameters,the sum of squared parameters,and validation checks).They all achieve 1000 epochs,which indicates the good performance of the dataset.Fig.14 represents the mean square error of the BRANN.The blue and red training lines represents the testing mean square,and the dotted line represents the 1000 epochs.The figure listed below shows the best training performance of the BRANN.

Figure 13:Training state of BRANN

Figure 14:Mean square error of neural network BRANN

Fig.15 is the confusion matrix of the BRANN classifier.The BRANN classifier gives 99.88%accuracy for training,testing,and validation of the classifier on the COVID dataset for Pakistan.The outcome of the dataset is divided into two classes 0 and 1,where 0 denotes that the outbreak is normal,and 1 represents that the outbreak is severe.Five potential features are selected as input according to their importance that is classified through ML classifiers.

Figure 15:BRANN confusion matrix

The COVID-19 dataset for Pakistan is classified through machine learning techniques,and their accuracy results are compared.The results show that the NB classifier gives 100% accuracy for this dataset.Therefore,the BRANN best fits the dataset and categorizes the dataset into a normal class and severe class for the COVID-19 outbreak in Pakistan.

5 Conclusion

The proposed system categorizes the COVID-19 outbreak in Pakistan based on a dataset collected in different regions of Pakistan.Machine learning classifiers play a vital role in the classification,categorization,and prediction of dangerous diseases such as COVID-19.With the help of various machine learning techniques,the loss from COVID19 can be minimized in the upcoming months in Pakistan.First,we classified the COVOD-19 dataset using different machine learning classifiers.Then,the BRANN classifier was used to categorize the nature of outbreak as normal or severe.The experiments show that the BRANN provides a best fit regression plot with minimal error rate.In future,the proposed model can be further tested on a larger dataset[39,40]to test its scalability.

Funding Statement:The authors are grateful to the Raytheon Chair for Systems Engineering for funding.

Conflicts of Interest:The authors declare that they have no conflicts of interest to report regarding the present study.

主站蜘蛛池模板: 九九久久精品免费观看| 精品福利国产| 青青热久免费精品视频6| 精品欧美一区二区三区在线| 在线观看免费国产| 特级精品毛片免费观看| 欧美影院久久| 精品91视频| 99热这里只有精品在线观看| 免费啪啪网址| 亚洲第一成年网| 亚洲首页在线观看| 老司机午夜精品网站在线观看 | 亚洲日本精品一区二区| 亚洲一道AV无码午夜福利| 欧美日韩一区二区在线免费观看 | 日本精品视频| 亚洲午夜片| 成人午夜亚洲影视在线观看| 在线免费无码视频| 伊大人香蕉久久网欧美| 日韩激情成人| 欧美性猛交一区二区三区| 亚洲h视频在线| 九色视频一区| 国产成人91精品| www欧美在线观看| 亚洲有无码中文网| 欧美一级片在线| 亚洲色图在线观看| 99久久国产自偷自偷免费一区| 97人妻精品专区久久久久| 白丝美女办公室高潮喷水视频| 国产在线观看第二页| 伊在人亚洲香蕉精品播放 | 欧美精品v欧洲精品| 9丨情侣偷在线精品国产| 91 九色视频丝袜| 亚洲狼网站狼狼鲁亚洲下载| 中文字幕永久在线观看| 免费高清毛片| 欧美精品高清| 欧美日在线观看| 重口调教一区二区视频| 特级欧美视频aaaaaa| 亚洲色图综合在线| 国产精品三级专区| 国产成人AV男人的天堂| 国产精品人莉莉成在线播放| 色婷婷久久| 一本无码在线观看| 欧美亚洲中文精品三区| 2022国产91精品久久久久久| 一级香蕉视频在线观看| 国产性爱网站| 久久久久久尹人网香蕉| 性视频久久| 超清无码熟妇人妻AV在线绿巨人| 小13箩利洗澡无码视频免费网站| 欧美精品在线看| 午夜日韩久久影院| 亚洲视频色图| 91无码网站| 先锋资源久久| 精品久久国产综合精麻豆| 少妇精品网站| 国产成人精彩在线视频50| 91香蕉视频下载网站| 国产在线观看精品| 美女裸体18禁网站| 91香蕉视频下载网站| 2021无码专区人妻系列日韩| 最新痴汉在线无码AV| 内射人妻无码色AV天堂| 在线观看无码a∨| 国产综合精品一区二区| 欧美特黄一级大黄录像| 人人妻人人澡人人爽欧美一区| 国产视频一二三区| 视频一区视频二区日韩专区| 毛片免费在线视频| 亚洲欧美精品在线|