Article

Four Types of Multiclass Frameworks for Pneumonia Classification and Its Validation in X-ray Scans Using Seven Types of Deep Learning Artificial Intelligence Models

1 School of Biomedical Engineering, Indian Institute of Technology (BHU), Varanasi 221005, India
2 Department of Radiology, Massachusetts General Hospital, Boston, MA 02115, USA
3 Department of Radiology and Ultrasound, University Hospital for Infectious Diseases, 10000 Zagreb, Croatia
4 Department of Radiology, Azienda Ospedaliero Universitaria (A.O.U.), 10015 Cagliari, Italy
5 Stroke Diagnostic and Monitoring Division, AtheroPoint™, Roseville, CA 95661, USA
6 Knowledge Engineering Center, Global Biomedical Technologies, Inc., Roseville, CA 95661, USA
* Author to whom correspondence should be addressed.
Diagnostics 2022, 12(3), 652; https://doi.org/10.3390/diagnostics12030652
Submission received: 21 February 2022 / Revised: 4 March 2022 / Accepted: 4 March 2022 / Published: 7 March 2022
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)

Abstract

Background and Motivation: The novel coronavirus causing COVID-19 is exceptionally contagious and highly mutative, decimating human health and life, as well as the global economy, through the constant evolution of new pernicious variants and outbreaks. The reverse transcriptase polymerase chain reaction currently used for diagnosis has major limitations. Furthermore, the multiclass lung classification X-ray systems having viral, bacterial, and tubercular classes—including COVID-19—are not reliable. Thus, there is a need for a robust, fast, cost-effective, and easily available diagnostic method. Method: Artificial intelligence (AI) has been shown to revolutionize all walks of life, particularly medical imaging. This study proposes a deep learning AI-based automatic multiclass detection and classification of pneumonia from chest X-ray images that are readily available and highly cost-effective. The study designed and applied seven highly efficient pre-trained convolutional neural networks—namely, VGG16, VGG19, DenseNet201, Xception, InceptionV3, NasnetMobile, and ResNet152—for classification of up to five classes of pneumonia. Results: The database consisted of 18,603 scans with two, three, and five classes. The best results were obtained using DenseNet201, VGG16, and VGG16, respectively, with accuracies of 99.84%, 96.7%, and 92.67%; sensitivities of 99.84%, 96.63%, and 92.70%; specificities of 99.84%, 96.63%, and 92.41%; and AUCs of 1.0, 0.97, and 0.92 (p < 0.0001 for all). Our system outperformed existing methods by 1.2% for the five-class model. The online system takes <1 s while demonstrating reliability and stability. Conclusions: Deep learning AI is a powerful paradigm for multiclass pneumonia classification.

1. Introduction

COVID-19 is an extremely contagious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) [1]. The virus was first isolated from three pneumonia patients with critical respiratory illness in December 2019 in Wuhan, China [2]. Within a short period, the virus spread globally, and on 11 March 2020 the World Health Organization (WHO) declared the disease a pandemic [3]. Coronaviruses (CoVs) are a tremendously diverse family of enveloped positive-sense single-stranded RNA viruses [4]. These viruses are highly pathogenic and transmissible, spreading via respiratory droplets or aerosols between individuals in close proximity [5] and triggering several pathways [6] that damage organs such as the heart [7] and liver [8], cause diabetes [9], and lead to pulmonary embolism [10,11]. In the majority of infected cases, the person begins to exhibit symptoms such as cough, fever, fatigue, and loss of smell or taste. In numerous fatal instances, the infection progresses to the lower respiratory system, including the lungs, causing illnesses such as severe pneumonia followed by multi-organ dysfunction syndrome with several secondary infections and shock [12,13,14,15,16,17].
Even two years after the virus outbreak, with almost 10 billion vaccine doses administered, the disease continues to destroy human health, life, and the global economy. The virus mutates rapidly and efficiently, gradually giving rise to more deadly variants [18]. After the severe damage caused by the Delta variant, a new variant named Omicron was discovered, which the WHO has already designated a variant of concern [19]. Several notable mutations in the spike protein of Omicron make it highly transmissible. Moreover, there remains a risk of further mutations in SARS-CoV-2, with the potential for outbreaks of even more pernicious variants.
COVID-19 infection is normally detected by a reverse transcriptase polymerase chain reaction (RT-PCR) test, which is frequently followed by chest radiographs such as X-rays and computed tomography (CT) scans [20,21]. RT-PCR is the reference technique for COVID-19 detection, although the procedure is laborious, complicated, and time-consuming, with a significantly high error rate [20,22,23]. The RT-PCR kit, along with the specific biosafety facility needed to host the PCR machine, is expensive; consequently, there is a substantial supply constraint. Many nations are experiencing problems with erroneous COVID-19 test results caused by inadequate test kit supplies, as well as delays in test results. These limitations of RT-PCR present major obstacles to controlling the disease as infection spreads among healthy populations [24].
To counteract the spread of COVID-19, patients must undergo prompt and effective screening and receive appropriate medical attention. Several medical imaging modalities, including chest X-ray (CXR) and computed tomography (CT), can help with this [25,26]. COVID-19 has recently been detected using CT imaging [25,27]; however, the high patient radiation dose and screening expense are principal disadvantages of using CT imaging for diagnosis [28]. On the other hand, CXR equipment is commonly available in hospitals and diagnostic centers to create a 2D projection of the thorax quickly and affordably. Radiologists already use the CXR modality to detect chest abnormalities in various lung illnesses, including pneumonia and tuberculosis, and COVID-19 detection has also been performed using CXRs in a few patients [25,29]. Radiographs of COVID-19 patients reveal similar findings, such as bilateral, peripheral, and basal predominant ground-glass opacities, septal thickening, pleural effusion, bronchiectasis, and bilateral lymphadenopathy [27,30,31,32,33,34,35]. As a result, CXR scans might help in the early detection of COVID-19 in suspected persons. One challenge, however, is that the CXRs of various pneumonias are very similar; it is therefore tough to differentiate COVID-19 from other lung abnormalities manually. Nonetheless, deep learning algorithms powered by artificial intelligence (AI) can efficiently extract image-based features that radiologists may be unable to observe in the original CXR. Regarding image feature extraction and classification, convolutional neural networks (CNNs) have proven their efficiency and are widely implemented by the research community [36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56]. Nowadays, CNN-based solutions are widely utilized to resolve a variety of health problems such as brain tumor identification [57,58,59], lung and breast cancer detection [60,61,62], Alzheimer's disease diagnosis [63], cardiovascular disease prediction [64,65,66,67,68,69,70], pneumonia detection [71,72,73,74,75], and many more. With promising results in several applications, deep learning techniques for chest X-rays have been gaining prominence in recent years. The transfer learning technique has made the process smoother by facilitating the quick retraining of a very deep CNN [76,77,78,79,80,81,82,83,84,85,86,87].
In this work, we have designed and applied seven different deep learning models utilizing the transfer learning method for multiclass detection of COVID-19 in CXR images. We have performed binary and multiclass classification into COVID-19 and other lung diseases, namely viral pneumonia (VP), bacterial pneumonia (BP), tuberculosis (TB), and normal images. Thereafter, we compared the results to identify the model best suited for use in practice. Figure 1 shows the overall schematic diagram of the development of the COVID-19 detection system.
The remainder of this work is structured as follows. Section 2 reviews the related work and the contributions of different authors in this area. Section 3 explains the dataset, image preprocessing, and deep learning models. Section 4 presents the results of the experiments and their comparative performances. Section 5 deals with the models' performance evaluation. Section 6 presents the scientific validation of the proposed models on another dataset. Section 7 compares the proposed models with existing state-of-the-art methods. Finally, Section 8 concludes the study and presents the future scope.

2. Related Work

Recently, COVID-19 detection using deep learning techniques has become a very popular area of research. Several researchers have proposed deep learning methods for the detection of the disease in CXR images. However, the majority of them employed limited datasets with a small number of COVID-19 samples; consequently, their outputs may not generalize, and accuracy cannot be guaranteed on a larger dataset. Choudhury et al. [88] applied eight different pre-trained deep CNNs for the classification of CXR images with three classes named COVID-19, viral pneumonia, and normal, with a total of 423, 1485, and 1579 images for each class, respectively. The authors reported an accuracy of 97.74% by CheXNet for three classes, with equivalent precision, sensitivity, and F1-score of 96.61% and specificity of 98.31%. Hemdan et al. [89] utilized 50 CXR images (25 confirmed COVID-19 and 25 normal) for classification using pre-trained deep CNNs and achieved a maximum accuracy of 90% using the VGG16 and DenseNet201 models, with a precision of 83%, recall of 100%, and F1-score of 91% for both networks. Hussain et al. [90] developed a novel deep neural network (DNN) named CoroDet. The authors used CXR images under four classes named COVID-19, viral pneumonia (VP), bacterial pneumonia (BP), and normal, with sample sizes of 500, 400, 400, and 800 for each class, respectively. They performed classification experiments with two-class (COVID-19 vs. normal), three-class (COVID-19, VP, and normal), and four-class (COVID-19, VP, BP, and normal) models, with maximum accuracies of 99.1%, 94.2%, and 91.2%, respectively. Jain et al. [91] applied several pre-trained CNNs for the classification of CXR images into three classes (COVID-19, VP, and normal). They utilized 490 COVID-19 images and obtained a maximum accuracy of 97.97% using the Xception model. Mahdy et al. [92] recommended a deep CNN-based methodology for COVID-19 detection from chest X-ray images with an accuracy of 97.48%. Ioannis et al. [93] applied transfer learning methods to classify CXR images into COVID-19, BP, and normal classes with 224, 700, and 504 images for each class, respectively. They attained 96.7% accuracy, 98.66% sensitivity, and 96.46% specificity for the experiment. Sethy et al. [94] applied ResNet50 and SVM to classify CXRs into COVID-19, pneumonia, and normal classes, obtaining an accuracy of 95.33% for the three-class experiment. Ozturk et al. [95] introduced a novel network named DarkCovidNet, with which the authors achieved an accuracy of 98.08% for two-class classification and 87.02% for three-class classification. Khan et al. [96] introduced CoroNet, a novel network inspired by the Xception architecture. Using the CoroNet model, the authors obtained an accuracy of 95% for three-class classification into COVID-19, VP, and normal. They also performed four-class classification into COVID-19, VP, BP, and normal with 89.6% accuracy. Wang et al. [97] introduced a novel DNN, named COVID-Net, for the detection of COVID-19. The authors utilized 13,975 CXR images for the classification and achieved an accuracy of 83.5%. Afshar et al. [98] introduced COVID-CAPS, a capsule network to classify small-sized datasets of CXR images, and obtained an accuracy of 95.7%. Yang et al. [99] applied four different transfer learning-based networks to classify CXR images into binary and three-class models.
The authors obtained an accuracy of 99% for binary (COVID-19 and pneumonia) and 97% for three-class (COVID-19, pneumonia, and normal) classification, both with the VGG16 network. Nayak et al. [100] performed binary classification into COVID-19 and normal classes using 406 CXR images. The authors applied eight different pre-trained neural networks using the transfer learning method and obtained a maximum accuracy of 98.33% using the ResNet34 network. Combining machine learning and deep learning, Bhattacharya et al. [101] performed three-class classification of CXRs into COVID-19, pneumonia, and normal classes. The authors obtained a maximum accuracy of 96.6% using a combination of VGG16 and the binary robust invariant scalable key-points algorithm. Deb et al. [102] proposed a multi-model deep CNN ensemble architecture for the classification of CXRs into binary (COVID-19 and non-COVID-19) and three-class (COVID-19, pneumonia, and normal) models. The authors obtained accuracies of 98.58% for the binary and 93.48% for the three-class experiment. Nikolaou et al. [103] developed a novel CNN by modifying the pre-trained EfficientNetB0. This network was applied to binary (COVID-19 and normal) and three-class (COVID-19, pneumonia, and normal) classification, obtaining an accuracy of 95% for binary and 93% for three-class. Oh et al. [104] introduced a patch-based DNN applied to four-class classification of CXRs into COVID-19, BP, TB, and normal. Their database consisted of 502 images, of which 180 were COVID-19 images, and they obtained a classification accuracy of 88.9%. Al-Timemy et al. [105] performed five-class classification into COVID-19, VP, BP, TB, and normal classes. They utilized 2186 images, including 435 COVID-19 images, for the experiment. The authors applied a combination of DL and ML methods and attained 91.6% accuracy.
In conclusion, several recent studies have been reported for COVID-19 and other pneumonia classifications using CXR images. Most of them applied various CNN networks and achieved promising results. However, in most cases, the dataset used had a deficient number of images due to the scarcity of COVID-19 data; hence, their results need to be verified on a larger dataset. Additionally, classification into more than three relevant pneumonia classes is rare. A rigorous classification experiment on a larger dataset of COVID-19 and other similar lung disorders is required. In this study, we have designed and applied seven different deep learning models utilizing the transfer learning method for the classification of four types of pneumonia, including COVID-19. We have used one of the largest datasets to date, 18,603 CXR images, consisting of 3611 COVID-19, 1345 viral pneumonia, 2780 bacterial pneumonia, 700 tuberculosis, and 10,167 normal CXR images.

3. Methodology

We have designed and applied seven highly efficient pre-trained deep CNNs for the binary and multiclass classification of pneumonia diseases. The approaches we opted for in this experiment are described in the six subsequent sub-sections.

3.1. Dataset

In this experiment, 18,603 CXR images were used, including both anterior-to-posterior (AP) and posterior-to-anterior (PA) views. The dataset was prepared from three different publicly available databases. COVID-19, viral pneumonia, and normal CXR images were taken from the Kaggle "COVID-19 Radiography Database", winner of the COVID-19 Dataset Award by the Kaggle community [106]. The tuberculosis images were taken from the Kaggle "Tuberculosis (TB) Chest X-ray Database" [107]. Finally, the bacterial pneumonia images were taken from the Kaggle "Chest X-Ray Images (Pneumonia)" dataset [108].

3.1.1. COVID-19 Radiography Database

The COVID-19 Radiography Database includes CXR images of COVID-19 and viral pneumonia patients along with healthy persons. The dataset was created by different research groups and doctors in collaboration [88,109]. The first release of the dataset had 219 COVID-19, 1341 normal, and 1345 viral pneumonia chest X-rays. After two updates, the current dataset has grown to 3616 COVID-19, 10,192 normal, and 1345 viral pneumonia images. The images are in Portable Network Graphics (PNG) file format with a resolution of 299 × 299 pixels. We have taken all the COVID-19, viral pneumonia, and normal images for our experiment.

3.1.2. Chest X-ray Pneumonia Images

The Chest X-Ray Images (Pneumonia) dataset contains 5863 CXR images, of which 2780 are bacterial pneumonia and the rest are viral pneumonia and normal images. The CXR images were taken at the Guangzhou Women and Children's Medical Center, Guangzhou, China [110,111]. The images are in JPEG format with variable resolutions. We have taken all 2780 bacterial pneumonia images for our experiment.

3.1.3. Tuberculosis Chest X-ray Database

The Tuberculosis (TB) Chest X-ray Database contains CXR images of tuberculosis patients along with healthy individuals. The dataset was created by several research groups in collaboration with medical doctors [112]. There are 700 tuberculosis images in Portable Network Graphics (PNG) file format with a resolution of 512 × 512 pixels. We have taken all 700 tuberculosis images for our experiment. Figure 2 shows sample CXR images from each class. The images indicate that it is hard to determine the differences between them manually.

3.2. Image Processing

All the CXR images collected from the different data sources were first converted into Portable Network Graphics (PNG) file format. Out of the 18,632 collected images, 29 (i.e., <1%) were excluded from the experiment as outliers because they were missing details such as the lung region. X-ray images containing extraneous body parts were cropped to display only the chest and lungs. Image augmentation was performed for each image included in the training process. During augmentation, shearing and zooming of 20% were applied, a range typically adopted in the imaging industry [113,114,115,116]. Images were resized to 224 × 224 pixels before training, as required by the pre-trained model standards.
Finally, a total of 18,603 CXR images, including 3611 COVID-19, 1345 viral pneumonia, 2780 bacterial pneumonia, 700 tuberculosis, and 10,167 normal images, were utilized for the experiments. Table 1 shows the experimental steps and class-wise distribution of the images. Of the total, 80% (i.e., 14,879 images), comprising 2887 COVID-19, 1075 viral pneumonia, 2224 bacterial pneumonia, 560 tuberculosis, and 8133 normal images, were utilized for training the models. Next, 10% (i.e., 1862 randomly selected images), comprising 362 COVID-19, 135 viral pneumonia, 278 bacterial pneumonia, 70 tuberculosis, and 1017 normal images, were utilized for validation. Finally, 10% (i.e., 1862 randomly selected images) that were not involved in training or validation were utilized to test the models. The test set included 362 COVID-19, 135 viral pneumonia, 278 bacterial pneumonia, 70 tuberculosis, and 1017 normal images.
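For illustration, the following minimal sketch shows how the 20% shear/zoom augmentation, the rescaling and resizing to 224 × 224 pixels, and batched loading could be set up in Keras. The directory paths and the per-class subfolder layout are assumptions made for the example, not the exact pipeline used in this study.

```python
# Minimal preprocessing/augmentation sketch (assumed layout: one subfolder
# per class under data/train, data/val, and data/test -- hypothetical paths).
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# 20% shear and zoom augmentation for training images, as described above;
# pixel values are rescaled to [0, 1].
train_gen = ImageDataGenerator(rescale=1.0 / 255, shear_range=0.2, zoom_range=0.2)
plain_gen = ImageDataGenerator(rescale=1.0 / 255)  # no augmentation for val/test

# All images are resized to 224 x 224 to match the pre-trained model inputs.
train_data = train_gen.flow_from_directory(
    "data/train", target_size=(224, 224), batch_size=16, class_mode="categorical")
val_data = plain_gen.flow_from_directory(
    "data/val", target_size=(224, 224), batch_size=16, class_mode="categorical")
test_data = plain_gen.flow_from_directory(
    "data/test", target_size=(224, 224), batch_size=16,
    class_mode="categorical", shuffle=False)  # fixed order for evaluation
```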

3.3. Experimental Setup

The whole experiment was organized into three phases. During the first phase, we classified the images into two classes: (i) COVID-19 and normal, (ii) COVID-19 and viral pneumonia, (iii) COVID-19 and bacterial pneumonia, and (iv) COVID-19 and tuberculosis. In the second phase, we performed three-class classification into viral diseases, i.e., COVID-19, viral pneumonia, and normal. In the third and final phase, five-class classification was performed into viral and bacterial diseases, i.e., COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal. The experimental protocol consisted of 80% training, 10% validation, and 10% testing. The experiment was performed using Python 3.8 on a computer with an Intel Core i7 8th Generation processor, 16 GB RAM, and an 8 GB NVIDIA Quadro P4000 graphics processing unit (GPU).

3.4. Model Architectures

Transfer learning is a machine learning approach in which a model developed for one task is used as the foundation for another. It reuses a model trained on a large dataset: the pre-trained weights are used to train the network more quickly for an application with a smaller dataset. This eliminates the need for a very large dataset and shortens the training time that a deep learning system requires when created from scratch. In this work, utilizing the transfer learning approach, we applied seven highly efficient pre-trained CNNs—namely, VGG16, VGG19, Xception, InceptionV3, DenseNet201, NasnetMobile, and ResNet152—for the experiment. The architecture of each network is shown in Figure 3a–g. DenseNet offers a superior architectural design in its layering process: the feature maps of the preceding layers are utilized in all subsequent layers. This reduces complexity drastically, thereby improving performance. Whereas a conventional network with M layers has M connections, DenseNet has M(M + 1)/2 direct connections, which makes it powerful and efficient. The loss function applied for two-class classification was binary cross-entropy and for multiclass classification was categorical cross-entropy (CE). The activation function applied in the dense layer was sigmoid for binary and softmax for multiclass classification. The output layer was modified according to the number of classes. The models were trained for 25 epochs with a batch size of 16 images.
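As a concrete illustration of this setup, the sketch below assembles one of the seven networks (VGG16) with frozen ImageNet weights and a replaced classification head. The global-average-pooling layer and the Adam optimizer are assumptions made for the example; the study's exact head configuration is not specified above.

```python
# Transfer learning sketch for VGG16 (a minimal, assumed configuration).
from tensorflow.keras.applications import VGG16
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D
from tensorflow.keras.models import Model

NUM_CLASSES = 5  # 2, 3, or 5 depending on the experiment phase

# Load ImageNet weights without the original 1000-class classifier head.
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False  # reuse the pre-trained convolutional features as-is

x = GlobalAveragePooling2D()(base.output)
# Softmax dense layer for multiclass; the binary experiments instead use a
# single sigmoid unit with binary cross-entropy, as stated above.
out = Dense(NUM_CLASSES, activation="softmax")(x)
model = Model(inputs=base.input, outputs=out)

model.compile(optimizer="adam", loss="categorical_crossentropy",
              metrics=["accuracy"])
# model.fit(train_data, validation_data=val_data, epochs=25)  # batch size 16
```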

3.5. Cross-Entropy Loss Function for Models

The binary cross-entropy loss function can be defined as in Equation (1) [67]:

$$ L_{BCE} = -\frac{1}{N}\sum_{i=1}^{N}\left[\, y_i \log(a_i) + (1 - y_i)\log(1 - a_i) \,\right] \quad (1) $$

where $y_i$ is the input ground truth (GT) label 1, $(1 - y_i)$ is the GT label 0, and $a_i$ represents the predicted probability from the classifier.
The categorical cross-entropy loss function can be defined as in Equation (2) [67]:

$$ L_{CCE} = -\frac{1}{N}\sum_{i=1}^{N}\sum_{c=1}^{C} \mathbb{1}_{y_i \in C_c} \, \log a_{\mathrm{model}}(y_i \in C_c) \quad (2) $$

where $N$ is the total number of observations (images), $C$ is the number of categories or classes, and the indicator term $\mathbb{1}_{y_i \in C_c}$ equals 1 when the $i$th observation belongs to the $c$th category.
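For clarity, Equations (1) and (2) can be written out numerically as in the following sketch; the function and variable names are illustrative only.

```python
# Numerical sketch of Equations (1) and (2); names are illustrative.
import numpy as np

EPS = 1e-12  # guard against log(0)

def binary_cross_entropy(y_true, a):
    """Equation (1): 1-D arrays of {0,1} labels and predicted probabilities."""
    a = np.clip(a, EPS, 1.0 - EPS)
    return -np.mean(y_true * np.log(a) + (1.0 - y_true) * np.log(1.0 - a))

def categorical_cross_entropy(y_onehot, a):
    """Equation (2): rows are observations, columns are the C classes;
    the one-hot row plays the role of the indicator term."""
    a = np.clip(a, EPS, 1.0 - EPS)
    return -np.mean(np.sum(y_onehot * np.log(a), axis=1))

# Example: N = 3 observations, C = 3 classes.
y = np.array([[1, 0, 0], [0, 1, 0], [0, 0, 1]])
p = np.array([[0.8, 0.1, 0.1], [0.2, 0.7, 0.1], [0.1, 0.2, 0.7]])
print(categorical_cross_entropy(y, p))  # approx. 0.312
```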

3.6. Performance Metrics Used for Classification Evaluation

The performance of the proposed models was evaluated using the following metrics:
(a) Accuracy: Accuracy is the most significant criterion for analyzing a convolutional neural network's performance. It is the sum of the true positive and true negative values divided by the total number of entries in the confusion matrix, as given in Equation (3) [88]:

$$ \text{Accuracy} = \frac{\text{True Positives} + \text{True Negatives}}{\text{Total number of cases}} \quad (3) $$
(b) Precision: Precision is an important measure of the results of the CNN models. It counts how many correct positive predictions have been made, evaluated as the ratio between the true positive predictions and the sum of all positive predictions, as given in Equation (4) [88]:

$$ \text{Precision} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Positives}} \quad (4) $$
(c) Recall (Sensitivity): Recall is another important metric for analyzing a classifier's performance. It is defined as the ratio between the true positive predictions and the sum of the true positive and false negative predictions, as given in Equation (5) [91]:

$$ \text{Recall} = \frac{\text{True Positives}}{\text{True Positives} + \text{False Negatives}} \quad (5) $$
(d) F1-score: The F1-score is an important measure for assessing a test's accuracy. It is the harmonic mean of precision and recall, i.e., twice the product of precision and recall divided by their sum, as given in Equation (6) [91]:

$$ \text{F1-score} = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \quad (6) $$
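In practice, these four metrics (and the weighted averages reported in the tables that follow) can be obtained directly from the test labels and predictions, as in this illustrative sketch; the label arrays are placeholders.

```python
# Sketch of Equations (3)-(6) using scikit-learn; labels are placeholders.
from sklearn.metrics import (accuracy_score, confusion_matrix,
                             precision_recall_fscore_support)

y_true = [0, 0, 1, 1, 2, 2, 2]  # ground-truth class indices (example)
y_pred = [0, 1, 1, 1, 2, 2, 0]  # model predictions (example)

acc = accuracy_score(y_true, y_pred)              # Equation (3)
prec, rec, f1, _ = precision_recall_fscore_support(
    y_true, y_pred, average="weighted")           # Equations (4)-(6), weighted
print(f"Accuracy={acc:.4f} Precision={prec:.4f} Recall={rec:.4f} F1={f1:.4f}")
print(confusion_matrix(y_true, y_pred))           # rows: true, columns: predicted
```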

4. Results

Three different phases of experiments were performed to compare the results of each classification possibility: first the binary experiments, then the three-class, and finally the five-class classification experiment.

4.1. Binary Classification

The binary classification experiments deal with the classification of images into COVID-19 and each other class separately. We sought to determine how accurately the models could distinguish the images of the different classes from the COVID-19 class. The binary experiment was divided into four sub-phases: COVID-19 vs. normal, COVID-19 vs. viral pneumonia, COVID-19 vs. bacterial pneumonia, and COVID-19 vs. tuberculosis classification.

4.1.1. Binary Class Case 1: COVID-19 vs. Normal

The comparative performances of the different CNNs for binary classification into COVID-19 and normal images are shown in Table 2. Evaluated using Equations (3)–(6), the VGG16 network demonstrated the greatest efficiency, with the highest accuracy, precision, recall, and F1-score among all networks. VGG16 achieved a test accuracy of 97.24% with weighted averages of precision, recall, and F1-score of 97.26%, 97.24%, and 97.21%, respectively. DenseNet201 performed as the second most efficient network with an accuracy of 96.01%. The performance of ResNet152 was the least efficient, with an accuracy of 78.75%. Figure 4 shows the training and validation accuracies and Figure 5 shows the training and validation loss for the best performing VGG16 model. The graphs indicate improved accuracy and reduced loss with successive epochs. Figure 6 shows the confusion matrix of the test data classification by the VGG16 model. The confusion matrix specifies that, out of 362 COVID-19 images, 331 were correctly classified and 31 were misclassified as normal, whereas, out of 1017 normal images, 1010 were correctly predicted and seven were misclassified as COVID-19.

4.1.2. Binary Class Case 2: COVID-19 vs. Viral Pneumonia

Table 3 shows the comparative performances of the different CNNs for binary classification into COVID-19 and viral pneumonia. The NasnetMobile network performed most efficiently, with the highest accuracy, precision, recall, and F1-score among all networks. The model achieved an accuracy of 99.80% with equivalent weighted averages of precision, recall, and F1-score of 99.80% each. The VGG16 model performed as the second most efficient network with an accuracy of 99.60%. The performance of the ResNet152 model was the least efficient, with an accuracy of 97.79%.
Figure 7 shows the training and validation accuracy and Figure 8 shows the training and validation loss for the best performing NasnetMobile model. The graphs show how accuracy improves and loss reduces with successive epochs. Figure 9 shows the confusion matrix of the test data classification by the NasnetMobile model. The confusion matrix reveals that, out of 362 COVID-19 images, 361 were correctly predicted and one was misclassified as viral pneumonia. Furthermore, the model correctly predicted all 135 viral pneumonia images without any false prediction.

4.1.3. Binary Class Case 3: COVID-19 vs. Bacterial Pneumonia

The comparative performance metrics of the different CNNs for binary classification into COVID-19 and bacterial pneumonia are shown in Table 4. DenseNet201 performed most efficiently, with the highest accuracy, precision, recall, and F1-score among all networks. The model achieved an accuracy of 99.84% and equivalent weighted averages of precision, recall, and F1-score of 99.84% each. InceptionV3 and NasnetMobile performed as the second most efficient networks with an equivalent accuracy of 99.53%. ResNet152 performed least efficiently, with an accuracy of 98.59%.
Figure 10 shows the training and validation accuracy, and Figure 11 shows the training and validation loss for the best performing DenseNet201 model. The graphs show that accuracy improves and loss reduces with successive epochs.
Figure 12 shows the confusion matrix of the test data classification by the DenseNet201 model. The confusion matrix specifies that, out of 362 COVID-19 images, 361 were correctly predicted and one was misclassified as bacterial pneumonia, while the model correctly predicted all 278 bacterial pneumonia images without any false predictions.

4.1.4. Binary Class Case 4: COVID-19 vs. Tuberculosis

The comparative performance metrics of the different CNNs for binary classification into COVID-19 and tuberculosis CXR images are shown in Table 5. VGG16 performed most efficiently, with an accuracy of 99.31%, weighted averages of precision and recall of 99.31%, and an F1-score of 99.30%. VGG19 and Xception both performed as the second most efficient models with an equivalent accuracy of 99.07%. ResNet152 performed least efficiently, with an accuracy of 91.20%.
Figure 13 shows the training and validation accuracy and Figure 14 shows the training and validation loss for the best performing VGG16 model. The graphs indicate improved accuracy and reduced loss with successive epochs. Figure 15 shows the confusion matrix of the test data classification by the VGG16 model. The confusion matrix reveals that the model correctly classified all 362 COVID-19 CXR images, while out of 70 tuberculosis CXR images, 67 were correctly predicted and three were misclassified as COVID-19.

4.2. Three-Class Classification into Viral Diseases

The comparative performance metrics of the different CNNs for three-class classification into COVID-19, viral pneumonia, and normal images are shown in Table 6. The VGG16 network performed most efficiently, with an accuracy of 96.63% and equivalent weighted averages of precision, recall, and F1-score of 96.63% each. The DenseNet201 network performed as the second most efficient network with an accuracy of 95.51%. The performance of the ResNet152 model was the least efficient, with an accuracy of 77.21%.
Figure 16 shows the training and validation accuracy and Figure 17 shows the training and validation loss for the best performing VGG16 model. The graphs indicate improved accuracy and reduced loss with successive epochs.
Figure 18 shows the confusion matrix of the test data classification by the VGG16 model. The confusion matrix specifies that, out of 362 COVID-19 images, 339 were correctly classified and 23 were misclassified: 21 as normal and two as viral pneumonia. Next, out of 1017 normal images, 994 were correctly predicted and 23 were misclassified: 18 as COVID-19 and five as viral pneumonia. Further, out of 135 viral pneumonia images, 130 were correctly classified and five were misclassified as normal.

4.3. Five-Class Classification into Viral and Bacterial Diseases

The comparative performance metrics of the different networks for classification into five classes (COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal) are shown in Table 7. The VGG16 model performed most efficiently, with an accuracy of 92.70% and weighted averages of precision, recall, and F1-score of 92.41%, 92.70%, and 92.47%, respectively. DenseNet201 performed as the second most efficient model with an accuracy of 89.10%. The performance of the ResNet152 network was the least efficient, with an accuracy of 74.70%.
Figure 19 shows the training and validation accuracy and Figure 20 shows the training and validation loss for the best performing VGG16 model. The graphs indicate that accuracy improves and loss reduces with successive epochs. Figure 21 shows the confusion matrix of the test data classification by the VGG16 model. The confusion matrix reveals that, out of 362 COVID-19 images, 336 were correctly predicted and 26 were misclassified: 24 as normal, one as bacterial pneumonia, and one as tuberculosis. Next, out of 278 bacterial pneumonia images, 238 were correctly classified and 40 were misclassified: 14 as normal and 26 as viral pneumonia. Furthermore, out of 1017 normal images, 1002 were correctly predicted and 15 were misclassified: 12 as COVID-19, two as bacterial pneumonia, and one as viral pneumonia. Afterward, out of 70 tuberculosis images, 69 were correctly classified and one was misclassified as normal. Finally, out of 135 viral pneumonia images, 81 were correctly predicted and 54 were misclassified: 37 as bacterial pneumonia, 16 as normal, and one as COVID-19.

5. Performance Evaluation

We were able to design a multiclass system for COVID-19 classification and detection, and the results of each experiment are very encouraging. However, further performance evaluation is needed to establish the system's robustness. Therefore, we obtained the receiver operating characteristic (ROC) curve and the area under the curve (AUC) for the best performing model in each classification experiment.
The ROC curves are drawn using inference values and true labels for each class. Figure 22 shows the four ROC curves and AUC values for best performing models in two-class experiments. Figure 23 shows ROC curves and AUC values for the best performing model (VGG16) in three-class experiments. Similarly, Figure 24 shows ROC curves and AUC values for the best performing model (VGG16) in five-class experiments.
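For reference, a per-class (one-vs-rest) ROC curve and AUC can be derived from the softmax scores as in the sketch below; the tiny label and score arrays are placeholders for the test labels and the model's predicted probabilities.

```python
# One-vs-rest ROC/AUC sketch; arrays are illustrative placeholders.
import numpy as np
from sklearn.metrics import auc, roc_curve
from sklearn.preprocessing import label_binarize

y_true = np.array([0, 0, 1, 1, 2, 2])   # true class indices (example)
probs = np.array([[0.9, 0.05, 0.05],    # softmax scores, one row per image
                  [0.6, 0.30, 0.10],
                  [0.2, 0.70, 0.10],
                  [0.3, 0.60, 0.10],
                  [0.1, 0.20, 0.70],
                  [0.2, 0.30, 0.50]])

y_bin = label_binarize(y_true, classes=[0, 1, 2])  # one column per class
for c in range(3):
    fpr, tpr, _ = roc_curve(y_bin[:, c], probs[:, c])
    print(f"class {c}: AUC = {auc(fpr, tpr):.3f}")
```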

6. Scientific Validation

Scientific validation is a significant, integrated part of system design. The goal of model validation is to ensure that the model also functions well and delivers comparable results on different dataset domains. In this work, we verified all of our models on the facial biometric dataset Faces95 from Libor Spacek's Facial Images Databases [117]. Several articles in the literature demonstrate the use of this well-known and standardized database [118]. The database contains images of 72 individuals (both men and women) with various expressions and positions, seated at a fixed distance from the camera, giving 72 classes and a total of 1440 photographs. Sample images from the first eight classes are shown in Figure 25.
The experiments were performed under conditions similar to those for the CXR image classification. The loss function applied was categorical cross-entropy, and the activation function used in the dense layer was softmax. The models were trained for 25 epochs with a batch size of 16 images. Training, validation, and testing were done on 80%, 10%, and 10% of randomly selected images, respectively. The performances were again evaluated in terms of accuracy, precision, recall, and F1-score. Table 8 shows the comparative performance of the models. The VGG16 model performed most efficiently, with an accuracy of 98.61% and precision, recall, and F1-score of 99.07%, 98.61%, and 98.52%, respectively. Figure 26 shows the training and validation accuracy and Figure 27 shows the training and validation loss for the best performing VGG16 model. The graphs indicate improved accuracy and reduced loss with successive epochs. These results confirm that our system also performs excellently on datasets other than medical images, providing outstanding results in each scenario.

7. Discussion

The coronavirus pandemic and its fast spread have put the world in a very tough situation. Quality of life has deteriorated due to COVID-19, including increased anxiety and depression [119]. The chaotic behavior of COVID-19 has caused it to spread as a nonlinear infection throughout different parts of the world [120]. Some regions have larger numbers of infections and greater intensity and severity, while others have lower numbers and milder disease. Additionally, studies of daily case counts have shown no significant correlation in the diffusion of the disease across different parts of the world [121]. This makes it difficult to predict, prepare for, and combat outbreaks of the disease variants. To find solutions and overcome limitations such as the unavailability of RT-PCR tests, delayed results, and high costs, our strategy of deep learning-based transfer models for the classification of chest X-ray images to detect COVID-19 is proving to be most effective. We utilized 18,603 CXR images, of which 3611 were COVID-19, with the remainder of the sample composed of the viral pneumonia, bacterial pneumonia, and tuberculosis disease classes along with normal images. We organized our experiment into three phases: (i) binary classification (COVID-19 and each other class separately); (ii) three-class classification into viral diseases (COVID-19, viral pneumonia, and normal); and (iii) five-class classification into viral and bacterial diseases (COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal). To achieve optimal performance, we applied seven highly efficient pre-trained CNNs—VGG16, VGG19, DenseNet201, Xception, InceptionV3, NasnetMobile, and ResNet152—for the classification of the CXR images.

7.1. Principal Findings

For binary classification, the best performance was achieved by the DenseNet201 model, with an accuracy of 99.84% for COVID-19 and bacterial pneumonia classification. The second-best performing model was NasnetMobile, which provided 99.80% accuracy for the classification of COVID-19 and viral pneumonia. Finally, the VGG16 model performed third-best, with accuracies of 99.31% and 97.24% for classification into COVID-19 vs. tuberculosis and COVID-19 vs. normal, respectively. For the three-class and five-class experiments, the VGG16 model performed best, with accuracies of 96.63% and 92.70%, respectively. The AUC for binary classification was best for COVID-19 vs. viral pneumonia and COVID-19 vs. bacterial pneumonia, with a value of 1.0. Next, the AUC achieved for COVID-19 vs. tuberculosis was 0.98 and for COVID-19 vs. normal was 0.95. Furthermore, for three-class and five-class classification, the AUC values achieved were 0.97 and 0.92, respectively.

7.2. Benchmarking

Table 9 is the benchmarking table, presenting existing state-of-the-art classification methods and comparing them against the proposed method. Each row shows a different author's work in this area, and the columns show the methods, the number of X-ray images used, and the results of the experiment. We used the highest number of images of any work in this area. To the best of our knowledge, our NasnetMobile model achieved the highest accuracy, 99.80%, among all existing methods for binary classification of COVID-19 vs. viral pneumonia. Additionally, for the first time, we have performed binary classification into COVID-19 vs. bacterial pneumonia and COVID-19 vs. tuberculosis disease classes, with remarkable accuracies of 99.84% by the DenseNet201 model and 99.31% by the VGG16 model, respectively. Our results are very consistent with previous studies on DenseNet [122,123,124,125], which have shown superior performance of DenseNet169 applied to COVID CT/X-rays. A key advantage of DenseNet is its ability to alleviate the fundamental vanishing-gradient problem; feature reuse boosts the feature extraction process while reducing the number of parameters.
The CoroDet model by Hussain et al. [90] performed slightly better than our VGG16 model for binary classification between COVID-19 and normal: the authors achieved an accuracy of 99.1% in comparison to 97.24% by our model. However, our VGG16 model beat CoroDet for three-class classification, with an accuracy of 96.63% in comparison to 94.2%. Our model also performed very close to the 97.97% accuracy of the best performing Xception network for three-class classification applied by Jain et al. [91]; an advantage of our VGG16 network is that it is faster and takes less time to train. Furthermore, for the five-class classification, our VGG16 model outperformed the other existing models with an accuracy of 92.70%.

7.3. A Special Note on Multiclass Frameworks for Pneumonia Classification

To date, most classification experiments for COVID-19 detection have used binary or three-class models. However, beyond COVID-19, a wide range of pneumonias exists among the population, including viral, bacterial, and tubercular. It is therefore vital to distinguish COVID-19 from these other diseases, and a multiclass approach was clearly needed to classify COVID-19 against other pneumonias for the correct diagnosis of the patient. Our system is trained with the highest number of CXR images to date, includes most of the relevant pneumonia types, and is able to distinguish COVID-19 from other lung diseases with excellent accuracy.

7.4. Strengths, Weaknesses, and Extensions

The major strength of our system is its ability to detect COVID-19 very rapidly: it takes just a few seconds to provide results. Furthermore, the system is very cost-effective, as it requires only a patient's chest X-ray scan, which is low-cost and readily available. Additionally, we have performed six different types of classification experiments with consistently good accuracy, which supports our system's robustness for practical applications.
One limitation of our system is its inability to detect the severity of the infection, partially due to collimator noise; denoising methods [126] can be adopted as part of preprocessing. Predicting severity would help physicians in treatment selection and thus in the fast and secure recovery of the patient. In addition, since we had a large database of CXR images, we did not perform k-fold cross-validation, in which all images take part in training and testing at least once. In an extension of this work, we will advance the system so that it can detect COVID-19 as well as the severity of the disease. We will also include heatmap images [127,128,129] of the disease, which will show the affected areas of the lungs. Broader advanced one-pass machine learning, such as extreme learning machines [130], can be explored as more data are collected, along with pruning methods [131,132,133] to lower storage requirements and improve speed. The work can also be extended to severity estimation [134] and the application of advanced image analysis solutions such as stochastic imaging [135].

8. Conclusions

COVID-19 has become the foremost global challenge to saving human life, and several healthcare organizations are struggling to discover effective solutions. Artificial intelligence applications in computer-aided diagnosis (CAD) have proven their efficiency and importance in resolving several medical problems. Because various types of pneumonia, such as viral, bacterial, and tubercular, exist alongside COVID-19, a multiclass classification system was clearly needed, as current methods offer less reliable solutions. In this work, we have designed and applied seven highly efficient pre-trained convolutional neural networks—namely, VGG16, VGG19, DenseNet201, Xception, InceptionV3, NasnetMobile, and ResNet152—for classification of up to five classes utilizing a large database of chest X-ray scans. For the first time, we have performed binary classification into COVID-19 vs. bacterial pneumonia and COVID-19 vs. tuberculosis disease classes, achieving accuracies of 99.84% with DenseNet201 and 99.31% with VGG16, respectively. Our NasnetMobile and VGG16 models outperformed other existing methods for the binary (COVID-19 vs. viral pneumonia) and five-class (COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal) classification, with accuracies of 99.80% and 92.70%, respectively. Performing with a remarkably high level of accuracy, the proposed models can provide an alternative to the current diagnostic methods for COVID-19 that is more accurate, cost-effective, and readily available. The system may contribute to the fast diagnosis of patients, consequently lowering the medical load.

Author Contributions

N.: conceptualization; methodology; software; visualization; investigation; validation; data curation; writing—original draft. P.K.J.: validation; writing—review and editing. N.S.: validation; writing—review and editing; supervision. L.S.: validation. K.V.: validation, M.K.K.: validation. J.S.S.: conceptualization; visualization; investigation; validation; writing—review and editing; supervision. All authors have read and agreed to the published version of the manuscript.

Funding

The research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The dataset used in this study can be found in references [106,107,108].

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Abbreviations

AI Artificial intelligence
AUC Area-under-the-curve
BP Bacterial pneumonia
CAD Computer-aided diagnosis
CNN Convolutional neural network
CoV Coronavirus
CT Computed tomography
CXR Chest X-ray
DL Deep learning
DNN Deep neural network
ESD Ensemble subspace discriminant
FC Fully connected
GPU Graphics processing unit
JPEG Joint photographic expert group
ML Machine learning
Nasnet Neural search architecture network
PNG Portable network graphics
RAM Random-access memory
ReLU Rectified linear unit
ResNet Residual neural network
RNA Ribonucleic acid
ROC Receiver operating characteristic
RT-PCR Reverse transcriptase polymerase chain reaction
SARS-CoV-2 Severe acute respiratory syndrome coronavirus 2
TB Tuberculosis
VGG Visual geometry group
VP Viral pneumonia
WHO World health organization
2-D Two-dimensional

References

  1. Baig, A.M. Neurological manifestations in COVID-19 caused by SARS-CoV-2. CNS Neurosci. Ther. 2020, 26, 499–501. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Zhu, N.; Zhang, D.; Wang, W.; Li, X.; Yang, B.; Song, J.; Zhao, X.; Huang, B.; Shi, W.; Lu, R.; et al. A Novel Coronavirus from Patients with Pneumonia in China, 2019. N. Engl. J. Med. 2020, 382, 727–733. [Google Scholar] [CrossRef]
  3. Cucinotta, D.; Vanelli, M. WHO declares COVID-19 a pandemic. Acta Biomed. 2020, 91, 157–160. [Google Scholar] [CrossRef] [PubMed]
  4. V’Kovski, P.; Kratzel, A.; Steiner, S.; Stalder, H.; Thiel, V. Coronavirus biology and replication: Implications for SARS-CoV-2. Nat. Rev. Microbiol. 2021, 19, 155–170. [Google Scholar] [CrossRef] [PubMed]
  5. Pal, M.; Berhanu, G.; Desalegn, C.; Kandi, V. Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2): An Update. Cureus 2020, 12, e7423. [Google Scholar] [CrossRef] [Green Version]
  6. Saba, L.; Gerosa, C.; Fanni, D.; Marongiu, F.; La Nasa, G.; Caocci, G.; Barcellona, D.; Balestrieri, A.; Coghe, F.; Orru, G.; et al. Molecular pathways triggered by COVID-19 in different organs: ACE2 receptor-expressing cells under attack? A review. Eur. Rev. Med. Pharmacol. Sci. 2020, 24, 12609–12622. [Google Scholar]
  7. Cau, R.; Bassareo, P.P.; Mannelli, L.; Suri, J.S.; Saba, L. Imaging in COVID-19-related myocardial injury. Int. J. Cardiovasc. Imaging 2021, 37, 1349–1360. [Google Scholar] [CrossRef]
  8. Fanni, D.; Cerrone, G.; Saba, L.; Demontis, R.; Congiu, T.; Piras, M.; Gerosa, C.; Suri, J.; Coni, P.; Caddori, A.; et al. Thrombotic sinus-oiditis and local diffuse intrasinusoidal coagulation in the liver of subjects affected by COVID-19: The evidence from histology and scanning electron microscopy. Eur. Rev. Med. Pharmacol. Sci. 2021, 25, 5904–5912. [Google Scholar]
  9. Viswanathan, V.; Puvvula, A.; Jamthikar, A.D.; Saba, L.; Johri, A.M.; Kotsis, V.; Khanna, N.N.; Dhanjil, S.K.; Majhail, M.; Misra, D.P.; et al. Bidirectional link between diabetes mellitus and coronavirus disease 2019 leading to cardiovascular disease: A narrative review. World J. Diabetes 2021, 12, 215–237. [Google Scholar] [CrossRef]
  10. Cau, R.; Pacielli, A.; Fatemeh, H.; Vaudano, P.; Arru, C.; Crivelli, P.; Stranieri, G.; Suri, J.S.; Mannelli, L.; Conti, M.; et al. Complications in COVID-19 patients: Characteristics of pulmonary embolism. Clin. Imaging 2021, 77, 244–249. [Google Scholar] [CrossRef]
  11. Gerosa, C.; Faa, G.; Fanni, D.; Manchia, M.; Suri, J.; Ravarino, A.; Barcellona, D.; Pichiri, G.; Coni, P.; Congiu, T.; et al. Fetal pro-gramming of COVID-19: May the barker hypothesis explain the susceptibility of a subset of young adults to develop severe disease. Eur. Rev. Med. Pharmacol. Sci. 2021, 25, 5876–5884. [Google Scholar] [PubMed]
  12. Symptoms of COVID-19. Available online: https://www.cdc.gov/coronavirus/2019-ncov/symptoms-testing/symptoms.html (accessed on 8 January 2022).
  13. Koh, H.K.; Geller, A.C.; VanderWeele, T.J. Deaths from COVID-19. JAMA 2021, 325, 1334. [Google Scholar] [CrossRef] [PubMed]
  14. Woolf, S.H.; Chapman, D.A.; Sabo, R.T.; Weinberger, D.M.; Hill, L. Excess Deaths From COVID-19 and Other Causes, March-April 2020. JAMA 2020, 324, 510–513. [Google Scholar] [CrossRef]
  15. Faust, J.S.; Del Rio, C. Assessment of Deaths From COVID-19 and From Seasonal Influenza. JAMA Intern. Med. 2020, 180, 1045. [Google Scholar] [CrossRef] [PubMed]
  16. Iacobucci, G. COVID-19: New UK variant may be linked to increased death rate, early data indicate. BMJ 2021, 372, n230. [Google Scholar] [CrossRef] [PubMed]
  17. Ciminelli, G.; Garcia-Mandicó, S. COVID-19 in Italy: An Analysis of Death Registry Data. J. Public Health 2020, 42, 723–730. [Google Scholar] [CrossRef]
  18. WHO. Coronavirus (COVID-19) Dashboard. 2022. Available online: https://covid19.who.int/ (accessed on 7 February 2022).
  19. World Health Organization. Omicron. Available online: https://www.who.int/news/item/28-11-2021-update-on-omicron (accessed on 17 January 2022).
  20. Wang, W.; Xu, Y.; Gao, R.; Lu, R.; Han, K.; Wu, G.; Tan, W. Detection of SARS-CoV-2 in Different Types of Clinical Specimens. JAMA 2020, 323, 1843–1844. [Google Scholar] [CrossRef] [Green Version]
  21. Cau, R.; Falaschi, Z.; Paschè, A.; Danna, P.; Arioli, R.; Arru, C.D.; Zagaria, D.; Tricca, S.; Suri, J.S.; Karla, M.K.; et al. CT findings of COVID-19 pneumonia in ICU-patients. J. Public Health Res. 2021, 10, 2270. [Google Scholar] [CrossRef]
  22. Wikramaratna, P.S.; Paton, R.S.; Ghafari, M.; Lourenço, J. Estimating the false-negative test probability of SARS-CoV-2 by RT-PCR. Eurosurveillance 2020, 25, 2000568. [Google Scholar] [CrossRef]
  23. Li, Y.; Yao, L.; Li, J.; Chen, L.; Song, Y.; Cai, Z.; Yang, C. Stability issues of RT-PCR testing of SARS-CoV-2 for hospitalized patients clinically diagnosed with COVID-19. J. Med. Virol. 2020, 92, 903–908. [Google Scholar] [CrossRef] [Green Version]
  24. Yang, T.; Wang, Y.-C.; Shen, C.-F.; Cheng, C.-M. Point-of-Care RNA-Based Diagnostic Device for COVID-19. Diagnostics 2020, 10, 165. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Ng, M.-Y.; Lee, E.Y.; Yang, J.; Yang, F.; Li, X.; Wang, H.; Lui, M.M.-S.; Lo, C.S.-Y.; Leung, B.; Khong, P.-L.; et al. Imaging Profile of the COVID-19 Infection: Radiologic Findings and Literature Review. Radiol. Cardiothorac. Imaging 2020, 2, e200034. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Liu, H.; Liu, F.; Li, J.; Zhang, T.; Wang, D.; Lan, W. Clinical and CT imaging features of the COVID-19 pneumonia: Focus on pregnant women and children. J. Infect. 2020, 80, e7–e13. [Google Scholar] [CrossRef] [PubMed]
  27. Chung, M.; Bernheim, A.; Mei, X.; Zhang, N.; Huang, M.; Zeng, X.; Cui, J.; Xu, W.; Yang, Y.; Fayad, Z.A.; et al. CT imaging features of 2019 novel coronavirus (2019–nCoV). Radiology 2020, 295, 202–207. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Kroft, L.J.; van der Velden, L.; Girón, I.H.; Roelofs, J.J.; de Roos, A.; Geleijns, J. Added Value of Ultra–low-dose Computed Tomography, Dose Equivalent to Chest X-Ray Radiography, for Diagnosing Chest Pathology. J. Thorac. Imaging 2019, 34, 179–186. [Google Scholar] [CrossRef]
  29. Chen, N.; Zhou, M.; Dong, X.; Qu, J.; Gong, F.; Han, Y.; Qiu, Y.; Wang, J.; Liu, Y.; Wei, Y.; et al. Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: A descriptive study. Lancet 2020, 395, 507–513. [Google Scholar] [CrossRef] [Green Version]
  30. Huang, C.; Wang, Y.; Li, X.; Ren, L.; Zhao, J.; Hu, Y.; Zhang, L.; Fan, G.; Xu, J.; Gu, X.; et al. Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 2020, 395, 497–506. [Google Scholar] [CrossRef] [Green Version]
  31. Corman, V.M.; Landt, O.; Kaiser, M.; Molenkamp, R.; Meijer, A.; Chu, D.K.W.; Bleicker, T.; Brünink, S.; Schneider, J.; Schmidt, M.L.; et al. Detection of 2019 novel coronavirus (2019-nCoV) by real-time RT-PCR. Eurosurveillance 2020, 25, 2000045. [Google Scholar] [CrossRef] [Green Version]
  32. Chu, D.K.W.; Pan, Y.; Cheng, S.M.S.; Hui, K.P.Y.; Krishnan, P.; Liu, Y.; Ng, D.Y.M.; Wan, C.K.C.; Yang, P.; Wang, Q.; et al. Molecular Diagnosis of a Novel Coronavirus (2019-nCoV) Causing an Outbreak of Pneumonia. Clin. Chem. 2020, 66, 549–555. [Google Scholar] [CrossRef] [Green Version]
  33. Zhang, N.; Wang, L.; Deng, X.; Liang, R.; Su, M.; He, C.; Hu, L.; Su, Y.; Ren, J.; Yu, F.; et al. Recent advances in the detection of respiratory virus infection in humans. J. Med. Virol. 2020, 92, 408–417. [Google Scholar] [CrossRef]
  34. Hosseiny, M.; Kooraki, S.; Gholamrezanezhad, A.; Reddy, S.; Myers, L. Radiology Perspective of Coronavirus Disease 2019 (COVID-19): Lessons from Severe Acute Respiratory Syndrome and Middle East Respiratory Syndrome. Am. J. Roentgenol. 2020, 214, 1078–1082. [Google Scholar] [CrossRef] [PubMed]
  35. Salehi, S.; Abedi, A.; Balakrishnan, S.; Gholamrezanezhad, A. Coronavirus Disease 2019 (COVID-19): A Systematic Review of Imaging Findings in 919 Patients. Am. J. Roentgenol. 2020, 215, 87–93. [Google Scholar] [CrossRef] [PubMed]
  36. Suri, J.S.; Biswas, M.; Kuppili, V.; Saba, L.; Edla, D.R.; Suri, H.S.; Cuadrado-Godia, E.; Laird, J.R.; Marinhoe, R.T.; Sanches, J.M.; et al. State-of-the-art review on deep learning in medical imaging. Front. Biosci. 2019, 24, 392–426. [Google Scholar] [CrossRef] [PubMed]
  37. Saba, L.; Biswas, M.; Kuppili, V.; Godia, E.C.; Suri, H.S.; Edla, D.R.; Omerzu, T.; Laird, J.R.; Khanna, N.N.; Mavrogeni, S.; et al. The present and future of deep learning in radiology. Eur. J. Radiol. 2019, 114, 14–24. [Google Scholar] [CrossRef]
  38. Jena, B.; Saxena, S.; Nayak, G.K.; Saba, L.; Sharma, N.; Suri, J.S. Artificial intelligence-based hybrid deep learning models for image classification: The first narrative review. Comput. Biol. Med. 2021, 137, 104803. [Google Scholar] [CrossRef]
  39. Singh, M.; Verm, A.; Sharma, N. An Optimized Cascaded Stochastic Resonance for the Enhancement of Brain MRI. IRBM 2018, 39, 334–342. [Google Scholar] [CrossRef]
  40. Singh, M.; Venkatesh, V.; Verma, A.; Sharma, N. Segmentation of MRI data using multi-objective antlion based improved fuzzy c-means. Biocybern. Biomed. Eng. 2020, 40, 1250–1266. [Google Scholar] [CrossRef]
  41. Singh, M.; Verma, A.; Sharma, N. Bat optimization based neuron model of stochastic resonance for the enhancement of MR images. Biocybern. Biomed. Eng. 2017, 37, 124–134. [Google Scholar] [CrossRef]
  42. Singh, M.; Verma, A.; Sharma, N. Optimized Multistable Stochastic Resonance for the Enhancement of Pituitary Microadenoma in MRI. IEEE J. Biomed. Health Inform. 2017, 22, 862–873. [Google Scholar] [CrossRef]
  43. Hussain, M.; Bird, J.J.; Faria, D.R. A Study on CNN Transfer Learning for Image Classification. In Proceedings of the UK Workshop on Computational Intelligence, Nottingham, UK, 5–7 September 2018; Springer: Cham, Switzerland, 2018; Volume 840, pp. 191–202. [Google Scholar] [CrossRef]
  44. Lee, H.; Kwon, H. Going Deeper With Contextual CNN for Hyperspectral Image Classification. IEEE Trans. Image Process. 2017, 26, 4843–4855. [Google Scholar] [CrossRef] [Green Version]
  45. Zhang, M.; Li, W.; Du, Q. Diverse Region-Based CNN for Hyperspectral Image Classification. IEEE Trans. Image Process. 2018, 27, 2623–2634. [Google Scholar] [CrossRef] [PubMed]
  46. Sun, Y.; Xue, B.; Zhang, M.; Yen, G.G.; Lv, J. Automatically Designing CNN Architectures Using the Genetic Algorithm for Image Classification. IEEE Trans. Cybern. 2020, 50, 3840–3854. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Das, S.; Nayak, G.K.; Saba, L.; Kalra, M.; Suri, J.S.; Saxena, S. An artificial intelligence framework and its bias for brain tumor segmentation: A narrative review. Comput. Biol. Med. 2022, 105273. [Google Scholar] [CrossRef] [PubMed]
  48. Qin, J.; Pan, W.; Xiang, X.; Tan, Y.; Hou, G. A biological image classification method based on improved CNN. Ecol. Inform. 2020, 58, 101093. [Google Scholar] [CrossRef]
49. Wei, Y.; Xia, W.; Lin, M.; Huang, J.; Ni, B.; Dong, J.; Zhao, Y.; Yan, S. HCP: A flexible CNN framework for multi-label image classification. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 38, 1901–1907. [Google Scholar] [CrossRef] [Green Version]
  50. Li, Q.; Cai, W.; Wang, X.; Zhou, Y.; Feng, D.D.; Chen, M. Medical image classification with convolutional neural network. In Proceedings of the 2014 13th International Conference on Control Automation Robotics & Vision (ICARCV), Singapore, 10–12 December 2014; pp. 844–848. [Google Scholar]
  51. Lei, X.; Pan, H.; Huang, X. A Dilated CNN Model for Image Classification. IEEE Access 2019, 7, 124087–124095. [Google Scholar] [CrossRef]
  52. Ren, X.; Guo, H.; Li, S.; Wang, S.; Li, J. A Novel Image Classification Method with CNN-XGBoost Model. In International Workshop on Digital Watermarking; Springer: Cham, Switzerland, 2017; pp. 378–390. [Google Scholar]
  53. Sultana, F.; Sufian, A.; Dutta, P. Advancements in Image Classification using Convolutional Neural Network. In Proceedings of the 2018 Fourth International Conference on Research in Computational Intelligence and Communication Networks (ICRCICN), Kolkata, India, 22–23 November 2018; pp. 122–129. [Google Scholar]
  54. Yu, S.; Jia, S.; Xu, C. Convolutional neural networks for hyperspectral image classification. Neurocomputing 2017, 219, 88–98. [Google Scholar] [CrossRef]
  55. Zhang, C.; Pan, X.; Li, H.; Gardiner, A.; Sargent, I.; Hare, J.; Atkinson, P.M. A hybrid MLP-CNN classifier for very fine resolution remotely sensed image classification. ISPRS J. Photogramm. Remote Sens. 2018, 140, 133–144. [Google Scholar] [CrossRef] [Green Version]
  56. Pei, Y.; Huang, Y.; Zou, Q.; Zhang, X.; Wang, S. Effects of image degradation and degradation removal to CNN-based image classification. IEEE Trans. Pattern Anal. Mach. Intell. 2019, 43, 1239–1253. [Google Scholar] [CrossRef]
57. Tandel, G.S.; Balestrieri, A.; Jujaray, T.; Khanna, N.N.; Saba, L.; Suri, J.S. Multiclass magnetic resonance imaging brain tumor classification using artificial intelligence paradigm. Comput. Biol. Med. 2020, 122, 103804. [Google Scholar] [CrossRef]
  58. Tandel, G.S.; Biswas, M.; Kakde, O.G.; Tiwari, A.; Suri, H.S.; Turk, M.; Laird, J.R.; Asare, C.K.; Ankrah, A.A.; Khanna, N.N.; et al. A Review on a Deep Learning Perspective in Brain Cancer Classification. Cancers 2019, 11, 111. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Tripathi, S.; Verma, A.; Sharma, N. Automatic segmentation of brain tumour in MR images using an enhanced deep learning approach. Comput. Methods Biomech. Biomed. Eng. Imaging Vis. 2021, 9, 121–130. [Google Scholar] [CrossRef]
  60. Bhatia, S.; Sinha, Y.; Goel, L. Lung Cancer Detection: A Deep Learning Approach. In Advances in Manufacturing, Production Management and Process Control; Springer: Berlin/Heidelberg, Germany, 2018; pp. 699–705. [Google Scholar]
  61. Gao, F.; Wu, T.; Li, J.; Zheng, B.; Ruan, L.; Shang, D.; Patel, B. SD-CNN: A shallow-deep CNN for improved breast cancer diagnosis. Comput. Med. Imaging Graph. 2018, 70, 53–62. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Khan, S.; Islam, N.; Jan, Z.; Din, I.U.; Rodrigues, J.J.P.C. A novel deep learning based framework for the detection and classification of breast cancer using transfer learning. Pattern Recognit. Lett. 2019, 125, 1–6. [Google Scholar] [CrossRef]
  63. Feng, C.; ElAzab, A.; Yang, P.; Wang, T.; Zhou, F.; Hu, H.; Xiao, X.; Lei, B. Deep Learning Framework for Alzheimer’s Disease Diagnosis via 3D-CNN and FSBi-LSTM. IEEE Access 2019, 7, 63605–63618. [Google Scholar] [CrossRef]
  64. Konstantonis, G.; Singh, K.V.; Sfikakis, P.P.; Jamthikar, A.D.; Kitas, G.D.; Gupta, S.K.; Saba, L.; Verrou, K.; Khanna, N.N.; Ruzsa, Z.; et al. Cardiovascular disease detection using machine learning and carotid/femoral arterial imaging frameworks in rheumatoid arthritis patients. Rheumatol. Int. 2022, 1–25. [Google Scholar] [CrossRef] [PubMed]
  65. Boi, A.; Jamthikar, A.; Saba, L.; Gupta, D.; Sharma, A.; Loi, B.; Laird, J.R.; Khanna, N.N.; Suri, J.S. A Survey on Coronary Atherosclerotic Plaque Tissue Characterization in Intravascular Optical Coherence Tomography. Curr. Atheroscler. Rep. 2018, 20, 33. [Google Scholar] [CrossRef]
  66. Jain, P.K.; Sharma, N.; Saba, L.; Paraskevas, K.I.; Kalra, M.K.; Johri, A.; Laird, J.R.; Nicolaides, A.N.; Suri, J.S. Unseen Artificial Intelligence—Deep Learning Paradigm for Segmentation of Low Atherosclerotic Plaque in Carotid Ultrasound: A Multicenter Cardiovascular Study. Diagnostics 2021, 11, 2257. [Google Scholar] [CrossRef]
  67. Jain, P.K.; Sharma, N.; Giannopoulos, A.A.; Saba, L.; Nicolaides, A.; Suri, J.S. Hybrid deep learning segmentation models for atherosclerotic plaque in internal carotid artery B-mode ultrasound. Comput. Biol. Med. 2021, 136, 104721. [Google Scholar] [CrossRef]
  68. Jain, P.K.; Gupta, S.; Bhavsar, A.; Nigam, A.; Sharma, N. Localization of common carotid artery transverse section in B-mode ultrasound images using faster RCNN: A deep learning approach. Med. Biol. Eng. Comput. 2020, 58, 471–482. [Google Scholar] [CrossRef]
  69. Saba, L.; Jain, P.K.; Suri, H.S.; Ikeda, N.; Araki, T.; Singh, B.K.; Nicolaides, A.; Shafique, S.; Gupta, A.; Laird, J.R.; et al. Plaque Tissue Morphology-Based Stroke Risk Stratification Using Carotid Ultrasound: A Polling-Based PCA Learning Paradigm. J. Med. Syst. 2017, 41, 31. [Google Scholar] [CrossRef] [PubMed]
  70. Araki, T.; Jain, P.K.; Suri, H.S.; Londhe, N.D.; Ikeda, N.; El-Baz, A.; Shrivastava, V.K.; Saba, L.; Nicolaides, A.; Shafique, S.; et al. Stroke Risk Stratification and its Validation using Ultrasonic Echolucent Carotid Wall Plaque Morphology: A Machine Learning Paradigm. Comput. Biol. Med. 2017, 80, 77–96. [Google Scholar] [CrossRef] [PubMed]
  71. Rajpurkar, P.; Irvin, J.; Zhu, K.; Yang, B.; Mehta, H.; Duan, T.; Ding, D.; Bagul, A.; Langlotz, C.; Shpanskaya, K.; et al. CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning. arXiv 2017, arXiv:1711.05225. [Google Scholar]
  72. Jaiswal, A.K.; Tiwari, P.; Kumar, S.; Gupta, D.; Khanna, A.; Rodrigues, J.J. Identifying pneumonia in chest X-rays: A deep learning approach. Measurement 2019, 145, 511–518. [Google Scholar] [CrossRef]
  73. Varshni, D.; Thakral, K.; Agarwal, L.; Nijhawan, R.; Mittal, A. Pneumonia detection using CNN based feature extraction. In Proceedings of the IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), Coimbatore, India, 20–22 February 2019; pp. 1–7. [Google Scholar]
  74. GM, H.; Gourisaria, M.K.; Rautaray, S.S.; Pandey, M.A. Pneumonia detection using CNN through chest X-ray. J. Eng. Sci. Technol. 2021, 16, 861–876. [Google Scholar]
  75. Labhane, G.; Pansare, R.; Maheshwari, S.; Tiwari, R.; Shukla, A. Detection of Pediatric Pneumonia from Chest X-Ray Images using CNN and Transfer Learning. In Proceedings of the 2020 3rd International Conference on Emerging Technologies in Computer Engineering: Machine Learning and Internet of Things (ICETCE), Jaipur, India, 7–8 February 2020; pp. 85–92. [Google Scholar]
  76. Zhu, Y.; Chen, Y.; Lu, Z.; Pan, S.J.; Xue, G.R.; Yu, Y.; Yang, Q. Heterogeneous transfer learning for image classification. In Proceedings of the Twenty-Fifth AAAI Conference on Artificial Intelligence, San Francisco, CA, USA, 7–11 August 2011; Volume 25, pp. 1304–1309. [Google Scholar]
  77. Saba, L.; Agarwal, M.; Patrick, A.; Puvvula, A.; Gupta, S.K.; Carriero, A.; Laird, J.R.; Kitas, G.D.; Johri, A.M.; Balestrieri, A.; et al. Six artificial intelligence paradigms for tissue characterisation and classification of non-COVID-19 pneumonia against COVID-19 pneumonia in computed tomography lungs. Int. J. Comput. Assist. Radiol. Surgery 2021, 16, 423–434. [Google Scholar] [CrossRef]
  78. Shaha, M.; Pawar, M. Transfer Learning for Image Classification. In Proceedings of the 2018 Second International Conference on Electronics, Communication and Aerospace Technology (ICECA), Coimbatore, India, 29–31 March 2018; pp. 656–660. [Google Scholar]
  79. Han, D.; Liu, Q.; Fan, W. A new image classification method using CNN transfer learning and web data augmentation. Expert Syst. Appl. 2018, 95, 43–56. [Google Scholar] [CrossRef]
  80. Krishna, S.T.; Kalluri, H.K. Deep learning and transfer learning approaches for image classification. Int. J. Recent Technol. Eng. 2019, 7, 427–432. [Google Scholar]
  81. Kaur, T.; Gandhi, T.K. Deep convolutional neural networks with transfer learning for automated brain image classification. Mach. Vis. Appl. 2020, 31, 20. [Google Scholar] [CrossRef]
  82. Xiang, Q.; Wang, X.; Li, R.; Zhang, G.; Lai, J.; Hu, Q. Fruit Image Classification Based on MobileNetV2 with Transfer Learning Technique. In Proceedings of the 3rd International Conference on Computer Science and Application Engineering—CSAE, Sanya, China, 22–24 October 2019; Volume 121, pp. 1–7. [Google Scholar] [CrossRef]
  83. Zhong, C.; Mu, X.; He, X.; Wang, J.; Zhu, M. SAR Target Image Classification Based on Transfer Learning and Model Compression. IEEE Geosci. Remote Sens. Lett. 2018, 16, 412–416. [Google Scholar] [CrossRef]
  84. Rostami, M.; Kolouri, S.; Eaton, E.; Kim, K. Deep Transfer Learning for Few-Shot SAR Image Classification. Remote Sens. 2019, 11, 1374. [Google Scholar] [CrossRef] [Green Version]
85. Akilan, T.; Wu, Q.J.; Yang, Y.; Safaei, A. Fusion of transfer learning features and its application in image classification. In Proceedings of the 2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE), Windsor, ON, Canada, 30 April–3 May 2017; pp. 1–5. [Google Scholar]
  86. Deng, C.; Xue, Y.; Liu, X.; Li, C.; Tao, D. Active Transfer Learning Network: A Unified Deep Joint Spectral–Spatial Feature Learning Model for Hyperspectral Image Classification. IEEE Trans. Geosci. Remote Sens. 2018, 57, 1741–1754. [Google Scholar] [CrossRef] [Green Version]
  87. Ouhami, M.; Es-Saady, Y.; El Hajji, M.; Hafiane, A.; Canals, R.; El Yassa, M. Deep Transfer Learning Models for Tomato Disease Detection. In Proceedings of the International Conference on Image and Signal Processing, Marrakesh, Morocco, 4–6 June 2020; pp. 65–73. [Google Scholar]
  88. Chowdhury, M.E.H.; Rahman, T.; Khandakar, A.; Mazhar, R.; Kadir, M.A.; Bin Mahbub, Z.; Islam, K.R.; Khan, M.S.; Iqbal, A.; Al Emadi, N.; et al. Can AI Help in Screening Viral and COVID-19 Pneumonia? IEEE Access 2020, 8, 132665–132676. [Google Scholar] [CrossRef]
  89. El-Din Hemdan, E.; Shouman, M.A.; Karar, M.E. COVIDX-Net: A Framework of Deep Learning Classifiers to Diagnose COVID-19 in X-Ray Images. arXiv 2020, arXiv:2003.11055. [Google Scholar]
  90. Hussain, E.; Hasan, M.; Rahman, A.; Lee, I.; Tamanna, T.; Parvez, M.Z. CoroDet: A deep learning based classification for COVID-19 detection using chest X-ray images. Chaos Solitons Fractals 2021, 142, 110495. [Google Scholar] [CrossRef] [PubMed]
  91. Jain, R.; Gupta, M.; Taneja, S.; Hemanth, D.J. Deep learning based detection and analysis of COVID-19 on chest X-ray images. Appl. Intell. 2021, 51, 1690–1700. [Google Scholar] [CrossRef]
  92. Mahdy, L.N.; Ezzat, K.A.; Elmousalami, H.H.; Ella, H.A.; Hassanien, A.E. Automatic X-ray COVID-19 Lung Image Classification System based on Multi-Level Thresholding and Support Vector Machine. MedRxiv 2020, 8, 20047787. [Google Scholar] [CrossRef]
  93. Apostolopoulos, I.D.; Mpesiana, T.A. Covid-19: Automatic detection from X-ray images utilizing transfer learning with convolutional neural networks. Phys. Eng. Sci. Med. 2020, 43, 635–640. [Google Scholar] [CrossRef] [Green Version]
  94. Sethy, P.; Behera, S.K.; Ratha, P.K.; Biswas, P. Detection of Coronavirus Disease (COVID-19) Based on Deep Features and Support Vector Machine. 2020. Available online: www.preprints.org (accessed on 15 January 2022).
  95. Ozturk, T.; Talo, M.; Yildirim, E.A.; Baloglu, U.B.; Yildirim, O.; Acharya, U.R. Automated detection of COVID-19 cases using deep neural networks with X-ray images. Comput. Biol. Med. 2020, 121, 103792. [Google Scholar] [CrossRef]
  96. Khan, A.; Shah, J.L.; Bhat, M.M. CoroNet: A deep neural network for detection and diagnosis of COVID-19 from chest x-ray images. Comput. Methods Programs Biomed. 2020, 196, 105581. [Google Scholar] [CrossRef]
  97. Wang, L.; Wong, A. COVID-Net: A tailored deep convolutional neural network design for detection of COVID-19 cases from chest X-ray images. arXiv 2020, arXiv:2003.09871. [Google Scholar] [CrossRef] [PubMed]
  98. Afshar, P.; Heidarian, S.; Naderkhani, F.; Oikonomou, A.; Plataniotis, K.N.; Mohammadi, A. COVID-CAPS: A capsule network-based framework for identification of COVID-19 cases from X-ray images. Pattern Recognit. Lett. 2020, 138, 638–643. [Google Scholar] [CrossRef] [PubMed]
  99. Yang, D.; Martinez, C.; Visuña, L.; Khandhar, H.; Bhatt, C.; Carretero, J. Detection and analysis of COVID-19 in medical images using deep learning techniques. Sci. Rep. 2021, 11, 1–13. [Google Scholar] [CrossRef] [PubMed]
  100. Nayak, S.R.; Nayak, D.R.; Sinha, U.; Arora, V.; Pachori, R.B. Application of deep learning techniques for detection of COVID-19 cases using chest X-ray images: A comprehensive study. Biomed. Signal Process. Control 2021, 64, 102365. [Google Scholar] [CrossRef] [PubMed]
  101. Bhattacharyya, A.; Bhaik, D.; Kumar, S.; Thakur, P.; Sharma, R.; Pachori, R.B. A deep learning based approach for automatic detection of COVID-19 cases using chest X-ray images. Biomed. Signal Process. Control 2021, 71, 103182. [Google Scholar] [CrossRef]
  102. Deb, S.D.; Jha, R.K.; Jha, K.; Tripathi, P.S. A multi model ensemble based deep convolution neural network structure for detection of COVID19. Biomed. Signal Process. Control 2021, 71, 103126. [Google Scholar] [CrossRef]
  103. Nikolaou, V.; Massaro, S.; Fakhimi, M.; Stergioulas, L.; Garn, W. COVID-19 diagnosis from chest x-rays: Developing a simple, fast, and accurate neural network. Health Inf. Sci. Syst. 2021, 9, 1–11. [Google Scholar] [CrossRef]
  104. Oh, Y.; Park, S.; Ye, J.C. Deep Learning COVID-19 Features on CXR Using Limited Training Data Sets. IEEE Trans. Med. Imaging 2020, 39, 2688–2700. [Google Scholar] [CrossRef]
105. Al-Timemy, A.H.; Khushaba, R.N.; Mosa, Z.M.; Escudero, J. An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-ray Images in Resource Limited Settings. arXiv 2020, arXiv:2007.08223. [Google Scholar]
  106. COVID-19 Radiography Database. Available online: https://www.kaggle.com/tawsifurrahman/covid19-radiography-database (accessed on 1 October 2021).
  107. Tuberculosis (TB) Chest X-ray Database. Available online: https://www.kaggle.com/tawsifurrahman/tuberculosis-tb-chest-xray-dataset (accessed on 1 October 2021).
  108. Kaggle’s Chest X-ray Images (Pneumonia) Dataset. 2020. Available online: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia (accessed on 1 October 2021).
  109. Rahman, T.; Khandakar, A.; Qiblawey, Y.; Tahir, A.; Kiranyaz, S.; Kashem, S.B.A.; Islam, M.T.; Al Maadeed, S.; Zughaier, S.M.; Khan, M.S.; et al. Exploring the effect of image enhancement techniques on COVID-19 detection using chest X-ray images. Comput. Biol. Med. 2021, 132, 104319. [Google Scholar] [CrossRef]
  110. Labeled Optical Coherence Tomography (OCT) and Chest X-ray Images for Classification. Available online: https://data.mendeley.com/datasets/rscbjbr9sj/2 (accessed on 1 August 2021).
  111. Kermany, D.S.; Goldbaum, M.; Cai, W.; Valentim, C.C.S.; Liang, H.; Baxter, S.L.; McKeown, A.; Yang, G.; Wu, X.; Yan, F.; et al. Identifying Medical Diagnoses and Treatable Diseases by Image-Based Deep Learning. Cell 2018, 172, 1122–1131.e9. [Google Scholar] [CrossRef]
  112. Rahman, T.; Khandakar, A.; Kadir, M.A.; Islam, K.R.; Islam, K.F.; Mazhar, R.; Hamid, T.; Islam, M.T.; Kashem, S.; Bin Mahbub, Z.; et al. Reliable Tuberculosis Detection Using Chest X-ray With Deep Learning, Segmentation and Visualization. IEEE Access 2020, 8, 191586–191601. [Google Scholar] [CrossRef]
  113. Skandha, S.S.; Gupta, S.K.; Saba, L.; Koppula, V.K.; Johri, A.M.; Khanna, N.N.; Mavrogeni, S.; Laird, J.R.; Pareek, G.; Miner, M.; et al. 3-D optimized classification and characterization artificial intelligence paradigm for cardiovascular/stroke risk stratification using carotid ultrasound-based delineated plaque: Atheromatic™ 2.0. Comput. Biol. Med. 2020, 125, 103958. [Google Scholar] [CrossRef] [PubMed]
114. Saba, L.; Sanagala, S.S.; Gupta, S.K.; Koppula, V.K.; Johri, A.M.; Sharma, A.M.; Kolluri, R.; Bhatt, D.L.; Nicolaides, A.; Suri, J.S. Ultrasound-based internal carotid artery plaque characterization using deep learning paradigm on a supercomputer: A cardiovascular disease/stroke risk assessment system. Int. J. Cardiovasc. Imaging 2021, 37, 1511–1528. [Google Scholar] [CrossRef] [PubMed]
115. Agarwal, M.; Saba, L.; Gupta, S.K.; Johri, A.M.; Khanna, N.N.; Mavrogeni, S.; Laird, J.R.; Pareek, G.; Miner, M.; Sfikakis, P.P.; et al. Wilson disease tissue classification and characterization using seven artificial intelligence models embedded with 3D optimization paradigm on a weak training brain magnetic resonance imaging datasets: A supercomputer application. Med. Biol. Eng. Comput. 2021, 59, 511–533. [Google Scholar] [CrossRef] [PubMed]
  116. Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef]
  117. Face95. Libor Spacek’s Facial Images Databases. Available online: https://cmp.felk.cvut.cz/~spacelib/faces/faces95.html (accessed on 8 December 2021).
  118. Peng, C.; Liu, Y.; Zhang, X.; Kang, Z.; Chen, Y.; Chen, C.; Cheng, Q. Learning discriminative representation for image classification. Knowl.-Based Syst. 2021, 233, 107517. [Google Scholar] [CrossRef]
  119. Melo-Oliveira, M.E.; Sá-Caputo, D.; Bachur, J.A.; Paineiras-Domingos, L.L.; Sonza, A.; Lacerda, A.C.; Mendonça, V.; Seixas, A.; Taiar, R.; Bernardo-Filho, M. Reported quality of life in countries with cases of COVID-19: A systematic review. Expert Rev. Respir. Med. 2021, 15, 213–220. [Google Scholar] [CrossRef]
  120. Sapkota, N.; Karwowski, W.; Davahli, M.R.; Al-Juaid, A.; Taiar, R.; Murata, A.; Wrobel, G.; Marek, T. The Chaotic Behavior of the Spread of Infection During the COVID-19 Pandemic in the United States and Globally. IEEE Access 2021, 9, 80692–80702. [Google Scholar] [CrossRef]
  121. Davahli, M.R.; Karwowski, W.; Fiok, K.; Murata, A.; Sapkota, N.; Farahani, F.V.; Al-Juaid, A.; Marek, T.; Taiar, R. The COVID-19 Infection Diffusion in the US and Japan: A Graph-Theoretical Approach. Biology 2022, 11, 125. [Google Scholar] [CrossRef]
  122. Zhang, C.; Benz, P.; Argaw, D.M.; Lee, S.; Kim, J.; Rameau, F.; Bazin, J.-C.; Kweon, I.S. ResNet or DenseNet? Introducing Dense Shortcuts to ResNet. In Proceedings of the 2021 IEEE Winter Conference on Applications of Computer Vision (WACV), Waikoloa, HI, USA, 3–8 January 2021; pp. 3549–3558. [Google Scholar]
  123. He, G.; Ping, A.; Wang, X.; Zhu, Y. Alzheimer’s disease diagnosis model based on three-dimensional full convolutional DenseNet. In Proceedings of the 2019 10th International Conference on Information Technology in Medicine and Education (ITME), Qingdao, China, 23–25 August 2019; pp. 13–17. [Google Scholar]
  124. Ruiz, J.; Mahmud, M.; Modasshir; Kaiser, M.S.; Initiative, F. 3D DenseNet Ensemble in 4-Way Classification of Alzheimer’s Disease. In Proceedings of the Brain Informatics: 13th International Conference, BI 2020, Padua, Italy, 19 September 2020; Springer: Cham, Switzerland, 2020; pp. 85–96. [Google Scholar]
125. Iandola, F.; Moskewicz, M.; Karayev, S.; Girshick, R.; Darrell, T.; Keutzer, K. DenseNet: Implementing efficient ConvNet descriptor pyramids. arXiv 2014, arXiv:1404.1869. [Google Scholar]
  126. Sudeep, P.; Palanisamy, P.; Rajan, J.; Baradaran, H.; Saba, L.; Gupta, A.; Suri, J.S. Speckle reduction in medical ultrasound images using an unbiased non-local means method. Biomed. Signal Process. Control 2016, 28, 1–8. [Google Scholar] [CrossRef]
  127. Sanagala, S.S.; Nicolaides, A.; Gupta, S.K.; Koppula, V.K.; Saba, L.; Agarwal, S.; Johri, A.M.; Kalra, M.S.; Suri, J.S. Ten Fast Transfer Learning Models for Carotid Ultrasound Plaque Tissue Characterization in Augmentation Framework Embedded with Heatmaps for Stroke Risk Stratification. Diagnostics 2021, 11, 2109. [Google Scholar] [CrossRef] [PubMed]
  128. Kusakunniran, W.; Borwarnginn, P.; Sutassananon, K.; Tongdee, T.; Saiviroonporn, P.; Karnjanapreechakorn, S.; Siriapisith, T. COVID-19 detection and heatmap generation in chest x-ray images. J. Med. Imaging 2021, 8, 014001. [Google Scholar] [CrossRef] [PubMed]
  129. Liang, S.; Liu, H.; Gu, Y.; Guo, X.; Li, H.; Li, L.; Wu, Z.; Liu, M.; Tao, L. Fast automated detection of COVID-19 from medical images using convolutional neural networks. Commun. Biol. 2021, 4, 1–13. [Google Scholar] [CrossRef]
  130. Kuppili, V.; Biswas, M.; Sreekumar, A.; Suri, H.S.; Saba, L.; Edla, D.R.; Marinhoe, R.T.; Sanches, J.M.; Suri, J.S. Extreme learning machine framework for risk stratification of fatty liver disease using ultrasound tissue characterization. J. Med. Syst. 2017, 41, 1–20. [Google Scholar] [CrossRef] [PubMed]
  131. Rajaraman, S.; Siegelman, J.; Alderson, P.O.; Folio, L.S.; Folio, L.R.; Antani, S.K. Iteratively Pruned Deep Learning Ensembles for COVID-19 Detection in Chest X-Rays. IEEE Access 2020, 8, 115041–115050. [Google Scholar] [CrossRef] [PubMed]
  132. He, Y.; Zhang, X.; Sun, J. Channel Pruning for Accelerating Very Deep Neural Networks. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017. [Google Scholar]
  133. Adedigba, A.P.; Adeshina, S.A.; Aina, O.E.; Aibinu, A.M. Optimal hyperparameter selection of deep learning models for COVID-19 chest X-ray classification. Intell. Med. 2021, 5, 100034. [Google Scholar] [CrossRef]
  134. Zandehshahvar, M.; van Assen, M.; Maleki, H.; Kiarashi, Y.; De Cecco, C.N.; Adibi, A. Toward understanding COVID-19 pneumonia: A deep-learning-based approach for severity analysis and monitoring the disease. Sci. Rep. 2021, 11, 1–10. [Google Scholar] [CrossRef]
  135. El-Baz, A.; Gimel’farb, G.; Suri, J.S. Stochastic Modeling for Medical Image Analysis; CRC Press: Boca Raton, FL, USA, 2015. [Google Scholar]
Figure 1. Overall schematic diagram of the proposed method for the multiclass scenario using seven deep learning models.
Figure 2. Sample chest X-ray images from each class.
Figure 3. (a) VGG16 transfer learning architecture. (b) VGG19 transfer learning architecture. (c) DenseNet201 transfer learning architecture. (d) Xception transfer learning architecture. (e) InceptionV3 transfer learning architecture. (f) NasnetMobile transfer learning architecture. (g) ResNet152 transfer learning architecture.
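Figure 3 shows the pattern shared by all seven networks: an ImageNet-pretrained convolutional base with a newly trained classification head. As a point of reference only, a minimal Keras sketch of a VGG16-style transfer-learning model follows; the 224 × 224 input size, head width, and dropout rate are illustrative assumptions, not values reported in this paper.

```python
# Minimal transfer-learning sketch in the spirit of Figure 3a.
# Input size, head width, and dropout rate are assumptions.
from tensorflow.keras import layers, models
from tensorflow.keras.applications import VGG16

NUM_CLASSES = 5  # two-, three-, or five-class experiments

# ImageNet-pretrained convolutional base, frozen so only the new head trains.
base = VGG16(weights="imagenet", include_top=False, input_shape=(224, 224, 3))
base.trainable = False

model = models.Sequential([
    base,
    layers.Flatten(),
    layers.Dense(256, activation="relu"),  # illustrative head width
    layers.Dropout(0.5),
    layers.Dense(NUM_CLASSES, activation="softmax"),
])
model.compile(optimizer="adam",
              loss="categorical_crossentropy",
              metrics=["accuracy"])
```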
Figure 4. Training and validation accuracy of the best-performing VGG16 network for the COVID-19 and normal classes.
Figure 5. Training and validation loss of the best-performing VGG16 network for the COVID-19 and normal classes.
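Curves such as those in Figures 4 and 5 are typically produced from the History object returned by Keras training. A minimal sketch, assuming `model.fit` was run with validation data and an accuracy metric:

```python
# Plot training/validation accuracy and loss from a Keras History object.
import matplotlib.pyplot as plt

def plot_curves(history):
    for metric in ("accuracy", "loss"):
        plt.figure()
        plt.plot(history.history[metric], label=f"training {metric}")
        plt.plot(history.history[f"val_{metric}"], label=f"validation {metric}")
        plt.xlabel("Epoch")
        plt.ylabel(metric.capitalize())
        plt.legend()
        plt.show()
```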
Figure 6. Confusion matrix for the classification into COVID-19 and normal by VGG16.
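A confusion matrix such as Figure 6 can be derived from the model predictions on the held-out test set. A small sketch, assuming integer class labels and a trained Keras model (the names here are placeholders):

```python
# Confusion matrix from model predictions; rows are true classes,
# columns are predicted classes.
import numpy as np
from sklearn.metrics import confusion_matrix

def test_confusion_matrix(model, x_test, y_test):
    y_pred = np.argmax(model.predict(x_test), axis=1)  # predicted class index
    return confusion_matrix(y_test, y_pred)
```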
Figure 7. Training and validation accuracy of the best-performing NasnetMobile model for the COVID-19 and viral pneumonia classes.
Figure 8. Training and validation loss of the best-performing NasnetMobile model for the COVID-19 and viral pneumonia classes.
Figure 9. Confusion matrix for the classification into COVID-19 and viral pneumonia by NasnetMobile.
Figure 10. Training and validation accuracy of the best-performing DenseNet201 model for the COVID-19 and bacterial pneumonia classes.
Figure 11. Training and validation loss of the best-performing DenseNet201 model for the COVID-19 and bacterial pneumonia classes.
Figure 12. Confusion matrix for the classification into COVID-19 and bacterial pneumonia by DenseNet201.
Figure 13. Training and validation accuracy of the best-performing VGG16 model for the COVID-19 and tuberculosis classes.
Figure 14. Training and validation loss of the best-performing VGG16 model for the COVID-19 and tuberculosis classes.
Figure 15. Confusion matrix for the classification into COVID-19 and tuberculosis by VGG16.
Figure 16. Training and validation accuracy of the best-performing VGG16 model for the three-class experiment.
Figure 17. Training and validation loss of the best-performing VGG16 model for the three-class experiment.
Figure 18. Confusion matrix for three-class classification by VGG16.
Figure 19. Training and validation accuracy of the best-performing VGG16 model for the five-class experiment.
Figure 20. Training and validation loss of the best-performing VGG16 model for the five-class experiment.
Figure 21. Confusion matrix for five-class classification by VGG16.
Figure 22. ROC curves and AUC values for two-class experiments: (a) COVID-19 and normal by VGG16; (b) COVID-19 and viral pneumonia by NasnetMobile; (c) COVID-19 and bacterial pneumonia by DenseNet201; (d) COVID-19 and tuberculosis by VGG16.
Figure 23. ROC curves and AUC values for the three-class experiment by VGG16.
Figure 24. ROC curves and AUC values for the five-class experiment by VGG16.
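The multiclass curves in Figures 23 and 24 follow the standard one-vs-rest construction, with one ROC curve and one AUC per class. A sketch, assuming one-hot ground-truth labels and softmax outputs:

```python
# One-vs-rest ROC/AUC for a multiclass classifier.
import matplotlib.pyplot as plt
from sklearn.metrics import roc_curve, auc

def plot_multiclass_roc(y_true_onehot, y_prob, class_names):
    plt.figure()
    for i, name in enumerate(class_names):
        # Treat class i as positive and all other classes as negative.
        fpr, tpr, _ = roc_curve(y_true_onehot[:, i], y_prob[:, i])
        plt.plot(fpr, tpr, label=f"{name} (AUC = {auc(fpr, tpr):.2f})")
    plt.plot([0, 1], [0, 1], "k--")  # chance line
    plt.xlabel("False positive rate")
    plt.ylabel("True positive rate")
    plt.legend()
    plt.show()
```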
Figure 25. Sample images from the first eight classes of the Faces95 database.
Figure 26. Training and validation accuracy of the best-performing VGG16 model for the Faces95 images.
Figure 27. Training and validation loss of the best-performing VGG16 model for the Faces95 images.
Table 1. Experimental steps and class-wise distribution of chest X-ray images.
Experimental Steps Normal COVID-19 Viral Pneumonia Bacterial Pneumonia Tuberculosis Total
Training 8133 2887 1075 2224 560 14,879
Validation 1017 362 135 278 70 1862
Testing 1017 362 135 278 70 1862
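The counts in Table 1 correspond to roughly an 80/10/10 train/validation/test division (14,879, 1862, and 1862 images, respectively). A sketch of loading such splits with Keras generators follows; the directory layout (data/train, data/val, data/test, one subfolder per class) is a hypothetical convention, not the paper's actual file structure:

```python
# Load class-labeled image folders for training, validation, and testing.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

gen = ImageDataGenerator(rescale=1.0 / 255)  # normalize pixel values to [0, 1]
splits = {
    name: gen.flow_from_directory(
        f"data/{name}",             # hypothetical directory layout
        target_size=(224, 224),
        class_mode="categorical",
        shuffle=(name == "train"),  # keep evaluation order deterministic
    )
    for name in ("train", "val", "test")
}
```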
Table 2. Weighted average of performance metrics by different deep learning models for COVID-19 and normal classification.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 97.24 97.26 97.24 97.21
VGG19 94.85 94.94 94.85 94.72
Xception 88.69 90.03 88.69 87.58
InceptionV3 93.33 93.32 93.33 93.32
DenseNet201 96.01 96.00 96.01 95.96
NasnetMobile 92.39 92.60 92.39 92.06
ResNet152 78.75 82.85 78.75 73.02
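Tables 2–8 report weighted averages, i.e., each class's metric is weighted by its number of test samples. A sketch of computing these values with scikit-learn, assuming integer true and predicted labels:

```python
# Weighted-average accuracy, precision, recall, and F1-score.
from sklearn.metrics import accuracy_score, precision_recall_fscore_support

def weighted_metrics(y_true, y_pred):
    # average="weighted" weights each class by its support (sample count).
    precision, recall, f1, _ = precision_recall_fscore_support(
        y_true, y_pred, average="weighted")
    return {"Accuracy": accuracy_score(y_true, y_pred),
            "Precision": precision, "Recall": recall, "F1-Score": f1}
```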
Table 3. Weighted average of performance metrics by different deep learning models for COVID-19 and viral pneumonia classification.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 99.60 99.60 99.60 99.60
VGG19 99.20 99.20 99.20 99.19
Xception 99.40 99.40 99.40 99.40
InceptionV3 98.99 99.01 98.99 99.00
DenseNet201 99.40 99.40 99.40 99.40
NasnetMobile 99.80 99.80 99.80 99.80
ResNet152 97.79 97.80 97.79 97.77
Table 4. Weighted average of performance metrics by different deep learning models for COVID-19 and bacterial pneumonia classification.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 99.22 99.22 99.22 99.22
VGG19 98.75 98.76 98.75 98.75
Xception 99.06 99.08 99.06 99.06
InceptionV3 99.53 99.53 99.53 99.53
DenseNet201 99.84 99.84 99.84 99.84
NasnetMobile 99.53 99.53 99.53 99.53
ResNet152 98.59 98.60 98.59 98.59
Table 5. Weighted average of performance metrics by different deep learning models for COVID-19 and tuberculosis classification.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 99.31 99.31 99.31 99.30
VGG19 99.07 99.07 99.07 99.07
Xception 99.07 99.07 99.07 99.07
InceptionV3 98.38 98.47 98.38 98.40
DenseNet201 98.84 98.88 98.84 98.85
NasnetMobile 93.75 95.15 93.75 94.09
ResNet152 91.20 92.25 91.20 91.56
Table 6. Weighted average of performance metrics by different deep learning models for three-class classification into COVID-19, viral pneumonia, and normal.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 96.63 96.63 96.63 96.63
VGG19 91.94 92.49 91.94 91.63
Xception 91.68 91.64 91.68 91.54
InceptionV3 92.54 92.47 92.54 92.43
DenseNet201 95.51 95.61 95.51 95.44
NasnetMobile 92.93 93.32 92.93 92.96
ResNet152 77.21 84.70 77.21 78.57
Table 7. Weighted average of performance metrics by different deep learning models for five-class classification into COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 92.70 92.41 92.70 92.47
VGG19 89.04 90.37 89.04 87.00
Xception 83.35 84.83 83.35 80.61
InceptionV3 84.00 85.54 84.00 83.44
DenseNet201 89.10 89.80 89.10 88.42
NasnetMobile 87.76 88.05 87.76 86.65
ResNet152 74.70 76.80 74.70 71.60
Table 8. Weighted average of performance metrics by different deep learning networks for facial image classification.
CNN Models Accuracy (%) Precision (%) Recall (%) F1-Score (%)
VGG16 98.61 99.07 98.61 98.52
VGG19 96.53 97.45 96.53 96.34
Xception 93.06 93.75 93.06 92.18
InceptionV3 95.83 97.22 95.83 95.56
DenseNet201 96.53 97.69 96.53 96.30
NasnetMobile 93.06 95.60 93.06 92.82
ResNet152 75.69 76.50 75.69 80.13
Table 9. Benchmarking table showing state-of-the-art methods and comparing them against the proposed model.
Author and Year | Method and Models | Number of Images Used | Two-Class Accuracy | Three-Class 2 Accuracy | Four-Class 3 Accuracy | Five-Class 4 Accuracy | AUC 1
Nayak et al. (2020) [100] | Method: CNN with transfer learning; Model: ResNet-34 | C 5: 203; Total: 406 | C & N 6: 98.33% | NA 7 | NA | NA | C & N: 0.98
Chowdhury et al. (2020) [88] | Method: CNN with transfer learning; Model: CheXNet | C: 423; Total: 3487 | NA | 97.74% | NA | NA | NA
Jain et al. (2020) [91] | Method: CNN with transfer learning; Model: Xception | C: 490; Total: 6432 | NA | 97.97% | NA | NA | NA
Bhattacharyya et al. (2021) [101] | Method: ML 8 + DL 9; DL model: VGG-19; ML model: Random Forest | C: 342; Total: 1029 | NA | 96.6% | NA | NA | NA
Nikolaou et al. (2021) [103] | Method: CNN with transfer learning; Model: EfficientNetB0 | C: 3616; Total: 15,153 | C & N: 95% | 93% | NA | NA | NA
Yang et al. (2021) [99] | Method: CNN with transfer learning; Model: VGG16 | C: 3616; Total: 8461 | C & N: 98%; C & VP 10: 99% | 97% | NA | NA | NA
Khan et al. (2020) [96] | Method: deep learning; Model: CoroNet (novel CNN) | C: 284; Total: 1251 | NA | 95% | 89.6% | NA | NA
Hussain et al. (2020) [90] | Method: deep learning; Model: CoroDet (novel CNN) | C: 500; Total: 2100 | C & N: 99.1% | 94.2% | 91.2% | NA | NA
Oh et al. (2020) [104] | Method: CNN with transfer learning; Model: ResNet-18 | C: 180; Total: 502 | NA | NA | 88.9% | NA | NA
Al-Timemy et al. (2021) [105] | Method: ML + DL; DL model: ResNet-50; ML model: ESD 11 | C: 435; Total: 2186 | NA | NA | NA | 91.6% | NA
Proposed work (Nillmani et al.) | Method: CNN with transfer learning; Models: VGG16, NasnetMobile, DenseNet201 | C: 3611; Total: 18,603 | C & N: 97.24% 12; C & VP: 99.80% 13; C & BP 14: 99.84% 15; C & T 16: 99.31% 12 | 96.63% 12 | NA | 92.70% 12 | C & N: 0.95 12; C & VP: 1.0 13; C & BP: 1.0 15; C & T: 0.98 12; Three-class 2: 0.97 12; Five-class 4: 0.92 12
1 Area under the ROC curve; 2 COVID-19, viral pneumonia, and normal; 3 COVID-19, viral pneumonia, bacterial pneumonia, and normal; 4 COVID-19, viral pneumonia, bacterial pneumonia, tuberculosis, and normal; 5 COVID-19; 6 Normal; 7 Not applicable, as the authors did not perform this type of experiment; 8 Machine learning; 9 Deep learning; 10 Viral pneumonia; 11 Ensemble subspace discriminant; 12 Achieved by VGG16; 13 Achieved by NasnetMobile; 14 Bacterial pneumonia; 15 Achieved by DenseNet201; 16 Tuberculosis.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
