Startseite Histopathological Image Segmentation Using Modified Kernel-Based Fuzzy C-Means and Edge Bridge and Fill Technique
Artikel Open Access

Histopathological Image Segmentation Using Modified Kernel-Based Fuzzy C-Means and Edge Bridge and Fill Technique

  • Faiz Mohammad Karobari EMAIL logo und Hosahally Narayangowda Suresh
Veröffentlicht/Copyright: 20. März 2019
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

Histopathological lung cancer segmentation using region of interest is one of the emerging research area in the field of health monitoring system. In this paper, the histopathological images were collected from the database Stanford Tissue Microarray Database (TMAD). After image collection, pre-processing was performed using a normalization technique, which enhances the quality of the histopathological image by eliminating unwanted noise. After pre-processing, segmentation was carried out using the modified kernel-based fuzzy c-means clustering (KFCM) approach along with the edge bridge and fill technique (EBFT). It was a flexible high-level machine learning technique to localize the object in a complex template. The experimental result shows that the proposed approach segments the normal and abnormal cancer regions by means of precision, recall, specificity, accuracy, and Jaccard coefficient. The proposed methodology improved the classification accuracy in lung cancer segmentation up to 2.5–5% compared to the existing methods deep convolutional neural network (DCNN) and diffusion-weighted approach.

Mathematics Subject Classification: 68U10; 92C50

1 Introduction

Lung cancer is one of the predominant cancer types, which causes more than 1.4 million deaths annually [19]. Generally, there are two types of lung cancers: the non-small cell lung cancer (NSCLC) with 80%–85% of lung cancer patients and small cell lung cancer (SCLC) with 10%–15% of lung cancer patients [17]. The evaluation of microscopic histopathology slides by experienced pathologists is indispensable for establishing the diagnosis and to define the subtypes and types of lung cancers, which includes two major types of NSCLC: adenocarcinoma and squamous cell carcinoma [5], [10], [23], [26]. The distinction of squamous cell carcinoma from adenocarcinoma is essential for chemotherapeutic selection because certain antineoplastic agents are contraindicated for squamous cell carcinoma patients, due to decreased efficacy or increased toxicity [11], [12]. However, the qualitative evaluation of well-known histopathology pattern (cancer grade classification) is insufficient to predict the survival rate of patients with lung adenocarcinoma, and only the best-characterized histopathology features achieve the modest agreements among experienced pathologists [20], [24]. Poor cancer differentiation and slide quality are related with diagnostic agreement [6]. Recently, several researches were carried out to describe the additional visual features for prognostic detection of patients with lung adenocarcinoma [3], [15]. Still, there is considerable improvement needed for the inter-rater agreements of these features [8]. Erroneous or subjective evaluation of the histopathology image leads to poor therapeutic choice that results in decreased survival and loss of life quality in several patients [16]. Unsupervised image processing technology shows high accuracy, consistency, and efficiency in histopathology evaluations and also provides decision support for ensuring the diagnostic consistency. Automated histopathological analysis is valuable in prognostic determinations of various malignancies [9], [22].

In this research, histopathology lung images were collected from the dataset TMAD. After the collection of histopathology images, a new segmentation methodology was implemented along with the edge bridge and fill technique (EBFT) for histopathological lung cancer segmentation. Currently, most of the segmentation algorithms are working based on Euclidean distance, which helps to identify the similarity between the data objects. Euclidean distance is computationally inexpensive and simple, but it is sensitive to outliers and perturbations. Currently, with the popular usage of support vector machine, a new direction appears to use kernel functions. The kernel function projects the data into high-dimensional space, where data can be easily separated. To perform this operation, kernel trick is adopted for transforming the linear algorithm into non-linear algorithm using a dot operator. The kernel-based FCM works well only if the clusters are round shaped, and it works badly with non-convex shapes, due to the hard assignments of points to the clusters. To address this concern, a new clustering methodology was proposed (modified kernel-based fuzzy c-means clustering (KFCM)) for histopathological lung cancer segmentation. In this experimental research, the Euclidean distance measure was replaced with correlation distance measure because it includes numerous advantages. The distribution of correlation between the random vectors becomes narrowly focused around zero as the dimensionality increases, so the significance of small correlation grows with increasing dimensionality. Also, it is very effective in capturing the similarity between the patterns of feature vectors. Finally, the segmented output was compared with the existing methodologies by means of precision, recall, specificity, accuracy, and Jaccard coefficient.

This research paper is organized as follows. Section 2 surveys several recent papers on histopathological lung cancer detection. In Section 3, an effective approach (modified KFCM with EBFT) for histopathological lung cancer segmentation is presented. Section 4 shows the quantitative and comparative analysis of proposed and existing segmentation approaches. The conclusion is made in Section 5.

2 Literature Review

Researchers suggested several techniques for the detection of histopathological lung cancer. A brief evaluation of some important contributions to the existing literatures is presented in this section.

Xing and Yang [21] presented a new segmentation methodology, which combines an effective local repulsive balloon snake deformable approach (bottom–up) and a robust selection-based sparse shape approach (top–down) to tackle the segmentation concerns. The developed approach extensively tested on 62 cases with 6000 tumor cells. The experimental outcome confirms that the developed methodology delivers better segmentation performance compared to the existing methods. While performing with semi-supervised methodologies, the semantic gap was maximized between the feature values, which leads to poor segmentation rate.

Coudray et al. [4] developed an effective classification methodology: deep learning convolutional neural network for classifying the whole-slide histopathology images into squamous cell carcinoma and adenocarcinoma. The advantage of developed methodology was it was fully automated; there was no user intervention for segmenting the normal and cancer tissues, so the time consumption of detecting the cancer cell was very less. Extensive experiments were carried out on real-time histopathological images for demonstrating the robustness of the proposed scheme. The developed technique was less suitable for recognizing the inclined histopathology images because it took a high response time and identification rate.

Zhang et al. [25] developed an effective segmentation methodology: Gaussian-based hierarchical voting and repulsive balloon approach for classifying the lung cancers as adenocarcinoma and squamous carcinoma. The developed methodology delivers an effective histopathological lung cancer segmentation in the spatial localization and structural composition. The experiment was carried out on publicly available database, Cancer Genome Atlas (TCGA), in order to validate the developed method accuracy, robustness, and segmentation speed. In some cases, the training data were supervised evaluation or manual adjustment, which is needed to be automated.

Khosravi, et al. [13] presented an effective computational approach based on the convolution neural network for classifying dissimilar histopathology images across dissimilar types of cancer. In this literature, the developed machine learning approach splits the cancer cells from normal tissues and also handles the cell appearance variations. The developed algorithm was tested on different histopathology images with different variations in appearance and shapes. The experimental outcome shows that the developed histopathology cancer cell detection methodology was very effective compared to the existing methodologies. The developed methodology would fail, when the growth of the cancer cell boundaries was constrained.

Vu et al. [18] developed a new feature discovery methodology: discriminative feature-oriented dictionary learning (DFDL) approach for disease grading and classification in histopathology. In this literature, the developed dictionary learning approach was tested on three challenging image datasets: animal diagnostics lab dataset, cancer genome atlas dataset, and intraductal breast lesions histopathological images. The experimental performance suggested that the developed approach was robust and effective in comparison with existing approaches. In a large dataset, DFDL failed to achieve better segmentation by means of accuracy.

An unsupervised algorithm (modified KFCM with EBFT) is implemented for enhancing the performance of histopathological lung cancer segmentation and to overcome the above-mentioned drawbacks.

3 Proposed Methodology

The proposed methodology for segmenting the normal and abnormal regions from lung histopathological image is divided into three major steps: image collection, image pre-processing, and feature extraction. The workflow of the proposed histopathological lung cancer detection system is represented in Figure 1. The brief description about the proposed technique is described below.

Figure 1: Workflow of the proposed system.
Figure 1:

Workflow of the proposed system.

3.1 Image Collection

In the initial stage of histopathological lung cancer segmentation, histopathological images are taken from the standard benchmark dataset: TMAD. This dataset contains 205,161 images, which archives 349 distinct probes on 1488 tissue micro-array slides. In that, 31,306 histopathological images for 68 probes on 125 slides are released to the public. The TMAD combines the NCI thesaurus ontology for probing tissues in the cancer domain. For lung cancer, the TMAD database contains one normal image, 220 adenocarcinoma images, and 68 squamous histopathological images. The sample collected histopathological images of normal, adenocarcinoma, and squamous are represented in Figure 2A–C.

Figure 2: Sample images collected from the TMAD database.
(A) Normal image, (B) adenocarcinoma image, (C) squamous image.
Figure 2:

Sample images collected from the TMAD database.

(A) Normal image, (B) adenocarcinoma image, (C) squamous image.

After obtaining the histopathological images, an important step: region of interest (ROI) is carried out on the collected images. The ROI is defined as a subset of histopathological image or a database identified for a specific purpose. Therefore, the size of the selected ROI is 256 × 256, which is one-fourth of the original image 1024 × 1024. The sample ROI applied histopathological images of normal, adenocarcinoma, and squamous are denoted in the Figure 3A–C.

Figure 3: ROI applied images.
(A) ROI-applied normal image, (B) ROI-applied adenocarcinoma image, (C) ROI-applied squamous image.
Figure 3:

ROI applied images.

(A) ROI-applied normal image, (B) ROI-applied adenocarcinoma image, (C) ROI-applied squamous image.

3.2 Pre-processing of Histopathological Image

The ROI-applied histopathological images are used for pre-processing. In this research, normalization is performed on the histopathological image for enhancing or de-noising the images. Most of the histopathological images are collected or captured from the equipment’s, whereas the collected histopathological images majorly consist of two noises: impulse noise and machinery noise (electrical and mechanical noises) [1]. The normalization methodology is very effective to remove impulse and machinery noise and also helps to enhance the image quality significantly.

Normalization modifies the pixel intensity range values to improve the quality of images by reducing the noise from histopathology images. Then, the deformation and alternations that occurred in the histopathological image by inaccurate image capture were evaluated. Usually, image normalization contains pixel intensity variations. The entire histopathological images are converted into pre-defined values because it is a pixel-wise procedure. The general formula of image normalization approach is denoted in equation (1).

(1) In=(IMin)+newMaxnewMinMaxMin+newMin

where I is represented as the original histopathology image, In is stated as the new image, (Min=0,Max=255) is specified as the intensity range of the original image, and (newMin, newMax) is stated as the intensity range of the new histopathology image. The sample normalized histopathology images of normal, adenocarcinoma, and squamous are represented in Figure 4A–C.

Figure 4: Preprocessed image.
(A) Normalized normal image, (B) normalized adenocarcinoma image, (C) normalized squamous image.
Figure 4:

Preprocessed image.

(A) Normalized normal image, (B) normalized adenocarcinoma image, (C) normalized squamous image.

3.3 Histopathological Image Segmentation

The normalized histopathological images are used for segmentation; an effective methodology modified KFCM is undertaken for segmenting the normal and abnormal regions of histopathological images. Generally, image segmentation is the procedure of sub-dividing an image into different regions that are homogeneous with respect to some image features. The aim of image segmentation is to extract and detect the particular region from an image [14]. In this scenario, consider I as an input histopathological image that consists of a set of pi color images at pixel i(i=1,2,,N) and P={p1,p2,p3p}Rk, respectively, in the k-dimensional area. The cluster centers in the histopathological images are denoted as Q={q1,q2,q3qc}, where c is said to be a positive integer (2<cN), and uij is the membership value for each pixel i in the j-th cluster (j=1,2,c). The clusters formed in the image space are combined by assigning a separate membership value to all pixels in the KFCM algorithm. The objective function or general equation of KFCM is written in equation (2).

(2) JKFCM=i=1Nj=1cuijmpiqj2,1m<

where m is an exponent of regularization to the degree of fuzziness, m > 1, and piqj2 is the grayscale Euclidean distance between i and qj, which is stated in equation (3).

(3) j=1cuij=1,uij[0,1],0i=1NuijN

Utilizing the membership function from the alternate optimization, the cluster centers are updated iteratively using equations (4) and (5).

(4) uij=1k=1c(piqj2/piqk2)1/(m1)
(5) qj=i=1Nuijmpii=1Nuijm

The presence of noise is decreased by adding the spatial information of neighboring pixels that is denoted in equation (6).

(6) JKFCMS=i=1Nj=1cuijmpiqj2+αNRi=1Nj=1cuijm(rNiprqj2)

where α denotes a spatial information, Ni and Nr are defined as the set of pixel and cardinality of the pixels employed in the system. To avoid the neighborhood function, the term 1NRrNipiqj2 is replaced with p´iqj2, where, is a color scale-filtered image, and the Euclidean distance is replaced by the correlation distance measure.

The updated equation is represented in equation (7):

(7) JKFCMS(1,2)=i=1Nj=1cuijmpiqj2+αi=1Nj=1cuijm(pi´qj2)

In this research, a modified KFCM is proposed, which calculates the parameter ηj at every step of the iterations to replace α for every cluster [7]. The correlation function is used to calculate the parameter value, which is represented in equation (8).

(8) ηj=minjj(1C(qj,qj))maxk(1C(qk,p´))

Here, C is a correlation distance measure or correlation function. The general identification of C requires a large number of patterns, and also, many cluster centers are required to find the optimal value for ηj. To overcome this problem, the combination of spatial context and scale information are made using a fuzzy factor. The fuzzy factor Fij is included in the objective function of the KFCM, which is stated in equation (9).

(9) JCKFCM=i=1Nj=1c[u|ijm|piqj2+Fij]

Then, the altered fuzzy factor Fij is derived using equation (10).

(10) Fij=CNi,ikwik(1uij)m

This altered fuzzy controls the local neighbor relationship and replaces the distance with a correlation function, where wik denotes the fuzzy factor i, and 1C(pi,qj) denotes correlation metric function. The segmented histopathological images of normal, adenocarcinoma, and squamous are given in Figure 5A–C.

Figure 5: Segmented image.
(A) Segmented normal image, (B) segmented adenocarcinoma image, (C) segmented squamous image.
Figure 5:

Segmented image.

(A) Segmented normal image, (B) segmented adenocarcinoma image, (C) segmented squamous image.

3.3.1 Pseudo Code of Modified KFCM

In this section, the pseudo code for the modified KFCM is presented. In the modified KFCM, a few changes are carried out in step 5, compared to the conventional KFCM. Finally, the iteration stops and returns after calculating the missing value using equation (10).

  1. Fix the number of clusters c, and m > 1 for some positive constant.

  2. Set pi=0, if pi is a missing feature.

  3. Update all membership function uij using equation (4).

  4. Update all prototypes qj using equation (5).

  5. Replace the Euclidean distance with the correlation function C.

  6. Calculate the missing value using equation (10).

  7. End

3.4 Edge Bridge and Fill Technique

After histopathology cell segmentation, the EBFT is utilized to bridge the gaps in the edges of the histopathology lung image for separating the overlapped nuclei and non-nuclei cells. The step-by-step procedure of the EBFT is described below.

  1. Initially, the extreme boundary line of the overlapped cell is kept, and all the inner portions were removed. The overlapped cell region is connected to the background black region because it is necessary to distinguish the overlapped cell region from the background. The overlapped cell edge needs to be closed, so that the cell region can discriminate from the background.

  2. To close the open-edge portion of the cell region, an iterative thinning (skeletenization) and thickening (dilation) morphology is utilized in this research work. Dilation is carried out with a square structural element or disk of size four. The dilation of histopathology image A by the structuring element B is described using equation (11).

    (11) AxorB=bBAb
  3. In this research work, the size of the structural element is 12. All overlapped cells in the histopathology image segmented are shown in Figure 6. To make the segmentation of the overlapped cell more reliable and robust for any size of cell, it is better to use the small-size structural element rather than use the large structural element. In this experimental research, the disk structural element with radius 4 is utilized followed by skeletenization iteratively three times. This procedure depends on the number of iterations that happens after the closing of the overlapped cell edge; hence, the maximum limit of iterations does not affect the reliability of the overlapped cell segmentation process.

  4. The dilated thick line from equation (11) is skeletonized by eliminating the pixels from each side with respect to the central pixels of the line. This procedure makes the boundary lines more sharp with less gap. Then, the sharp boundary line is added to the overlapped cell without any gap.

  5. The holes in the overlapped cells are filled as shown in Figure 6. In the filled cell image, the overlapped nuclei and non-nuclei cells are effectively distinguished.

Figure 6: Sample image of the EBFT.
Figure 6:

Sample image of the EBFT.

The labeling is utilized to better distinguish the overlapped nuclei cells and non-nuclei cells. The sample-labeled image is denoted in Figure 7.

Figure 7: Sample-labeled image.
Figure 7:

Sample-labeled image.

4 Experimental Analysis

For experimental simulation, MATLAB (version 2017a) was employed on a PC with 3.2 GHz with i5 processor [2]. In order to estimate the efficiency of the proposed algorithm, the performance of the proposed method was compared with the diffusion-weighted approach [22] and DCNN [13] on a database: TMAD. The performance of the proposed methodology was compared by means of precision, recall, specificity, accuracy, and Jaccard coefficient.

4.1 Performance Measure

Performance measure is defined as the regular measurement of the outcomes and results that develops a reliable information about the effectiveness and efficiency of the proposed system. Also, it is the procedure of reporting, collecting, and analyzing information about the performance of a group or individual. The general formula for calculating the precision, recall, specificity, and accuracy of lung cancer detection is mathematically represented in equations (12)–(15).

(12) Precision=TP(FP+TP)×100
(13) Recall=TP(FN+TP)×100
(14) Specificity=TNFP+TN×100
(15) Accuracy=TP+TNTN+TP+FN+FP×100

Additionally, for segmentation validation, the Jaccard coefficient is expressed in terms of TP, TN, FP, and FN counts, which is obtained by matching the segmented result to the ground truth image. The general formula utilized to calculate the Jaccard coefficient is represented in equation (16).

(16) Jaccard coefficient=TP(FP+TP+FN)×100

where FP is represented as false positive, TP is denoted as true positive, FN is indicated as false negative, and TN is specified as true negative.

4.2 Quantitative Analysis on TMAD Dataset

In this experimental analysis, the TMAD dataset is used for comparing the performance evaluation of the proposed approach and the existing segmentation methods. In Table 1, performance evaluation of the proposed method (modified KFCM) and the segmentation methods (FCM and KFCM) are validated by means of precision, recall, and Jaccard coefficient. The TMAD dataset contains three classes of histopathology lung images: normal, adenocarcinoma, and squamous. Here, the performance evaluation is validated for one sample image in each class with two random multi-region ROIs. The validation result shows that the proposed method outperformed the existing methodologies by means of precision, recall, and Jaccard coefficient.

Table 1:

Precision, recall, and Jaccard coefficient comparison of the proposed and existing methods.

Images Clustering TP TN FP FN Precision (%) Recall (%) Jaccard
technique coefficient (%)
FCM 3480 59,178 2520 130 58 96.3989 56.77
KFCM 3611 60,181 2380 100 60.2737 97.3053 59.2842
Modified KFCM 4801 72,681 1947 35 71.14 99.28 70.70
FCM 3821 65,646 3326 186 53.463 95.3581 52.1069
KFCM 4018 67,706 3256 122 55.2378 97.0531 54.3267
Modified KFCM 4724 77,734 2853 63 62.3466 98.6839 61.83
FCM 3436 50,637 2419 92 58.6849 97.3923 57.777
KFCM 3546 51,689 2293 85 60.7296 97.659 59.8582
Modified KFCM 4556 59,711 1816 13 71.5003 99.7155 71.35

For the collected histopathological image, the mean or average precision of the proposed technique is: for the modified KFCM, 68.32%, and the existing methodologies FCM and KFCM delivers 56.71% and 58.4% of the average precision. The average recall of the proposed technique is 99.22%, and the existing methodologies delivered 96.38% and 97.33% of the average recall. Similarly, the average Jaccard coefficient of the proposed technique delivers 67.96%, and the existing methodologies attained 55.55% and 57.82% of the average Jaccard coefficient. The graphical comparison of the average precision, recall, and Jaccard coefficient is denoted in Figures 8 and 9. Similarly, the standard deviation of the proposed technique precision is 5.2561, and the existing techniques (FCM and KFCM) delivered 2.837 and 3.0480. The standard deviation of the proposed technique recall is 0.5178, and the existing techniques achieved 1.0200, and 0.1892. Finally, the standard deviation of the proposed technique Jaccard coefficient is 5.3154 and the existing techniques (FCM and KFCM) delivered 3.0264 and 3.0415.

Figure 8: Graphical comparison of precision and recall.
Figure 8:

Graphical comparison of precision and recall.

Figure 9: Graphical comparison of the Jaccard coefficient.
Figure 9:

Graphical comparison of the Jaccard coefficient.

In Table 2, performance evaluation of the existing and proposed method is validated in terms of accuracy and specificity. In this section, performance evaluation is validated for one sample image in each class of TMAD dataset. For the collected image, the average accuracy of the proposed technique is: for the modified KFCM, 96.92%, and the existing methodologies FCM and KFCM delivered 95.53% and 95.88% of the average accuracy. In addition, the average specificity of the proposed technique is 99.92%, and the existing methodologies delivered 95.47% and 95.80% of the average specificity. Correspondingly, the standard deviation of the proposed technique’s accuracy is 0.1517, and the existing techniques (FCM and KFCM) delivered 0.3914 and 0.5602. Similarly, the standard deviation of the proposed technique specificity is 0.0598, and the existing techniques (FCM and KFCM) achieved 0.3286 and 0.5346.

Table 2:

Accuracy and specificity comparison of the existing and proposed method.

Images Clustering technique TN TP FP FN Specificity (%) Accuracy
FCM 49,923 3045 2401 166 95.4113 95.3777
KFCM 51,957 3128 2209 146 95.9218 95.9001
Modified KFCM 59,000 4149 2008 81 99.8629 96.79
FCM 50,946 3132 2584 109 95.1728 95.2564
KFCM 51,440 3179 2580 104 95.224 95.3161
Modified KFCM 64,948 3713 2181 12 99.9815 96.90
FCM 60,831 4223 2648 74 95.8285 95.9838
KFCM 64,194 4401 2482 56 96.2775 96.432
Modified KFCM 74,338 4678 2315 47 99.9368 97.09

Tables 1 and 2 confirmed that the proposed approach performs effectively compared to the existing segmentation method on the TMAD dataset. The modified KFCM encodes both the local and shape features of the wavelet transformed histopathology image for improving the segmentation efficiency of lung cancer. The proposed modified KFCM algorithm fuses different pixel information, which represents a different correlation space to produce a new correlation value. The graphical comparison of the average accuracy and specificity is denoted in Figure 10.

Figure 10: Graphical comparison of accuracy and specificity.
Figure 10:

Graphical comparison of accuracy and specificity.

4.3 Comparative Analysis

Tables 3 and 4 represent the comparative study of the existing work and the proposed work performance. Yin et al. [22] developed an image analysis chain that was utilized to study the correlation between tumor cellularity from serial histopathological slides of a resected NSCLC tumor and diffusion coefficient (D value) calculated from the diffusion-weighted magnetic resonance imaging (DWI). On digitized histological image, color deconvolution along with cell nuclei segmentation was used in determining the cell type and local two-dimensional densities. Then, the DWT sequence information was over-laid with resected histology using non-invasive imaging modality data and prominent anatomical hallmarks of histology tissue blocks. Finally, spatial tumor cell density and cell number information were determined on the basis of the DWT data. For sample 30 histopathological images, the average accuracy of the proposed technique is: for the modified KFCM, 97.28 ± 0.014% and for the existing methodology: diffusion weighted [22] delivers 94.7 ±0.016% of the segmentation accuracy. Similarly, the specificity of the proposed technique delivers 99.95 ± 0.0075%, and the existing methodology delivers 94.3 ± 0.029% of the specificity. Compared to the existing method, the proposed methodology shows 3–5% of improvement in specificity and accuracy. The graphical comparison of accuracy and specificity is denoted in Figure 11.

Table 3:

Performance comparison of the proposed and existing approach by means of accuracy and specificity.

Method Cancer stages Accuracy (%) Specificity (%)
Diffusion weighted [22] Combined stages 94.7 ± 0.016 94.3 ± 0.029
Modified KFCM Normal 97.53 ± 0.007 99.95 ± 0.0121
Adenocarcinoma 97.02 ± 0.023 99.94 ± 0.074
Squamous 97.31 ± 0.012 99.97 ± 0.0225
Combined stages 97.28 ± 0.014 99.95 ± 0.0075
Table 4:

Performance comparison of the proposed and existing approach by means of precision, recall, and Jaccard coefficient.

Methodology Cancer stages Precision (%) Recal1 (%) Jaccard coefficient (%)
DCNN [13] Combined stages 92 92 92
Modified KFCM Normal 99.21 71.81 91.40
Adenocarcinoma 98.90 64.58 94.12
Squamous 99.54 70.04 95.81
Combined stages 99.21 68.81 93.77
Figure 11: Graphical comparison of the proposed and existing method by means of accuracy and specificity.
Figure 11:

Graphical comparison of the proposed and existing method by means of accuracy and specificity.

In Table 4, performance evaluation of the proposed method is: for the modified KFCM and the existing methodology: DCNN [13] is validated by means of precision, recall, and Jaccard coefficient. Khosravi et al. [13] developed numerous computational methodologies on the basis of CNN and also built a stand-alone pipeline for classifying the histopathology images across dissimilar types of cancer. In this research paper, stand-alone pipeline demonstrates the discriminate between two sub-types of lung cancer, five bio-markers of breast cancer, and four bio-markers of bladder cancer. The classification phase includes the ensemble of two algorithms (ResNe and inception), a basic CNN architecture, and Google’s Inceptions with three training approaches. The average precision of the proposed technique is: for the modified KFCM, 99.21% is delivered and the existing methodology: DCNN attains 92% of segmentation precision. Similarly, the Jaccard coefficient of the proposed technique delivers 93.77%, and the existing methodology achieves 92% of the Jaccard coefficient. Compared to the existing method, the proposed methodology shows better segmentation results by means of precision and Jaccard coefficient. Additionally, the recall of the proposed technique delivers 68.81%, and the existing methodology delivers 92% of the recall value. The proposed approach shows less recall value because while performing with multiple ROI on input histopathology image, the size of the pixel is decreased that reduces the performance of the proposed approach. The graphical representation of precision, recall, and Jaccard coefficient is denoted in Figure 12.

Figure 12: Graphical comparison of the proposed and existing method by means of precision, recall, and Jaccard coefficient.
Figure 12:

Graphical comparison of the proposed and existing method by means of precision, recall, and Jaccard coefficient.

4.4 Discussion about Proposed Methodology

Histopathology image segmentation plays a major role in cancer diagnosis, which delivers the necessary information for separating the non-cancer region from the cancer region. In this experimental research, histopathological image segmentation is carried out for the disease: lung cancer, which is one of the growing diseases in the medical field. Previously, several methodologies were developed to distinguish the cancer cells from non-cancer cells, but existing researches do not concentrate on the overlapped cancer and non-cancer cells. It is essential to distinguish the overlapped cancer and non-cancer cells for best recognition or segmentation. In this research, a new unsupervised machine learning approach was developed for segmenting the overlapped cancer cells from non-cancer cells. The effectiveness of the proposed methodology is shown in Tables 3 and 4. The performance analysis is verified by determining the performance metrics like precision, recall, specificity, accuracy, and Jaccard coefficient. Under such circumstance, the accuracy of the proposed methodology is 2.5% better than the existing approach (diffusion weighted). Additionally, precision and the Jaccard coefficient of the proposed methodology shows 7% and 1.5% improvement compared with the existing approach (DCNN). The proposed segmentation methodology includes numerous advantages: assists the doctors during surgery, cost efficient related to other existing machine-learning approaches, and earlier detection of lung diseases.

5 Conclusion

In this research paper, a new texture-based histopathological segmentation methodology is proposed, which is based on the modified KFCM with ROI. The modified KFCM is the most effective methodology in histopathological lung cancer segmentation. In this experimental research, the modified KFCM, along with the EBFT, is utilized for segmenting the cancer and non-cancer regions in histopathological images. The proposed methodology effectively combines the advantage of the EBFT and the modified KFCM methodology. The experimental investigation was verified on a publicly available database (TMAD dataset), which shows a superiority of the proposed methodology. The modified KFCM scheme delivered an effective segmentation performance, compared to the other obtainable approaches in histopathological lung cancer detection. The proposed methodology showed 2.5–5% of the enhancement in segmentation compared with the existing methods in terms of segmentation accuracy. In future work, an appropriate classification methodology will be used in classifying the cancer and non-cancer regions in the segmented histopathology image.

Bibliography

[1] L. Azevedo, A. M. Faustino and J. M. R. Tavares, Segmentation and 3D reconstruction of animal tissues in histological images, in: Computational and Experimental Biomedical Sciences: Methods and Applications, pp. 193–207, Springer, Cham., 2015.10.1007/978-3-319-15799-3_14Suche in Google Scholar

[2] S. Beagum, N. Dey, A. S. Ashour, D. Sifaki-Pistolla and V. E. Balas, Nonparametric de-noising filter optimization using structure-based microscopic image classification, Microsc. Res. Tech. 80 (2017), 419–429.10.1002/jemt.22811Suche in Google Scholar PubMed

[3] R. H. J. Breuer, P. E. Postmus and E. F. Smit, Molecular pathology of non-small-cell lung cancer, Respiration 72 (2005), 313–330.10.1159/000085376Suche in Google Scholar PubMed

[4] N. Coudray, A. L. Moreira, T. Sakellaropoulos, D. Fenyo, N. Razavian and A. Tsirigos, Classification and mutation prediction from non-small cell lung cancer histopathology images using deep learning, bioRxiv (2017), 1559.10.1038/s41591-018-0177-5Suche in Google Scholar PubMed

[5] N. Dey, A. S. Ashour, A. S. Ashour and A. Singh, Digital analysis of microscopic images in medicine. J. Adv. Microsc. Res. 10 (2015), 1–13.10.1166/jamr.2015.1229Suche in Google Scholar

[6] M. Dietel, L. Bubendorf, A. M. C. Dingemans, C. Dooms, G. Elmberger, R. C. García, K. M. Kerr, E. Lim, F. López-Ríos, E. Thunnissen and P. E. Van Schil, Diagnostic procedures for non-small-cell lung cancer (NSCLC): recommendations of the European Expert Group, Thorax 71 (2015), 177–184.10.1136/thoraxjnl-2014-206677Suche in Google Scholar PubMed PubMed Central

[7] Y. Ding and X. Fu, Kernel-based fuzzy c-means clustering algorithm based on genetic algorithm, Neurocomputing 188 (2016), 233–238.10.1016/j.neucom.2015.01.106Suche in Google Scholar

[8] R. Dorantes-Heredia, J. M. Ruiz-Morales and F. Cano-García, Histopathological transformation to small-cell lung carcinoma in non-small cell lung carcinoma tumors, Transl. Lung Cancer Res. 5 (2016), 401–412.10.21037/tlcr.2016.07.10Suche in Google Scholar PubMed PubMed Central

[9] L. He, L. R. Long, S. Antani and G. Thoma, Computer assisted diagnosis in histopathology, Sequence and Genome Analysis: Methods and Applications 3 (2010), 271–287.10.1117/2.1201011.003358Suche in Google Scholar

[10] S. Hore, S. Chakroborty, A. S. Ashour, N. Dey, A. S. Ashour, D. Sifaki-Pistolla, T. Bhattacharya and S. R. Chaudhuri, Finding contours of hippocampus brain cell using microscopic image analysis. J. Adv. Microsc. Res. 10 (2015), 93–103.10.1166/jamr.2015.1245Suche in Google Scholar

[11] H. D. Hosgood, C. Farah, C. C. Black, M. Schwenn and J. M. Hock, Spatial and temporal distributions of lung cancer histopathology in the state of Maine, Lung Cancer 82 (2013), 55–62.10.1016/j.lungcan.2013.06.018Suche in Google Scholar PubMed

[12] J. R. F. Junior, M. Koenigkam-Santos, F. E. G. Cipriano, A. T. Fabro and P. M. de Azevedo-Marques, Radiomics-based features for pattern recognition of lung cancer histopathology and metastases, Comput. Methods Programs Biomed. 159 (2018), 23–30.10.1016/j.cmpb.2018.02.015Suche in Google Scholar

[13] P. Khosravi, E. Kazemi, M. Imielinski, O. Elemento and I. Hajirasouliha, Deep convolutional neural networks enable discrimination of heterogeneous digital pathology images, EBioMedicine 27 (2018), 317–328.10.1016/j.ebiom.2017.12.026Suche in Google Scholar

[14] Z. Ma, J. M. R. Tavares and R. N. Jorge, A review on the current segmentation algorithms for medical images, in: Proceedings of the 1st International Conference on Imaging Theory and Applications (IMAGAPP), 2009.Suche in Google Scholar

[15] A. L. Moreira and M. Mino-Kenudson, Update on histologic classification of non-small cell lung cancer, Diagn. Histopathol. 20 (2014), 385–391.10.1016/j.mpdhp.2014.09.006Suche in Google Scholar

[16] M. G. Oser, M. J. Niederst, L. V. Sequist and J. A. Engelman, Transformation from non-small-cell lung cancer to small-cell lung cancer: molecular drivers and cells of origin, Lancet Oncol. 16 (2015), e165–e172.10.1016/S1470-2045(14)71180-5Suche in Google Scholar

[17] J. C. Sieren, J. Weydert, A. Bell, B. De Young, A. R. Smith, J. Thiesse, E. Namati and G. McLennan, An automated segmentation approach for highlighting the histological complexity of human lung cancer, Ann. Biomed. Eng. 38 (2010), 3581–3591.10.1007/s10439-010-0103-6Suche in Google Scholar PubMed PubMed Central

[18] T. H. Vu, H. S. Mousavi, V. Monga, G. Rao and U. A. Rao, Histopathological image classification using discriminative feature-oriented dictionary learning, IEEE Trans. Med. Imaging 35 (2016), 738–751.10.1109/TMI.2015.2493530Suche in Google Scholar PubMed PubMed Central

[19] C. W. Wang, Robust automated tumour segmentation on histological and immunohistochemical tissue images, PloS One 6 (2011), 15818.10.1371/journal.pone.0015818Suche in Google Scholar PubMed PubMed Central

[20] M. Wang, F. Tang, X. Pan, L. Yao, X. Wang, Y. Jing, J. Ma, G. Wang and L. Mi, Rapid diagnosis and intraoperative margin assessment of human lung cancer with fluorescence lifetime imaging microscopy, BBA Clin. 8 (2017), 7–13.10.1016/j.bbacli.2017.04.002Suche in Google Scholar PubMed PubMed Central

[21] F. Xing and L. Yang, Robust selection-based sparse shape model for lung cancer image segmentation, in: International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 404–412, Springer, Berlin, Heidelberg, 2013.10.1007/978-3-642-40760-4_51Suche in Google Scholar PubMed PubMed Central

[22] Y. Yin, O. Sedlaczek, B. Müller, A. Warth, M. González-Vallinas, B. Lahrmann, N. Grabe, H. U. Kauczor, K. Breuhahn, I. E. Vignon-Clementel and D. Drasdo, Tumor cell load and heterogeneity estimation from diffusion-weighted MRI calibrated with histological data: an example from lung cancer, IEEE Trans. Med. Imaging 37 (2018), 35–46.10.1109/TMI.2017.2698525Suche in Google Scholar PubMed

[23] K. H. Yu, C. Zhang, G. J. Berry, R. B. Altman, C. Ré, D. L. Rubin and M. Snyder, Predicting non-small cell lung cancer prognosis by fully automated microscopic pathology image features, Nat. Commun. 7 (2016), 12474.10.1038/ncomms12474Suche in Google Scholar PubMed PubMed Central

[24] K. H. Yu, G. J. Berry, D. L. Rubin, C. Ré, R. B. Altman and M. Snyder, Association of omics features with histopathology patterns in lung adenocarcinoma, Cell Syst. 5 (2017), 620–627.10.1016/j.cels.2017.10.014Suche in Google Scholar PubMed PubMed Central

[25] X. Zhang, H. Su, L. Yang and S. Zhang, Fine-grained histopathological image analysis via robust segmentation and large-scale retrieval, in: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5361–5368, 2015.10.1109/CVPR.2015.7299174Suche in Google Scholar

[26] X. Zhang, F. Xing, H. Su, L. Yang and S. Zhang, High-throughput histopathological image analysis via robust cell segmentation and hashing, Med. Image Anal. 26 (2015), 306–315.10.1016/j.media.2015.10.005Suche in Google Scholar PubMed PubMed Central

Received: 2018-07-27
Accepted: 2019-02-11
Published Online: 2019-03-20

©2020 Walter de Gruyter GmbH, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 Public License.

Artikel in diesem Heft

  1. An Optimized K-Harmonic Means Algorithm Combined with Modified Particle Swarm Optimization and Cuckoo Search Algorithm
  2. Texture Feature Extraction Using Intuitionistic Fuzzy Local Binary Pattern
  3. Leaf Disease Segmentation From Agricultural Images via Hybridization of Active Contour Model and OFA
  4. Deadline Constrained Task Scheduling Method Using a Combination of Center-Based Genetic Algorithm and Group Search Optimization
  5. Efficient Classification of DDoS Attacks Using an Ensemble Feature Selection Algorithm
  6. Distributed Multi-agent Bidding-Based Approach for the Collaborative Mapping of Unknown Indoor Environments by a Homogeneous Mobile Robot Team
  7. An Efficient Technique for Three-Dimensional Image Visualization Through Two-Dimensional Images for Medical Data
  8. Combined Multi-Agent Method to Control Inter-Department Common Events Collision for University Courses Timetabling
  9. An Improved Particle Swarm Optimization Algorithm for Global Multidimensional Optimization
  10. A Kernel Probabilistic Model for Semi-supervised Co-clustering Ensemble
  11. Pythagorean Hesitant Fuzzy Information Aggregation and Their Application to Multi-Attribute Group Decision-Making Problems
  12. Using an Efficient Optimal Classifier for Soil Classification in Spatial Data Mining Over Big Data
  13. A Bayesian Multiresolution Approach for Noise Removal in Medical Magnetic Resonance Images
  14. Gbest-Guided Artificial Bee Colony Optimization Algorithm-Based Optimal Incorporation of Shunt Capacitors in Distribution Networks under Load Growth
  15. Graded Soft Expert Set as a Generalization of Hesitant Fuzzy Set
  16. Universal Liver Extraction Algorithm: An Improved Chan–Vese Model
  17. Software Effort Estimation Using Modified Fuzzy C Means Clustering and Hybrid ABC-MCS Optimization in Neural Network
  18. Handwritten Indic Script Recognition Based on the Dempster–Shafer Theory of Evidence
  19. An Integrated Intuitionistic Fuzzy AHP and TOPSIS Approach to Evaluation of Outsource Manufacturers
  20. Automatically Assess Day Similarity Using Visual Lifelogs
  21. A Novel Bio-Inspired Algorithm Based on Social Spiders for Improving Performance and Efficiency of Data Clustering
  22. Discriminative Training Using Noise Robust Integrated Features and Refined HMM Modeling
  23. Self-Adaptive Mussels Wandering Optimization Algorithm with Application for Artificial Neural Network Training
  24. A Framework for Image Alignment of TerraSAR-X Images Using Fractional Derivatives and View Synthesis Approach
  25. Intelligent Systems for Structural Damage Assessment
  26. Some Interval-Valued Pythagorean Fuzzy Einstein Weighted Averaging Aggregation Operators and Their Application to Group Decision Making
  27. Fuzzy Adaptive Genetic Algorithm for Improving the Solution of Industrial Optimization Problems
  28. Approach to Multiple Attribute Group Decision Making Based on Hesitant Fuzzy Linguistic Aggregation Operators
  29. Cubic Ordered Weighted Distance Operator and Application in Group Decision-Making
  30. Fault Signal Recognition in Power Distribution System using Deep Belief Network
  31. Selector: PSO as Model Selector for Dual-Stage Diabetes Network
  32. Oppositional Gravitational Search Algorithm and Artificial Neural Network-based Classification of Kidney Images
  33. Improving Image Search through MKFCM Clustering Strategy-Based Re-ranking Measure
  34. Sparse Decomposition Technique for Segmentation and Compression of Compound Images
  35. Automatic Genetic Fuzzy c-Means
  36. Harmony Search Algorithm for Patient Admission Scheduling Problem
  37. Speech Signal Compression Algorithm Based on the JPEG Technique
  38. i-Vector-Based Speaker Verification on Limited Data Using Fusion Techniques
  39. Prediction of User Future Request Utilizing the Combination of Both ANN and FCM in Web Page Recommendation
  40. Presentation of ACT/R-RBF Hybrid Architecture to Develop Decision Making in Continuous and Non-continuous Data
  41. An Overview of Segmentation Algorithms for the Analysis of Anomalies on Medical Images
  42. Blind Restoration Algorithm Using Residual Measures for Motion-Blurred Noisy Images
  43. Extreme Learning Machine for Credit Risk Analysis
  44. A Genetic Algorithm Approach for Group Recommender System Based on Partial Rankings
  45. Improvements in Spoken Query System to Access the Agricultural Commodity Prices and Weather Information in Kannada Language/Dialects
  46. A One-Pass Approach for Slope and Slant Estimation of Tri-Script Handwritten Words
  47. Secure Communication through MultiAgent System-Based Diabetes Diagnosing and Classification
  48. Development of a Two-Stage Segmentation-Based Word Searching Method for Handwritten Document Images
  49. Pythagorean Fuzzy Einstein Hybrid Averaging Aggregation Operator and its Application to Multiple-Attribute Group Decision Making
  50. Ensembles of Text and Time-Series Models for Automatic Generation of Financial Trading Signals from Social Media Content
  51. A Flame Detection Method Based on Novel Gradient Features
  52. Modeling and Optimization of a Liquid Flow Process using an Artificial Neural Network-Based Flower Pollination Algorithm
  53. Spectral Graph-based Features for Recognition of Handwritten Characters: A Case Study on Handwritten Devanagari Numerals
  54. A Grey Wolf Optimizer for Text Document Clustering
  55. Classification of Masses in Digital Mammograms Using the Genetic Ensemble Method
  56. A Hybrid Grey Wolf Optimiser Algorithm for Solving Time Series Classification Problems
  57. Gray Method for Multiple Attribute Decision Making with Incomplete Weight Information under the Pythagorean Fuzzy Setting
  58. Multi-Agent System Based on the Extreme Learning Machine and Fuzzy Control for Intelligent Energy Management in Microgrid
  59. Deep CNN Combined With Relevance Feedback for Trademark Image Retrieval
  60. Cognitively Motivated Query Abstraction Model Based on Associative Root-Pattern Networks
  61. Improved Adaptive Neuro-Fuzzy Inference System Using Gray Wolf Optimization: A Case Study in Predicting Biochar Yield
  62. Predict Forex Trend via Convolutional Neural Networks
  63. Optimizing Integrated Features for Hindi Automatic Speech Recognition System
  64. A Novel Weakest t-norm based Fuzzy Fault Tree Analysis Through Qualitative Data Processing and Its Application in System Reliability Evaluation
  65. FCNB: Fuzzy Correlative Naive Bayes Classifier with MapReduce Framework for Big Data Classification
  66. A Modified Jaya Algorithm for Mixed-Variable Optimization Problems
  67. An Improved Robust Fuzzy Algorithm for Unsupervised Learning
  68. Hybridizing the Cuckoo Search Algorithm with Different Mutation Operators for Numerical Optimization Problems
  69. An Efficient Lossless ROI Image Compression Using Wavelet-Based Modified Region Growing Algorithm
  70. Predicting Automatic Trigger Speed for Vehicle-Activated Signs
  71. Group Recommender Systems – An Evolutionary Approach Based on Multi-expert System for Consensus
  72. Enriching Documents by Linking Salient Entities and Lexical-Semantic Expansion
  73. A New Feature Selection Method for Sentiment Analysis in Short Text
  74. Optimizing Software Modularity with Minimum Possible Variations
  75. Optimizing the Self-Organizing Team Size Using a Genetic Algorithm in Agile Practices
  76. Aspect-Oriented Sentiment Analysis: A Topic Modeling-Powered Approach
  77. Feature Pair Index Graph for Clustering
  78. Tangramob: An Agent-Based Simulation Framework for Validating Urban Smart Mobility Solutions
  79. A New Algorithm Based on Magic Square and a Novel Chaotic System for Image Encryption
  80. Video Steganography Using Knight Tour Algorithm and LSB Method for Encrypted Data
  81. Clay-Based Brick Porosity Estimation Using Image Processing Techniques
  82. AGCS Technique to Improve the Performance of Neural Networks
  83. A Color Image Encryption Technique Based on Bit-Level Permutation and Alternate Logistic Maps
  84. A Hybrid of Deep CNN and Bidirectional LSTM for Automatic Speech Recognition
  85. Database Creation and Dialect-Wise Comparative Analysis of Prosodic Features for Punjabi Language
  86. Trapezoidal Linguistic Cubic Fuzzy TOPSIS Method and Application in a Group Decision Making Program
  87. Histopathological Image Segmentation Using Modified Kernel-Based Fuzzy C-Means and Edge Bridge and Fill Technique
  88. Proximal Support Vector Machine-Based Hybrid Approach for Edge Detection in Noisy Images
  89. Early Detection of Parkinson’s Disease by Using SPECT Imaging and Biomarkers
  90. Image Compression Based on Block SVD Power Method
  91. Noise Reduction Using Modified Wiener Filter in Digital Hearing Aid for Speech Signal Enhancement
  92. Secure Fingerprint Authentication Using Deep Learning and Minutiae Verification
  93. The Use of Natural Language Processing Approach for Converting Pseudo Code to C# Code
  94. Non-word Attributes’ Efficiency in Text Mining Authorship Prediction
  95. Design and Evaluation of Outlier Detection Based on Semantic Condensed Nearest Neighbor
  96. An Efficient Quality Inspection of Food Products Using Neural Network Classification
  97. Opposition Intensity-Based Cuckoo Search Algorithm for Data Privacy Preservation
  98. M-HMOGA: A New Multi-Objective Feature Selection Algorithm for Handwritten Numeral Classification
  99. Analogy-Based Approaches to Improve Software Project Effort Estimation Accuracy
  100. Linear Regression Supporting Vector Machine and Hybrid LOG Filter-Based Image Restoration
  101. Fractional Fuzzy Clustering and Particle Whale Optimization-Based MapReduce Framework for Big Data Clustering
  102. Implementation of Improved Ship-Iceberg Classifier Using Deep Learning
  103. Hybrid Approach for Face Recognition from a Single Sample per Person by Combining VLC and GOM
  104. Polarity Analysis of Customer Reviews Based on Part-of-Speech Subcategory
  105. A 4D Trajectory Prediction Model Based on the BP Neural Network
  106. A Blind Medical Image Watermarking for Secure E-Healthcare Application Using Crypto-Watermarking System
  107. Discriminating Healthy Wheat Grains from Grains Infected with Fusarium graminearum Using Texture Characteristics of Image-Processing Technique, Discriminant Analysis, and Support Vector Machine Methods
  108. License Plate Recognition in Urban Road Based on Vehicle Tracking and Result Integration
  109. Binary Genetic Swarm Optimization: A Combination of GA and PSO for Feature Selection
  110. Enhanced Twitter Sentiment Analysis Using Hybrid Approach and by Accounting Local Contextual Semantic
  111. Cloud Security: LKM and Optimal Fuzzy System for Intrusion Detection in Cloud Environment
  112. Power Average Operators of Trapezoidal Cubic Fuzzy Numbers and Application to Multi-attribute Group Decision Making
Heruntergeladen am 12.9.2025 von https://www.degruyterbrill.com/document/doi/10.1515/jisys-2018-0316/html
Button zum nach oben scrollen