Startseite Construction of image segmentation system combining TC and swarm intelligence algorithm
Artikel Open Access

Construction of image segmentation system combining TC and swarm intelligence algorithm

  • Yongtao Wang EMAIL logo
Veröffentlicht/Copyright: 30. August 2025
Veröffentlichen auch Sie bei De Gruyter Brill

Abstract

Considering the slow convergence speed and local optima in most current swarm intelligence algorithms for image segmentation, this study combines the thresholding clustering with intelligent algorithms. The algorithm is optimized based on the characteristics of image segmentation. First, an improved differential evolution algorithm is applied to enhance the speed of threshold image segmentation. The convergence performance of the algorithm is improved by adjusting the crossover probability and mutation strategy. Second, considering the threshold difference, an improved two-dimensional Otsu threshold segmentation algorithm based on cuckoo algorithm is proposed, and fractional calculus processing and fractional enhancement filtering are introduced to improve the image segmentation quality. The loss value in image feature segmentation was less than 5 × 102, the accuracy exceeded 90% in different types of image segmentation, the running time did not exceed 2 s, and the structural similarity reached 0.912, which was significantly better than other algorithms. Moreover, the pixel accuracy of the image processing algorithm exceeded 90%, with good segmentation details and edge extraction effects. The proposed image segmentation method performs well in segmentation accuracy, edge clarity, and computational efficiency, which can improve the segmentation accuracy and application effect in image processing systems and provide a method for improving the generalization ability of segmentation models in different scenarios.

1 Introduction

Internet and computer vision technology have greatly increased image data. The dependence of medical imaging, face recognition, biometric security, and other applications on image analysis makes image segmentation results extremely important. The quality of segmentation results will directly affect a series of links in image processing [1]. Image segmentation is to divide the overall informationized image into different modules and regions based on feature differences, which facilitates further image analysis and understanding [2]. Relying on image characteristics, segmentation technique has two categories: edge detection and region detection. Overall, image segmentation problems are complex and difficult [3]. The image threshold segmentation technology mainly divides the set of pixel regions based on the grayscale level of the image. However, due to its excessive dependence on the selected threshold, its segmentation performance has significant differences and limitations. The traditional maximum inter-class variance method (Otsu) is widely used in current threshold segmentation algorithms. It mainly achieves threshold segmentation based on the maximum interclass variance of the image. However, it ignores the pixel information and spatial characteristics of the image during the segmentation process, making its segmentation accuracy difficult to meet practical needs [4]. Therefore, to improve segmentation accuracy and address the shortcomings of the traditional Otsu algorithm, this study combines it with swarm intelligence algorithms with strong optimization capabilities, using improved differential evolution (DE) and cuckoo search (CS) to improve image segmentation accuracy and efficiency. The innovation lies in combining threshold image segmentation characteristics and algorithm performance, which can improve image segmentation performance while ensuring processing accuracy and adaptability for different types of images.

2 Related works

As a fundamental step in image processing and visual analysis, image segmentation is extensively applied in character recognition, medical image analysis, and visual inspection. Traditional segmentation algorithms are significantly affected by factors such as threshold quantity and background noise. Improving the real-time performance of algorithm analysis without affecting search accuracy is a research hotspot. Some scholars have made different attempts. For example, Khrissi et al. used meta-heuristic algorithms to optimize clustering and improve image segmentation quality. The results showed that this method had better image processing performance than other basic methods and required less computation time [5]. Chowdhary et al. designed an intuition-based possibility fuzzy c-means method to improve medical image overlap clustering and data noise problems. The method had good classification performance. The average segmentation accuracy at different noise levels exceeded 85% [6]. Liu et al. considered the limitations of existing image segmentation methods in segmenting small and irregular particles. The morphology was used to segment key regions in images, and deep learning methods were used to achieve image classification. The results showed that the segmentation model could effectively solve the adhesion and overlap problems between adjacent particles. The segmentation performance was good [7]. Zhang et al. optimized the Deeplabv3 + network structure and loss function design for tongue image segmentation. The algorithm effectively reduced image misjudgment and improved the accuracy of tongue edge segmentation [8]. Vadivel and Suguna applied convolutional neural networks to leaf disease image recognition. The results showed that the classification accuracy of this method far exceeded 95% [9]. Stringer et al. proposed a cell segmentation method based on deep learning and trained the dataset using a three-dimensional extension of the cell model. The model did not require parameter adjustment and had good applicability and effectiveness [10].

Threshold segmentation technology is extensively used in image processing due to its high efficiency, accuracy, and precision. Sharma et al. proposed a multi-level threshold segmentation method, which utilized an improved firefly algorithm and the principle of class variance derivation to process images. The method had better structural similarity index measurement and objective function value compared with other meta-heuristic algorithms [11]. Resma and Nair proposed a meta-heuristic Krill flock optimization algorithm for image segmentation. The results showed that this multi-level threshold processing approach exhibited better applicability and lower computational efficiency compared with other algorithms [12]. In response to the limitations of solving and measuring multi-threshold image segmentation techniques, Dhal et al. conducted a literature review and discussion on the application of natural inspired optimization algorithms. The development of multi-threshold image models under this algorithm should consider multiple factors [13]. Wang et al. designed a multi-class segmentation method for aggregated images on the basis of optimized chaotic sparrow search algorithm, which improved processing speed by introducing new parameters. This method had a significant segmentation effect on high-precision images such as aggregates [14]. Nyo et al. used the Otsu thresholding method to achieve image segmentation and used morphological operations to obtain accurate target regions. The results showed that the segmentation accuracy of this method exceeded 95%, and the application effect in medicine was significant [15]. Rawas and El-Zaart designed an image segmentation model based on precise parallel algorithms and combined it with benchmark distribution thresholding techniques to extract and partition segmentation regions. The model had high segmentation accuracy on different benchmark datasets and significantly reduced processing time [16].

In summary, most scholars utilize swarm intelligence algorithms to solve image segmentation processing, but they rarely consider the premature convergence of the algorithm. Image segmentation methods that rely on threshold solving ideas ignore the attention to image pixel information and spatial characteristics, which limits previous research methods. In response to the shortcomings of current image threshold segmentation methods, this study combines threshold algorithms with swarm intelligence algorithms. Then, the fractional calculus processing idea is introduced to improve algorithm performance, enhancing image segmentation accuracy and quality.

3 Construction of image segmentation system combining thresholding clustering (TC) algorithm and swarm intelligence algorithm

The study combines the threshold image segmentation method with swarm intelligence algorithms to design an image segmentation system. First, the traditional TC is adaptively improved using an improved DE algorithm. Considering the complexity of image shapes and differences in target contrast, K-means clustering and subtraction clustering are combined to achieve image threshold segmentation. Subsequently, considering the threshold difference, the Otsu segmentation idea is combined with the improved cuckoo algorithm to improve image segmentation accuracy.

3.1 Threshold segmentation image processing based on improved differential algorithm

Otsu segmentation using the concept of class variance requires threshold traversal within the grayscale range, which can be time-consuming to some extent. Therefore, to improve the speed and accuracy of threshold solution, the DE is used to segment threshold image. The DE has good global search ability, but its traditional idea requires setting relevant parameters and mutation strategies, which is difficult to meet the needs of population evolution and high-dimensional data solving. Therefore, an improved DE based on improved adaptive control parameters is proposed. The improved DE optimizes the convergence performance of the algorithm by adjusting the crossover probability, as expressed mathematically in Eq. (1):

(1) CR i = rand n i × ( μ CR , 0.1 ) ,

where μ CR represents the mean. After each iteration, the control parameters are adaptively updated by introducing a random positive number for solving, as shown in Eq. (2):

(2) μ CR = ( 1 c ) × μ CR + c × m A × ( S CR ) , μ F = ( 1 c ) × μ F + c × m L × ( S F )

where S CR represents the set of parameters CR , S F represents a set of parameters F , c is a random positive number, m A is the arithmetic mean, and m L is the Lehmer mean. Figure 1 shows the parameter changes of variation factors.

Figure 1 
                  Changes in parameters of variation factors.
Figure 1

Changes in parameters of variation factors.

In Figure 1, the amplitude exhibited by excessively large mutation factors is more pronounced, while excessively small factors can cause the population to fall into local optima. Therefore, according to the different stages of the population, an adaptive mutation factor is set to implement parameter adjustment strategies. Eq. (3) is the adaptive mutation factor:

(3) F i = F max ( F max F min ) × gen gen total k .

where gen represents the current population algebra, gen total is the total number of iterations, F max signifies the upper bound of the variation factor, F min signifies the lower bound of the variation factor, and k is a random number. Meanwhile, adaptive adjustments are made to the mutation operation. The improved mutation strategy is shown in Eq. (4):

(4) v i g + 1 = x r 1 g + u i × F i × ( x r 2 g x r 3 g ) + ( 1 u i ) × F i × ( x best g x r 4 g ) .

where u i is the disturbance-scale parameter, v i g + 1 is an individual in the population, x best g is the optimal population individual, and x r 2 g is the experimental individual vector. Subsequently, considering the complex shape of the image and its contrast difference with the target, the study combines K-means clustering with subtractive clustering (SC) to achieve image threshold segmentation. The K-means algorithm is an unsupervised clustering algorithm, which often utilizes Euclidean distance to achieve distance partitioning. It can classify images on the basis of the distance between the feature values of image pixels and the cluster center values. Eq. (5) is the minimum cost function of the algorithm:

(5) J = j = 1 k i = 1 n dist ( x ij , m j ) ,

where x i j represents the data point, m j signifies the cluster center, dist is the distance, n signifies the data point, and k signifies the clusters. The traditional K-means algorithm is sensitive to the initial clustering center. The processing time of clustering results is related to the number of iterations and image noise points. Therefore, an improved SC algorithm is applied to achieve improvement. The SC algorithm identifies cluster centers by determining the data points with the highest density. To ensure that the obtained cluster centers are more in line with the actual situation, an improved method for cluster center identification based on distance measurement is proposed on the SC algorithm, taking into account the distance between data points and cluster centers. Figure 2 is a schematic diagram of the improved clustering algorithm process.

Figure 2 
                  Schematic diagram of the improved clustering algorithm process.
Figure 2

Schematic diagram of the improved clustering algorithm process.

The parameters of the SC algorithm are initialized and improved, including the number of clusters, hypersphere radius, and constraint coefficient. Then, the density values of each pixel are calculated. The density values of data points are updated based on the actual clustering centers to obtain all initial clustering points. The pixel distance of the clustering points is calculated to allocate the clustering centers, calculate the center positions, and end the process when the termination condition is met.

3.2 2D Otsu threshold segmentation algorithm based on the improved CS

Considering the threshold difference, this study introduces the Otsu segmentation for image segmentation. Otsu segmentation uses the maximum value of interclass variance in images to select the optimal threshold. Traditional one-dimensional Otsu thresholds that use grayscale pixels to distinguish between target and background images are inevitably affected by noise interference, leading to prominent excessive or inaccurate image segmentation [17]. Therefore, a two-dimensional Otsu threshold segmentation algorithm is proposed to expand the threshold search space by increasing the grayscale information of adjacent pixels. The probability of image occurrence in the two-dimensional Otsu threshold segmentation algorithm can be explained, as shown in Eq. (6):

(6) p ij = n ij MN ,

where MN signifies the two-dimensional size of the image, i signifies the grayscale, j represents the gradient, and n represents the pixels. Based on segmentation thresholds, images are divided into background and target categories. The occurrence probability can be mathematically represented, thereby reflecting the segmentation quality. Eq. (7) is the dispersion function:

(7) S ( s , t ) = P A × ( μ A μ ) × ( μ A μ ) T + P B × ( μ B μ ) × ( μ B μ ) T ,

where A is the background image, B is the target image, s is the threshold average grayscale, t is the gradient, μ A is the mean vector of the background image, μ B signifies the mean vector of the target image, μ is the total mean vector of pixels, and T represents the matrix transpose. To further enhance image quality, the study introduces fractional calculus processing and fractional enhancement filtering to improve the information quality of image preprocessing. Fractional calculus is a generalization of classical integer calculus, allowing the order of derivatives and integrals to be any real or complex number. Its definitions include Grunwald–Letnikov (G–L), Riemann–Liouville (R–L), and Caputo [18]. The fractional order is generated by the gamma function, where G–L is defined as transforming the differential order of the gamma function from an integer to a fraction and deriving it from the derivative definition of the integer order. The definition of R–L is to transform the order of calculus from integers to fractions using the Gamma function and then integrate and differentiate them. The definition of Caputo fractional calculus is a transformation defined by R–L, which involves differentiating a function and then integrating it. Eq. (8) is a mathematical expression of three defined forms:

(8) R L = 1 Γ ( n α ) d n f d t n a x f ( τ ) ( t τ ) α n + 1 d τ G L = h α k = 0 N ( 1 ) k Γ ( α + k ) k ! Γ ( α ) f ( t k h ) Caputo = 1 Γ ( n α ) a t ( x τ ) α n + 1 f ( n ) ( τ ) d τ

where α represents the fractional order, n represents the order, Γ is the Gamma function, f ( ) is the function to be processed, τ is the integral variable, h is the discretization step size, t is the time, x is the independent variable, k is the summation variable, and N is the total number of discrete points. Fractional-order differentiation and integration are linear operators, where fractional-order derivatives depend on the global information of the function rather than local information, and can describe the memory and historical dependency characteristics of the system. The fractional-order differential filter enhances the high-frequency components of the signal through fractional-order differentiation, while retaining the low-frequency components [19]. The fractional-order integral filter smooths the signal through fractional-order integration and suppresses high-frequency noise. Fractional-order filters convert signals to the frequency domain through Fourier transform, apply fractional-order differentiation or integration operators, and then inverse-transform back to the time domain. The implementation in the time domain is obtained by discretizing fractional-order differentiation and integration. Namely, a two-dimensional Otsu threshold segmentation algorithm based on the improved CS is designed. The CS algorithm is a biomimetic biological intelligent optimization algorithm that effectively solves the optimal problem by simulating the parasitic brooding of cuckoo birds. The transformation and updating of the nest position can achieve the optimal position of the parasitic nest [20]. Eq. (9) is the updated the position of cuckoos.

(9) X k t + 1 = X k t + α 0 Levy ( β ) ( X k t X best ) ( k = 1 , 2 , 3 , . . . , n ) ,

where X k t + 1 represents the candidate solution, X k t represents the current solution, represents the point-to-point multiplication, α 0 is the step size control factor, Levy ( β ) represents the Levi’s flight, and X best represents the optimal solution. In the CS algorithm, cuckoo can add new solutions according to the random preference walk strategy. The CS algorithm process is illustrated in Figure 3.

Figure 3 
                  CS algorithm flow diagram.
Figure 3

CS algorithm flow diagram.

In Figure 3, after inputting the initial parameters and relevant boundary conditions, the initial value can be solved. The steps of Levy flight and preference random walk can be executed until the global optimal position and optimal value are output. However, the updated cuckoo individuals are more prone to getting stuck in local optima problems. The flight mechanism of the CS algorithm may experience occasional large jumps after multiple gatherings. Therefore, the study improves the CS algorithm through the genetic and memory properties of fractional calculus and applies the fractional differential G–L to the Levy flight mechanism to update the position. Eq. (10) is the updated position:

(10) X k t + 1 X k t = α 0  Levy ( β ) ( X k t X best ) ( k = 1 , 2 , 3 , . . . , n ) ,

where X k t + 1 X k t can be represented as a difference of order 1. G–L can redefine and generalize the order derivatives of functions, expanding the boundaries of calculus. The first four terms defined by G–L are selected to express Levi’s flight, as shown in Eq. (11):

(11) X k t + 1 = α X k t + 1 2 α ( 1 α ) X k t 1 + 1 6 α ( 1 α ) ( 2 α ) X k t 2 + 1 24 α ( 1 α ) ( 2 α ) ( 3 α ) X k t 3 + α 0  Levy ( β ) ( X k t X best ) .

Meanwhile, the position information of cuckoo birds is used to adaptively adjust the fractional order. The evolutionary factor is introduced to dynamically adjust the algorithm. The factor is displayed in Eq. (12):

(12) f = d g d min d max d min ,

where f represents the evolutionary factor, d g signifies the average distance between the global optimal solution and other individuals, and d max and d min signify the maximum and minimum values of the interclass distance. Considering that fractional-order values greater than 0.5 can cause the algorithm to converge too quickly, Eq. (13) is used to constrain the order:

(13) α ( f ) = 0.5 α e 0.47 ,

where e is the base of the natural logarithm function. The process of image segmentation algorithm is displayed in Figure 4.

Figure 4 
                  Image segmentation algorithm flow.
Figure 4

Image segmentation algorithm flow.

In Figure 4, the population needs to be initialized. The dispersion and objective function values of the segmented image need to be calculated. Based on the sorting results, the initial optimal solution position and resolution are obtained. Then, the improved Levy flight mechanism is executed, and the position update and resolution are achieved through random walk preference. After the iteration process is completed, the best threshold obtained can be used for image segmentation. The image segmentation process is implemented using the threshold of the individual’s optimal position. The output image ends the process.

4 Analysis of image segmentation results

The experimental environment for the research is Intel(R) Core (TM) i5-8300H CPU @ 2.30 GHz, with 16GB of memory. The system is Windows 10 (64 bit) Professional Edition, and the software is MATLAB R2016a (64 bit). The study selects different types of images from Pascal VOC dataset, USI-SIPI image database of the University of California, and Biomedical Imaging International Symposium Competition database, including people, landscapes, scenes, and medicine. Each image has a resolution of 512 × 512 pixels and a size of 256*256. The study selects indicators such as fitness value, peak signal-to-noise ratio, and structural similarity index to objectively evaluate the quality of image segmentation. The proposed method is compared with lightweight Attention Convolutional Neural Network (ACNN), Multi-Modal Feature Fusion Algorithm (RGB-D Attention, RGB-DA), Fractional Pigeon-Inspired Optimization (FPIO-Otsu), and spatial information Adaptive Fuzzy Clustering (AFC). Composite and mixed functions from the CEC20217 benchmark dataset are selected for testing. Figure 5 displays the fitness results.

Figure 5 
               Fitness results of different algorithms: (a) composite function and (b) mixed function.
Figure 5

Fitness results of different algorithms: (a) composite function and (b) mixed function.

In Figure 5(a), on the composite function, the proposed algorithm had a slow curve descent rate in finding the optimal value, converging to 1.3 × 104. The fitness curves of other comparison algorithms had different fluctuations, and there was a break in the curve after more than 600 iterations. The optimal values of AFC and ACNN algorithms were less than 0.53 × 104. In Figure 5(b), when the iteration was greater than 200, the curves with good fitness performance were research model > FPIO-Otsu algorithm > RGB-DA algorithm > ACNN algorithm > AFC algorithm. The loss and error during image feature segmentation of the aforementioned algorithms are analyzed. The results are shown in Figure 6.

Figure 6 
               Data loss and error during image segmentation processing: (a) loss situation and (b) error situation.
Figure 6

Data loss and error during image segmentation processing: (a) loss situation and (b) error situation.

In Figure 6(a), the proposed algorithm showed a relatively small loss value on the loss data and remained stable in the later stages of iteration, with a loss value of less than 5 × 102. The number of iterations required for the loss curve of other comparison algorithms to converge was generally greater than 400, with an average loss value greater than 5 × 102. In Figure 6(b), the image feature segmentation error curves of the five algorithms showed a decreasing trend with the increase of iterations. The error value of the proposed algorithm tended to the average value of 0.56 after more than 200 iterations. The FPIO-Otsu algorithm performed second only to the research algorithm, with an error value of less than 0.65. The worst-performing AFC algorithm had an average error value greater than 0.70. The image segmentation accuracy for the aforementioned algorithms is shown in Table 1.

Table 1

Image segmentation accuracy under different algorithm processing

Image type Model Accuracy rate (%) Recall (%) F1
Character image Research algorithm 94.33 96.15 97.02
FPIO-Otsu 92.26 93.22 92.28
RGB-DA 90.15 89.39 89.61
ACNN 85.33 88.42 87.11
AFC 82.25 86.13 83.38
Landscape image Research algorithm 95.02 97.14 96.65
FPIO-Otsu 91.15 90.33 91.38
RGB-DA 87.98 87.22 88.01
ACNN 80.24 83.26 82.21
AFC 87.22 88.13 87.34
Scene image Research algorithm 92.74 90.92 92.11
FPIO-Otsu 90.78 92.82 91.67
RGB-DA 86.92 89.74 90.96
ACNN 82.93 84.87 83.82
AFC 81.16 85.32 84.53
Medical imaging Research algorithm 90.25 90.24 90.09
FPIO-Otsu 84.32 87.16 88.38
RGB-DA 80.35 82.29 81.24
ACNN 74.23 75.23 76.23
AFC 75.23 76.23 77.23

In Table 1, the proposed algorithm demonstrated a feature recognition accuracy of over 90% for all four types of image segmentation, with recall rates of 96.15, 97.14, 90.92, and 90.24%, respectively. The FPIO-Otsu algorithm and RGB-DA algorithm performed better, with an overall accuracy of over 80% for image feature classification and over 85% for segmentation accuracy on character and landscape images, indicating their good processing effect on images with clear features. The AFC algorithm performed the worst in image segmentation accuracy, with segmentation accuracy and recall of 75.23 and 76.23% on medical images, respectively. On character and landscape images, the maximum segmentation accuracy difference among the five algorithms did not exceed 15%, while the difference was obvious on scene and medical images. In comparison algorithms, the single segmentation method is difficult to ensure image details, and the recognition effect is slightly worse than the segmentation algorithms proposed in the research. For example, the FPIO-Otsu algorithm utilizes intelligent algorithm and threshold solving idea for image segmentation. Although it has good segmentation accuracy, its accuracy is less than 90% when processing medical images due to not considering thresholds. The average running time of image segmentation is analyzed, as displayed in Table 2.

Table 2

Average running time (s) of image segmentation using different algorithms

Image type Model Number of thresholds
3 4 5
Character image Research algorithm 1.62044 1.67633 1.45733
FPIO-Otsu 1.70022 1.71933 1.56877
RGB-DA 1.72466 2.53712 1.64712
ACNN 1.70523 2.60322 1.69912
AFC 1.72512 2.62071 1.77491
Landscape image Research algorithm 1.62622 1.70278 1.49922
FPIO-Otsu 1.74122 1.73422 1.55967
RGB-DA 1.72678 2.55011 1.66967
ACNN 1.71933 2.60711 1.73311
AFC 1.80689 2.62622 1.26822
Scene image Research algorithm 1.69825 1.70955 1.89922
FPIO-Otsu 1.73088 1.77333 1.56788
RGB-DA 1.74444 1.89155 1.99211
ACNN 1.77444 1.96411 1.88211
AFC 1.81478 2.05633 1.90055
Medical imaging Research algorithm 1.52922 1.69933 1.25733
FPIO-Otsu 2.68855 2.53712 2.56877
RGB-DA 2.71844 2.67289 2.64712
ACNN 2.82733 2.70088 2.69912
AFC 2.85133 2.78522 2.77491

In Table 2, the average running time of the proposed algorithm for image segmentation was lower than other comparison algorithms under the same conditions. Its running time did not exceed 2 s at different threshold numbers, while other algorithms showed more significant changes in running time, with an average running time of over 2s on medical images. The aforementioned results indicate that the proposed algorithm has good processing efficiency. Subsequently, further analysis is conducted on the image segmentation quality, as displayed in Table 3.

Table 3

Evaluation of image segmentation quality under different algorithms

Algorithm Information entropy Structural similarity Mean contrast Peak signal-to-noise ratio Brightness relationship factor (%) Mutual information
AFC 7.658 0.817 1.547 1.352 0.758 6.028
ACNN 7.691 0.824 1.751 1.161 0.804 7.094
RGB-DA 7.805 0.845 1.069 1.341 0.825 6.436
FPIO-Otsu 7.626 0.877 1.495 1.585 0.901 7.385
Research algorithm 7.962 0.912 1.825 2.194 0.934 9.192

In Table 3, the information entropy, structural similarity, contrast mean, peak signal-to-noise ratio, brightness relationship factor, and mutual information of the proposed segmentation algorithm were 7.962, 0.912, 1.825, 2.194, 0.934, and 9.192, respectively. Its overall image segmentation quality was significantly better than other algorithms. The FPIO-Otsu algorithm performed well next. The structural similarity values of the RGB-DA algorithm, ACNN algorithm, and AFC algorithm were 0.845, 0.824, and 0.817, respectively, indicating poor feature extraction performance of the background and target images during image segmentation. Its brightness relationship factor was also less than 0.85, indicating that there may be some information loss. The proposed algorithm exhibits good segmentation accuracy, and the quality evaluation index values are much higher than other algorithms under the same conditions. Table 4 shows the pixel segmentation results of the image.

Table 4

Pixel segmentation results of different algorithms for image segmentation

Algorithm Pixel accuracy (%) Average pixel accuracy (%) Average IoU (%)
AFC 88.32 75.46 62.38
ACNN 89.15 77.39 64.51
RGB-DA 90.06 82.35 71.35
FPIO-Otsu 91.78 85.13 72.33
Research algorithm 94.33 88.76 79.25

Pixel accuracy can reflect the proportion of correctly classified pixels, with a higher value indicating better segmentation performance. Intersection over union (IoU) can reflect the proportion of the segmented area to the real area, with a higher value indicating better model performance. In Table 4, the pixel accuracy, average pixel accuracy, and average IoU of the research algorithm were 94.33, 88.76, and 79.25%, which exceeded other comparison algorithms. The average pixel accuracy and average IoU of the AFC algorithm and the ACNN algorithm were lower than 80 and 65%, respectively, indicating poor segmentation performance. Subsequently, FPIO-Otsu algorithm and RGB-DA algorithm with good performance are selected for image segmentation instance verification and compared with the segmentation algorithm proposed in the study. The results are displayed in Figure 7.

Figure 7 
               Retinal vascular segmentation results. Original image: (a) research model, (b) FPIO-Otsu, and (c) RGB-DA.
Figure 7

Retinal vascular segmentation results. Original image: (a) research model, (b) FPIO-Otsu, and (c) RGB-DA.

In Figure 7, the proposed algorithm had good segmentation performance. It could capture the details of some blood vessel extensions well. The FPIO-Otsu algorithm utilized the pigeon swarm algorithm and fractional-order thinking to solve problems, but there was oversegmentation in the central part. The continuity in details was insufficient, resulting in slightly lower segmentation accuracy than the research algorithm. The RGB-DA algorithm performed segmentation based on image characteristics, making it difficult to extract vascular edges. Figure 8 shows the segmentation results of indoor and outdoor scene images.

Figure 8 
               Image segmentation results of indoor and outdoor scenes: (a) indoor scene and (b) outdoor scene.
Figure 8

Image segmentation results of indoor and outdoor scenes: (a) indoor scene and (b) outdoor scene.

In Figure 8(a), the proposed model could effectively segment different objects with clear detail processing and distinct feature differentiation. However, the other two algorithms exhibited information loss and target blurring during image segmentation. In Figure 8(b), the other two comparison algorithms had poor segmentation performance for vehicle shape and pedestrian contour, while the proposed model could better reflect the image situation and perform well in segmentation accuracy.

5 Conclusion

To address the shortcomings of traditional swarm intelligence algorithms in image segmentation, a new method combining TC algorithm and swarm intelligence algorithm for image segmentation was designed and tested. From the results, the loss value in image feature segmentation was less than 5 × 102. The error value tended to 0.56 after more than 200 iterations. However, the error curves of other comparison algorithms had fluctuations, with the worst-performing AFC algorithm having an average error value greater than 0.70. The algorithm proposed in the study showed feature recognition accuracy of over 90% in four types of image segmentation: character, landscape, scene, and medical. The FPIO-Otsu algorithm and RGB-DA algorithm performed better, with an overall accuracy of over 80% for image feature classification. The running time of the research algorithm under different threshold numbers did not exceed 2 s, which was much shorter than other comparison algorithms under the same conditions. The information entropy, structural similarity, contrast mean, peak signal-to-noise ratio, brightness relationship factor, and mutual information of the research algorithm reached 7.962, 0.912, 1.825, 2.194, 0.934, and 9.192, respectively. Its overall image segmentation quality was significantly better than other algorithms. The structural similarity values of RGB-DA algorithm, ACNN algorithm, and AFC algorithm were 0.845, 0.824, and 0.817, respectively, indicating poor feature extraction performance of the background and target images during image segmentation. Their brightness relationship factor was also less than 0.85, indicating that there may be some information loss. The pixel accuracy, average pixel accuracy, and average IoU of the research algorithm were 94.33, 88.76, and 79.25%, respectively, which were much higher than other comparison algorithms. Moreover, its segmentation details and edge extraction effects on medical images and scene images were significant. The FPIO-Otsu algorithm and RGB-DA algorithm exhibited information loss and target blurring during image segmentation. The improvement proposed by combining image segmentation with swarm intelligence algorithms can effectively improve segmentation quality and ensure algorithm convergence and solving speed. With the advancement of image processing technology, strengthening the research on segmentation accuracy of multi-target images and dynamic video images is an important focus in the future.

  1. Funding information: The author states no funding involved.

  2. Author contributions: The author has accepted responsibility for the entire content of this manuscript and approved its submission.

  3. Conflict of interest: The author states no conflict of interest.

  4. Data availability statement: All data generated or analyzed during this study are included in this published article.

References

[1] Minaee S, Boykov Y, Porikli F, Plaza A, Kehtarnavaz N, Terzopoulos D. Image segmentation using deep learning: A survey. IEEE Trans Pattern Anal Mach Intell. 2021;44(7):3523–42.10.1109/TPAMI.2021.3059968Suche in Google Scholar PubMed

[2] Yu Y, Wang C, Fu Q, Kou R, Huang F, Yang B, et al. Techniques and challenges of image segmentation: A review. Electronics. 2023;12(5):1199.10.3390/electronics12051199Suche in Google Scholar

[3] Wang R, Lei T, Cui R, Zhang B, Meng H, Nandi AK. Medical image segmentation using deep learning: A survey. IET Image Process. 2022;16(5):1243–67.10.1049/ipr2.12419Suche in Google Scholar

[4] Pal R, Mukhopadhyay S, Chakraborty D, Suganthan PN. Very high-resolution satellite image segmentation using variable-length multi-objective genetic clustering for multi-class change detection. J King Saud Univ Comput Inf Sci. 2022;34(10):9964–76.10.1016/j.jksuci.2021.12.023Suche in Google Scholar

[5] Khrissi L, El Akkad N, Satori H, Satori K. Clustering method and sine cosine algorithm for image segmentation. Evol Intell. 2022;15(1):669–82.10.1007/s12065-020-00544-zSuche in Google Scholar

[6] Chowdhary CL, Mittal M, Kumaresan P, Pattanaik PA, Marszalek Z. An efficient segmentation and classification system in medical images using intuitionist possibilistic fuzzy C-mean clustering and fuzzy SVM algorithm. Sensors. 2020;20(14):3903.10.3390/s20143903Suche in Google Scholar PubMed PubMed Central

[7] Liu Y, Zhang Z, Liu X, Wang L. Efficient image segmentation based on deep learning for mineral image classification. Adv Powder Technol. 2021;32(10):3885–903.10.1016/j.apt.2021.08.038Suche in Google Scholar

[8] Zhang X, Bian H, Cai Y, Zhang K, Li H. An improved tongue image segmentation algorithm based on Deeplabv3 + framework. IET Image Process. 2022;16(5):1473–85.10.1049/ipr2.12425Suche in Google Scholar

[9] Vadivel T, Suguna R. Retracted Article: Automatic recognition of tomato leaf disease using fast enhanced learning with image processing. Acta Agric Scand B Soil Plant Sci. 2022;72(1):312–24.10.1080/09064710.2021.1976266Suche in Google Scholar

[10] Stringer C, Wang T, Michaelos M, Pachitariu M. Cellpose: A generalist algorithm for cellular segmentation. Nat Methods. 2021;18(1):100–6.10.1038/s41592-020-01018-xSuche in Google Scholar PubMed

[11] Sharma A, Chaturvedi R, Bhargava A. A novel opposition based improved firefly algorithm for multilevel image segmentation. Multimed Tools Appl. 2022;81(11):15521–44.10.1007/s11042-022-12303-6Suche in Google Scholar

[12] Resma KB, Nair MS. Multilevel thresholding for image segmentation using Krill Herd Optimization algorithm. J King Saud Univ Comput Inf Sci. 2021;33(5):528–41.10.1016/j.jksuci.2018.04.007Suche in Google Scholar

[13] Dhal KG, Das A, Ray S, Gálvez J, Das S. Nature-inspired optimization algorithms and their application in multi-thresholding image segmentation. Arch Comput Methods Eng. 2020;27(3):855–88.10.1007/s11831-019-09334-ySuche in Google Scholar

[14] Wang M, Wang W, Feng S, Li L. Adaptive multi-class segmentation model of aggregate image based on improved sparrow search algorithm. KSII Trans Internet Inf Syst. 2023;17(2):391–411.10.3837/tiis.2023.02.006Suche in Google Scholar

[15] Nyo MT, Mebarek-Oudina F, Hlaing SS, Khan NA. Otsu’s thresholding technique for MRI image brain tumor segmentation. Multimed Tools Appl. 2022;81(30):43837–49.10.1007/s11042-022-13215-1Suche in Google Scholar

[16] Rawas S, El-Zaart A. Precise and parallel segmentation model (PPSM) via MCET using hybrid distributions. Appl Comput Inf. 2024;20(3/4):262–78.10.1108/ACI-11-2020-0123Suche in Google Scholar

[17] Mokayed H, Quan TZ, Alkhaled L, Sivakumar V. Real-time human detection and counting system using deep learning computer vision techniques. Artif Intell Appl. 2023;1(4):221–9.10.47852/bonviewAIA2202391Suche in Google Scholar

[18] Huang G, Qin H, Chen Q, Shi Z, Jiang S, Huang C. Research on application of fractional calculus operator in image underlying processing. Fractals. 2024;8(1):37.10.3390/fractalfract8010037Suche in Google Scholar

[19] Arora S, Mathur T, Agarwal S, Tiwari K, Gupta P. Applications of fractional calculus in computer vision: A survey. Neurocomputing. 2022;489:407–28.10.1016/j.neucom.2021.10.122Suche in Google Scholar

[20] Wu J, Wu C, Lin Y, Yoshinaga T, Zhong L, Chen X, et al. Semantic segmentation-based semantic communication system for image transmission. Digit Commun Netw. 2024;10(3):519–27.10.1016/j.dcan.2023.02.006Suche in Google Scholar

Received: 2024-12-13
Revised: 2025-03-31
Accepted: 2025-05-12
Published Online: 2025-08-30

© 2025 the author(s), published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Artikel in diesem Heft

  1. Research Articles
  2. Generalized (ψ,φ)-contraction to investigate Volterra integral inclusions and fractal fractional PDEs in super-metric space with numerical experiments
  3. Solitons in ultrasound imaging: Exploring applications and enhancements via the Westervelt equation
  4. Stochastic improved Simpson for solving nonlinear fractional-order systems using product integration rules
  5. Exploring dynamical features like bifurcation assessment, sensitivity visualization, and solitary wave solutions of the integrable Akbota equation
  6. Research on surface defect detection method and optimization of paper-plastic composite bag based on improved combined segmentation algorithm
  7. Impact the sulphur content in Iraqi crude oil on the mechanical properties and corrosion behaviour of carbon steel in various types of API 5L pipelines and ASTM 106 grade B
  8. Unravelling quiescent optical solitons: An exploration of the complex Ginzburg–Landau equation with nonlinear chromatic dispersion and self-phase modulation
  9. Perturbation-iteration approach for fractional-order logistic differential equations
  10. Variational formulations for the Euler and Navier–Stokes systems in fluid mechanics and related models
  11. Rotor response to unbalanced load and system performance considering variable bearing profile
  12. DeepFowl: Disease prediction from chicken excreta images using deep learning
  13. Channel flow of Ellis fluid due to cilia motion
  14. A case study of fractional-order varicella virus model to nonlinear dynamics strategy for control and prevalence
  15. Multi-point estimation weldment recognition and estimation of pose with data-driven robotics design
  16. Analysis of Hall current and nonuniform heating effects on magneto-convection between vertically aligned plates under the influence of electric and magnetic fields
  17. A comparative study on residual power series method and differential transform method through the time-fractional telegraph equation
  18. Insights from the nonlinear Schrödinger–Hirota equation with chromatic dispersion: Dynamics in fiber–optic communication
  19. Mathematical analysis of Jeffrey ferrofluid on stretching surface with the Darcy–Forchheimer model
  20. Exploring the interaction between lump, stripe and double-stripe, and periodic wave solutions of the Konopelchenko–Dubrovsky–Kaup–Kupershmidt system
  21. Computational investigation of tuberculosis and HIV/AIDS co-infection in fuzzy environment
  22. Signature verification by geometry and image processing
  23. Theoretical and numerical approach for quantifying sensitivity to system parameters of nonlinear systems
  24. Chaotic behaviors, stability, and solitary wave propagations of M-fractional LWE equation in magneto-electro-elastic circular rod
  25. Dynamic analysis and optimization of syphilis spread: Simulations, integrating treatment and public health interventions
  26. Visco-thermoelastic rectangular plate under uniform loading: A study of deflection
  27. Threshold dynamics and optimal control of an epidemiological smoking model
  28. Numerical computational model for an unsteady hybrid nanofluid flow in a porous medium past an MHD rotating sheet
  29. Regression prediction model of fabric brightness based on light and shadow reconstruction of layered images
  30. Dynamics and prevention of gemini virus infection in red chili crops studied with generalized fractional operator: Analysis and modeling
  31. Qualitative analysis on existence and stability of nonlinear fractional dynamic equations on time scales
  32. Fractional-order super-twisting sliding mode active disturbance rejection control for electro-hydraulic position servo systems
  33. Analytical exploration and parametric insights into optical solitons in magneto-optic waveguides: Advances in nonlinear dynamics for applied sciences
  34. Bifurcation dynamics and optical soliton structures in the nonlinear Schrödinger–Bopp–Podolsky system
  35. Review Article
  36. Haar wavelet collocation method for existence and numerical solutions of fourth-order integro-differential equations with bounded coefficients
  37. Special Issue: Nonlinear Analysis and Design of Communication Networks for IoT Applications - Part II
  38. Silicon-based all-optical wavelength converter for on-chip optical interconnection
  39. Research on a path-tracking control system of unmanned rollers based on an optimization algorithm and real-time feedback
  40. Analysis of the sports action recognition model based on the LSTM recurrent neural network
  41. Industrial robot trajectory error compensation based on enhanced transfer convolutional neural networks
  42. Research on IoT network performance prediction model of power grid warehouse based on nonlinear GA-BP neural network
  43. Interactive recommendation of social network communication between cities based on GNN and user preferences
  44. Application of improved P-BEM in time varying channel prediction in 5G high-speed mobile communication system
  45. Construction of a BIM smart building collaborative design model combining the Internet of Things
  46. Optimizing malicious website prediction: An advanced XGBoost-based machine learning model
  47. Economic operation analysis of the power grid combining communication network and distributed optimization algorithm
  48. Sports video temporal action detection technology based on an improved MSST algorithm
  49. Internet of things data security and privacy protection based on improved federated learning
  50. Enterprise power emission reduction technology based on the LSTM–SVM model
  51. Construction of multi-style face models based on artistic image generation algorithms
  52. Research and application of interactive digital twin monitoring system for photovoltaic power station based on global perception
  53. Special Issue: Decision and Control in Nonlinear Systems - Part II
  54. Animation video frame prediction based on ConvGRU fine-grained synthesis flow
  55. Application of GGNN inference propagation model for martial art intensity evaluation
  56. Benefit evaluation of building energy-saving renovation projects based on BWM weighting method
  57. Deep neural network application in real-time economic dispatch and frequency control of microgrids
  58. Real-time force/position control of soft growing robots: A data-driven model predictive approach
  59. Mechanical product design and manufacturing system based on CNN and server optimization algorithm
  60. Application of finite element analysis in the formal analysis of ancient architectural plaque section
  61. Research on territorial spatial planning based on data mining and geographic information visualization
  62. Fault diagnosis of agricultural sprinkler irrigation machinery equipment based on machine vision
  63. Closure technology of large span steel truss arch bridge with temporarily fixed edge supports
  64. Intelligent accounting question-answering robot based on a large language model and knowledge graph
  65. Analysis of manufacturing and retailer blockchain decision based on resource recyclability
  66. Flexible manufacturing workshop mechanical processing and product scheduling algorithm based on MES
  67. Exploration of indoor environment perception and design model based on virtual reality technology
  68. Tennis automatic ball-picking robot based on image object detection and positioning technology
  69. A new CNN deep learning model for computer-intelligent color matching
  70. Design of AR-based general computer technology experiment demonstration platform
  71. Indoor environment monitoring method based on the fusion of audio recognition and video patrol features
  72. Health condition prediction method of the computer numerical control machine tool parts by ensembling digital twins and improved LSTM networks
  73. Establishment of a green degree evaluation model for wall materials based on lifecycle
  74. Quantitative evaluation of college music teaching pronunciation based on nonlinear feature extraction
  75. Multi-index nonlinear robust virtual synchronous generator control method for microgrid inverters
  76. Manufacturing engineering production line scheduling management technology integrating availability constraints and heuristic rules
  77. Analysis of digital intelligent financial audit system based on improved BiLSTM neural network
  78. Attention community discovery model applied to complex network information analysis
  79. A neural collaborative filtering recommendation algorithm based on attention mechanism and contrastive learning
  80. Rehabilitation training method for motor dysfunction based on video stream matching
  81. Research on façade design for cold-region buildings based on artificial neural networks and parametric modeling techniques
  82. Intelligent implementation of muscle strain identification algorithm in Mi health exercise induced waist muscle strain
  83. Optimization design of urban rainwater and flood drainage system based on SWMM
  84. Improved GA for construction progress and cost management in construction projects
  85. Evaluation and prediction of SVM parameters in engineering cost based on random forest hybrid optimization
  86. Museum intelligent warning system based on wireless data module
  87. Optimization design and research of mechatronics based on torque motor control algorithm
  88. Special Issue: Nonlinear Engineering’s significance in Materials Science
  89. Experimental research on the degradation of chemical industrial wastewater by combined hydrodynamic cavitation based on nonlinear dynamic model
  90. Study on low-cycle fatigue life of nickel-based superalloy GH4586 at various temperatures
  91. Some results of solutions to neutral stochastic functional operator-differential equations
  92. Ultrasonic cavitation did not occur in high-pressure CO2 liquid
  93. Research on the performance of a novel type of cemented filler material for coal mine opening and filling
  94. Testing of recycled fine aggregate concrete’s mechanical properties using recycled fine aggregate concrete and research on technology for highway construction
  95. A modified fuzzy TOPSIS approach for the condition assessment of existing bridges
  96. Nonlinear structural and vibration analysis of straddle monorail pantograph under random excitations
  97. Achieving high efficiency and stability in blue OLEDs: Role of wide-gap hosts and emitter interactions
  98. Construction of teaching quality evaluation model of online dance teaching course based on improved PSO-BPNN
  99. Enhanced electrical conductivity and electromagnetic shielding properties of multi-component polymer/graphite nanocomposites prepared by solid-state shear milling
  100. Optimization of thermal characteristics of buried composite phase-change energy storage walls based on nonlinear engineering methods
  101. A higher-performance big data-based movie recommendation system
  102. Nonlinear impact of minimum wage on labor employment in China
  103. Nonlinear comprehensive evaluation method based on information entropy and discrimination optimization
  104. Application of numerical calculation methods in stability analysis of pile foundation under complex foundation conditions
  105. Research on the contribution of shale gas development and utilization in Sichuan Province to carbon peak based on the PSA process
  106. Characteristics of tight oil reservoirs and their impact on seepage flow from a nonlinear engineering perspective
  107. Nonlinear deformation decomposition and mode identification of plane structures via orthogonal theory
  108. Numerical simulation of damage mechanism in rock with cracks impacted by self-excited pulsed jet based on SPH-FEM coupling method: The perspective of nonlinear engineering and materials science
  109. Cross-scale modeling and collaborative optimization of ethanol-catalyzed coupling to produce C4 olefins: Nonlinear modeling and collaborative optimization strategies
  110. Unequal width T-node stress concentration factor analysis of stiffened rectangular steel pipe concrete
  111. Special Issue: Advances in Nonlinear Dynamics and Control
  112. Development of a cognitive blood glucose–insulin control strategy design for a nonlinear diabetic patient model
  113. Big data-based optimized model of building design in the context of rural revitalization
  114. Multi-UAV assisted air-to-ground data collection for ground sensors with unknown positions
  115. Design of urban and rural elderly care public areas integrating person-environment fit theory
  116. Application of lossless signal transmission technology in piano timbre recognition
  117. Application of improved GA in optimizing rural tourism routes
  118. Architectural animation generation system based on AL-GAN algorithm
  119. Advanced sentiment analysis in online shopping: Implementing LSTM models analyzing E-commerce user sentiments
  120. Intelligent recommendation algorithm for piano tracks based on the CNN model
  121. Visualization of large-scale user association feature data based on a nonlinear dimensionality reduction method
  122. Low-carbon economic optimization of microgrid clusters based on an energy interaction operation strategy
  123. Optimization effect of video data extraction and search based on Faster-RCNN hybrid model on intelligent information systems
  124. Construction of image segmentation system combining TC and swarm intelligence algorithm
  125. Particle swarm optimization and fuzzy C-means clustering algorithm for the adhesive layer defect detection
  126. Optimization of student learning status by instructional intervention decision-making techniques incorporating reinforcement learning
  127. Fuzzy model-based stabilization control and state estimation of nonlinear systems
  128. Optimization of distribution network scheduling based on BA and photovoltaic uncertainty
  129. Tai Chi movement segmentation and recognition on the grounds of multi-sensor data fusion and the DBSCAN algorithm
  130. Special Issue: Dynamic Engineering and Control Methods for the Nonlinear Systems - Part III
  131. Generalized numerical RKM method for solving sixth-order fractional partial differential equations
Heruntergeladen am 14.11.2025 von https://www.degruyterbrill.com/document/doi/10.1515/nleng-2025-0152/html
Button zum nach oben scrollen