Advancements in machine learning applications for mineral prospecting and geophysical inversion: A review

Ke Liu; Xinhai Dun; Wen Yang; Yan Zeng; Yihang Guo

doi:10.1515/geo-2025-0765

40% Rabatt

auf Fachbücher bei De Gruyter Brill *

Artikel Open Access

Advancements in machine learning applications for mineral prospecting and geophysical inversion: A review

Ke Liu , Xinhai Dun , Wen Yang , Yan Zeng und Yihang Guo

Veröffentlicht/Copyright: 15. November 2025

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Manuskript einreichen Informationen für Autor*innen

Aus der Zeitschrift Open Geosciences Band 17 Heft 1

Abstract

Mineral deposits are a globally important resource. However, the supply of shallow minerals is close to depletion, forcing exploration activities to expand to deeper areas. Deep exploration has greater challenges compared to shallow exploration, and how to effectively extract the intrinsic connection between exploration data and deep concealment, and how to quickly and accurately locate target zones, remain urgent challenges to be solved. Mineral prediction and geophysical inversion are the core links in mineral exploration, and how to make up for the shortcomings of traditional methods in these links has become an important topic of current research. In the past decade, with the wide application of big data technology in the field of geological prospecting, more and more geological data have provided support for the application of machine learning (ML) in geophysical exploration and mineral prediction. ML overcomes the limitations of traditional methods to a certain extent, such as reducing human subjectivity and improving the ability to mine the laws among geological data, showing great potential. This study summarizes the progress of the application of ML, especially deep learning, in the field of mineral exploration in recent years, focuses on the two key aspects of geophysical inversion and mineral prediction, analyzes the advantages and limitations of the various methods, and makes concluding comments on the future direction of development, with the aim of providing valuable references for the on-site application of ML in mineral exploration and the direction of future research.

Keywords: deep learning; machine learning; mineral exploration; geophysical inversion

1 Introduction

Mineral resources play a key role in economic and industrial development trends worldwide. The reserves of Earth are rich in mineral resources, but the majority of these resources have not been effectively exploited, which is an important problem to be solved in the field of geological exploration. The use of appropriate and reliable exploration technology is crucial for exploring mineral resources, as this strategy can not only improve the efficiency of mineral resources discovery but also play a key supporting role in ensuring the stable supply of national resources. With the continuous development of mineral resource applications, surface mineral resources are being increasingly depleted, and exploration activities are gradually shifting to more complex and difficult-to-reach deep areas. Because of the deep burial depths of these mineral resources, various interference factors are encountered, and the difficulty of obtaining mineral information increases accordingly. In addition, previous exploration work was carried out in shallow areas, and many methods and technologies are not suitable for deep exploration [1,2,3].

In the context of the rapid development of big data, computer-based data processing technology has provided a solid foundation for the wide application of artificial intelligence and deep learning (DL). These technologies have exhibited great potential in many fields because of their unique advantages in terms of automatic feature extraction and complex nonlinear problem processing. Especially in the field of Earth science, the applications of DL technology are receiving increasing attention. Mineral exploration, as part of Earth science, has also gradually been applied to DL. In recent years, many scholars have explored the combination of DL and mineral exploration and applied it to mineral prospect mapping (MPM) and mineral inversion. Compared with traditional prospecting methods, DL can be directly driven by data to find complex nonlinear relationships in mineralization scenarios, so it has specific advantages for addressing complex nonlinear metallogenic mechanisms [4]. For example, Liu et al. used convolutional neural networks (CNNs) to find lead-zinc deposits in ore deposit prediction tasks and used network algorithms to consistently find the distributions of surface elements and the spatial locations of underground ore bodies [5,6,7,8]. Li et al. [9] conducted gravity anomaly inversion via the UNet++ network architecture. Compared with traditional methods, which require large amounts of memory, DL does not need to rely on prior information, which greatly improves its efficiency. Subsequently, improved models based on CNNs and other neural networks were applied to mineral exploration. These results demonstrate the marked effectiveness of DL techniques in terms of simulating complex geological processes, identifying geological anomalies, and revealing hidden patterns. These advances not only promote technological innovation in the field of Earth sciences but also provide strong technical support for improving the accuracy and efficiency of mineral exploration.

Figure 1 illustrates a comparison of the main processes of machine learning (ML) and traditional methods in mineral exploration. Traditional methods rely on the collection and comprehensive analysis of observational data from multiple sources (e.g., satellite data, surface surveys, remotely sensed data, and base station measurements), and summarize the laws and physical formulas of mineralization through scientific reasoning on these data. These laws and formulas reflect the cause-and-effect relationship of the mineralization process and can be used to deduce and predict similar mineralization processes. In contrast, while ML also requires observations from multiple sources, its derivation process no longer relies on explicit theoretical principles or formulas, but instead constructs predictive models through regression and clustering analyses of large datasets. This data-driven approach has a certain advantage in predictive ability, but its internal mechanism often presents a “black box” characteristic. In contrast, ML methods are data-driven while traditional methods are knowledge-driven.

Figure 1

ML and traditional method process.

At present, the applications of DL with respect to exploration mainly include MPM and geophysical data inversion. This article delves into the specific applications of DL in these two areas and summarizes the major advances achieved in this field in recent years. Through the review and analysis of various DL methods, the advantages and disadvantages of these technologies and their potential application value can be understood. The purpose of this article is to provide readers with a comprehensive perspective, focusing on the hot field of DL-based exploration. This not only helps in determining the current research status but also provides guidance and inspiration for future research. Through a comprehensive analysis, this study provides a reference for researchers and practitioners in the related fields to promote the further development and application of this emerging technology in Earth science.

Finally, the structure of this study is as follows: Section 1: Introduction, Section 2: Applications of DL in MPM, Section 3: Applications of DL in geophysical inversion, and Section 4: Summary of the development status of DL in mineral prediction and future development directions.

2 DL in MPM

MPM is a technique for visualizing the potential of mineral resources by synthesizing geological, geophysical, and geochemical data. It delineates and prioritizes regions for mineral exploration, providing a foundation for subsequent detailed investigations of prospective mineral zones. MPM approaches are typically categorized into two types: knowledge-driven and data-driven techniques. Knowledge-driven methods rely on the expertise of specialists and are generally suited for exploring smaller areas. In contrast, data-driven methods are adept at mining beneath substantial overburden, leveraging sophisticated mathematical models to uncover intricate and concealed data relationships. The advent of big data has facilitated the integration of DL into MPM, and form a data-driven technique. DL excels at extracting relationships between mineral deposits and exploration data. It outperforms traditional statistical learning and ML methods, such as the weight of evidence, support vector machine, and random forest, by handling larger datasets, discerning more complex features, and identifying high-dimensional relationships within complex spatial configurations [10].

2.1 Basic applications of DL

The fundamental concept of MPM involves integrating diverse geographic datasets, including geochemical, geophysical, geological, and remote sensing data. These datasets are readily converted into image formats, offering a convenient method for consolidating various mineralized elements with disparate structures. CNNs, which form a class of DL algorithms, are extensively utilized in image and video analyses for tasks such as feature extraction, object detection, and image classification. CNNs excel at extracting features while preserving spatial relationships, which is beneficial for uncovering the spatial correlations among multiple ore-controlling factors in mineral prediction scenarios. For example, Li et al. [11] used a CNN in combination with the chemical data and aeromagnetic data of 25 elements to make intelligent prospecting predictions for copper deposits in the Xixiaokouzi area of Gansu Province, and they identified five prediction areas. Gao et al. [12] used a ResNet CNN to test data obtained from the west of Guangdong Province and built a direct channel using ResNet to closely connect the input data with the output results. This design can significantly reduce the loss of geological information caused by traditional convolutional operations and, to some extent, alleviate the common gradient disappearance or gradient explosion problems encountered by DL networks. According to previous research results, CNNs can attain higher prediction reliability than other methods.

It should be noted that most of the modeling work in previous studies has remained at the 2D model stage due to the constraints of computational resources. However, 2D models have certain inherent limitations compared to 3D models, especially in the accurate positioning of ore bodies. In order to solve this problem, the field of mineral exploration has gradually transitioned from 2D modeling to 3D modeling. With the rapid development of DL technology, it provides new methods and ideas for 3D modeling, which further promotes the progress of mineral exploration technology.

3D MPM more accurately represents the spatial positioning and morphology characteristics of geological formations, including their depths, heights, and planar distributions, thus enhancing the understanding of the 3D structures and occurrence trends of mineral deposits. For example, Li et al. [13] used the geochemical data of 12 elements and related geological data to conduct 3D modeling and then used a 3D CNN to learn the spatial position relationships among the data for 3D deposit prediction tasks. The results revealed the high performance of their approach, and they delineated two prospecting targets in combination with the metallogenic law. Notably, 3D geological modeling is an important part of 3D MPM, and the accuracy and reliability of 3D geological modeling may be affected by the inaccuracy, uneven distributions and diversity of the Earth science datasets used for modeling. As a result, Li et al. [14] improved the modeling scheme based on a 3D CNN. First, a "full lithology inversion" method was applied to verify and modify the 3D geological model, and then 3D MPM prediction was performed by integrating multisource data such as chemical, geological, and geophysical data for 3D CNN model training. Zhang et al. [15] designed a lightweight 3D CNN based on the GoogLeNet structure and combined it with an end-to-end scheme, which replaced the fully connected layer in the traditional CNN with a deconvolution layer. This improvement not only reduced the number of required parameters to avoid many calculations but also sped up the convergence process of the model. Although the applications of 3D CNNs in MPM have been widely recognized, this type of optimized lightweight network has uptime and resource consumption advantages, further increasing the efficiency of 3D CNNs in MPM applications. Table 1 summarizes the neural networks that have been commonly used in recent years, as well as their characteristics.

Table 1

Networks and their characteristics

Year	Author	Neural network name	Network characteristics
2021	Yang et al.	GoogLeNet	CNN uses the Inception architecture to extract features from different convolution kernels
2021	Xu et al.	Deep regression neural network	Training samples with continuous values help improve the fault tolerance of training datasets and reduce the uncertainty of positive samples
2022	Ding et al.	Siamese network	The new category recognition capabilities of twin networks are used to alleviate the problem of insufficient training data
2022	Gao et al.	ResNet	CNN, which establishes a direct channel and combines the input and output, alleviates the problems of disappearing or exploding gradients in convolutional operations and deep networks
2022	Zou et al.	Variational autoencoder (VAE)	A geologically constrained VAE can enhance the probabilities of areas with high mineralization potential and improve the interpretability of the obtained results
2023	Li et al.	Attention-based convolutional neural network	This is a network based on a CNN with the addition of a channel attention mechanism
2023	Zhang et al.	Lightweight CNN	This is a lightweight CNN based on the Inception structure of GoogLeNet
2023	Xie et al.	AEGAN	This is a network that combines the advantages of an autoencoder (AE) and a generative adversarial network (GAN)
2023	Zou et al.	Graph convolutional network	The use of a graph structure to represent data can effectively capture complex and nonlinear spatial relationships
2024	Li et al.	CNN-transformers	Through multinetwork integration, a CNN is combined with a transformer to enhance the ability to effectively capture global features
2024	Liu et al.	Convolutional block attention module (CBAM)	This is a CNN that incorporates channel and spatial attention mechanisms

2.2 Problems and strategies

DL technology has led to impressive outcomes in MPM research and applications. However, several challenges are encountered in practical geological exploration scenarios. The accuracy demanded by geological exploration tasks means that the quality of the utilized data and prediction method are pivotal to the success of mineral resource forecasting. A primary challenge faced when applying DL to mineral exploration is the limited availability of data samples. The high dimensionality and complexity of geological data, coupled with the high costs associated with data collection, restrict the sizes of training datasets. This limitation impacts the training efficacy of the developed model and its ability to generalize. Given the intimate link between data volumes and generalizability, it is reasonable to infer that data scarcity can significantly impair the generalization ability of a model. Strategies for solving the problems related to limited data and poor generalizability are discussed below.

2.2.1 Data scarcity problem

Data scarcity poses a significant challenge to DL, as the training process of a neural network necessitates a sufficiently large dataset to iteratively refine the model parameters. Insufficient data can lead to issues such as overfitting and significant prediction inaccuracies. Given that mineralization events are inherently rare, obtaining a substantial number of ore deposit samples for analysis purposes is often challenging. Consequently, optimizing the performance of a model with a limited dataset is a critical issue in the field.

2.2.1.1 Data enhancement strategies

Data augmentation is a widely used technique in DL that enhances the diversity of data through methods such as image rotation, scaling, and cropping. Combining different DL networks with data augmentation techniques, especially in the case of data scarcity, has yielded good research results [16]. However, geological exploration data possess directional and spatial relationships. For example, rotation operations may change the direction of a fault and lead to absurd geological phenomena and prediction results. This outcome does not comply with the core principle of data enhancement – does not change the semantics of the labels. Some researchers have generated sufficient samples by randomly discarding data enhancement methods on the basis of CNNs, and some have increased the amount of available data by conducting clipping to ensure that geological spatial information is not destroyed. The research results show that these methods have certain effects. Similar data enhancement methods include sliding windows and injected noise, which make small spatial adjustments to the original image data, but an increase in the number of samples may lead to data redundancy and thus overfitting.

Conditional GANs (CGANs) are powerful generative models that can create additional training samples when data are limited, thereby enhancing the generalizability of the constructed model. In geological exploration scenarios, CGANs can learn from various datasets and generate feature samples without distorting geological characteristics or altering their spatial significance. Wu et al. [17] used the idea of a generative model and applied a CGAN for data enhancement to solve the small sample size problem, and they compared the sliding window technique with their new approach; the results showed that the data enhancement strategy based on the CGAN had stronger anti-overfitting capabilities. AE technology has been widely used in the field of geochemistry. The basic idea of an AE is to encode and reconstruct the sample data during the training process and use the induced reconstruction error to optimize the model parameters in reverse. However, owing to the low contribution rates of samples with low probabilities in AE networks, these samples are easily recognized as abnormal samples by trained models. The features learned by an AE may only be the identity representation of the original input, and the essential features of the sample cannot be extracted. Compared with AEs, GANs have a greater ability to generate more realistic and high-quality data. Xie et al. [18] proposed an AEGAN model, which embeds an AE network into an adversarial network structure, and built a stable and high-quality generation model by combining the characteristics of the AE and GAN. This approach compensates for the shortcomings of AEs and GANs.

2.2.1.2 Transfer learning strategies

In addition to data augmentation, transfer learning is another approach for solving problems with few samples. The basic idea of transfer learning is to utilize an existing model (usually one that has been trained on a large-scale dataset) to solve a new problem. This approach accelerates the learning process and improves the performance of the constructed model on the new task, especially if the amount of data available for the new task is small, and it can improve the performance of the model in scenarios with limited resources. Huang et al. [19] proposed a DL framework for selective knowledge transfer that captures features at different scales via inflated convolution and selectively transfers the weight parameters of the source network, which helps improve the performance of the developed model without adding additional computational costs. A masking technique was also introduced to maintain the weights between the associated mineral elements and the main elements. And Li et al. [20] proposed a 3D mineral prediction method based on transfer learning. Compared with traditional methods that can only predict the study area when information is abundant, transfer learning can still predict deep mineral prospects in cases with incomplete data. However, transfer learning usually requires experiments to select relevant and differentiated source domains and specific algorithms that contribute to the training process.

2.2.1.3 Self-supervised and semi-supervised learning strategies

Currently, MPM methods primarily employ supervised learning to train networks. However, a contradiction is present between the scarcity of mineral exploration data and the requirement of a substantial number of high-quality samples for supervised learning. Additionally, the collection and labeling of these samples demand significant human resources. Miao et al. [21] utilized self-supervised comparative learning to complete prediction tasks. Self-supervised learning is a ML approach that does not rely on externally labeled data but instead generates training signals from the structure and properties of the input data themselves. This mineral prediction method initially leverages many unlabeled exploration samples for comparative learning, enabling the extraction and understanding of data features. The learned features are then transferred to a downstream supervised learning network. Since the relationships and essential features have already been identified, the downstream network can make minor adjustments with a limited number of samples. By integrating self-supervised comparative learning, the sample size requirements of the network can be significantly reduced.

In addition, the Siamese network is a commonly used DL framework for similarity metrics and comparison tasks and is often employed for image recognition and text similarity analyses. The network structure is characterized by two identical subnetworks that share a common set of parameters and are able to simultaneously process two input samples and evaluate the similarity between the samples by comparing their feature representations. In this way, the similarities and differences between the samples can be learned to distinguish the similarities and differences between unseen objects. Ding et al. [22] used the Siamese network architecture for mine prediction; because this network can distinguish between different data, it can alleviate the insufficient sample problem encountered in mineral exploration scenarios to a certain extent. The experimental results showed that the developed model achieved better performance than a CNN model alone in finding mines.

2.2.2 Generalizability problem

In geological prediction tasks, the complexity and variability of actual geological conditions often hinder the acquired training sample set from adequately encompassing all potential features. This mismatch can introduce significant bias to the prediction results of the utilized model; this phenomenon is commonly referred to as poor generalization, where models exhibit suboptimal performance on new, previously unseen data.

2.2.2.1 Attention mechanism-based strategies

The use of attention mechanisms in DL allows models to concentrate on the salient aspects of the input data, thereby enhancing the ability to extract pivotal feature channels from intricate datasets. Gao et al. [23] proposed the multiscale feature attention framework DL model, which uses convolution kernels with different sizes to generate more labeled sample data and uses channel and spatial attention mechanisms to assign different weights to geological image feature data. Xu et al. [24] proposed the Geo-DCNN, which is based on a deep CNN, and added a pyramid feature extraction structure and a multiscale classification structure to this model, as well as an attention mechanism for identifying the spatial distribution and channel weights; these adjustments improved the feature extraction ability and recognition efficiency of the model. Liu et al. [25] fused a 3D CNN with a CBAM to generate a 3D CNN-based MPM model with an attention mechanism. This model integrates the integrated attention mechanism of the CBAM into CNNs to enhance the overall performance of the network. The verification results showed that the CNN + CBAM model could focus on the target object and capture high-dimensional ore control characteristics. The model was more accurate in terms of predicting deep drilling data, and the exploration results were highly consistent with the real values or spatial distribution characteristics of known voxels.

The attention mechanism, as a specially designed network structure that mimics human visual perception, enables the model to automatically focus on key features in the input while ignoring irrelevant details. By integrating the spatial attention module and the channel attention module, the mechanism significantly improves the model's ability to identify and emphasize key geological features in mineralization prediction. In the application of multiphysics modeling, the attention model realizes the efficient integration of multi-source data by dynamically capturing the dependencies and spatial heterogeneity of input data. This approach effectively compensates for the shortcomings of the traditional MPM approach, which tends to ignore the correlation between evidence layers when fusing different evidence layers to generate mineral potential maps, resulting in the loss of useful information. The attention mechanism not only enhances the accuracy of prediction, but also provides a more robust and efficient solution for mineralization prediction [26].

2.2.2.2 Multinetwork combinations and multiscale strategies

In the field of DL, CNNs are popular because of their excellent performance in image recognition and processing tasks. However, the traditional CNN architectures have some limitations when addressing certain problems. Specifically, they typically extract features at a single scale, which restricts the capacity of the constructed model to capture multiscale information. This is a critical issue in mineralization studies, where mechanisms must be understood in the context of broader geological patterns. Yang et al. [27] improved upon GoogLeNet by introducing four sets of convolutional kernels to simultaneously extract and integrate features at different scales, thereby significantly improving the accuracy of their model. This approach allows the network to capture information from multiple scales in a single iteration, effectively enriching its feature representations.

The integration of neural networks is also a strategy for enhancing the generalizability of models [28]. Li et al. [29] proposed a novel network architecture that combines a transformer and a CNN: a CNN-transformer model. The transformer architecture is well-known for its ability to address long-range dependency problems, and it especially exhibits significant advantages in global feature extraction cases. However, it is relatively weak at capturing local details. In contrast, CNNs are strong local extraction methods but possess weak global capture capabilities. By combining the strengths of CNNs and transformers to form a CNN-transformer network, the resulting network is able to capture both local and global key information, thus achieving a more comprehensive feature understanding in image processing and recognition tasks. This innovative network design not only compensates for the shortcomings of single models but also opens new possibilities for the application of DL in complex data processing tasks involving geological and mineral exploration.

2.2.3 Wrap-up and discussion

The above describes the difficulties in MPM and the strategies for their solution, and analyzes the characteristics of various neural networks. For the problem of insufficient data sample size, commonly used methods include data augmentation, transfer learning, and self-supervised and semi-supervised learning, and these techniques have improved the model performance to a certain extent. However, for the training of DL models, the quantity and quality of data are still key factors that need to be further improved. On the issue of data, a central challenge is how to transform geologic data into a format suitable for processing by DL models. For electromagnetic data, gravity data, and other data presented in the form of numbers and images, existing technologies have made significant progress. However, in the field of geological exploration, a large amount of data still exist in tabular or linguistic form. These data are usually recorded and organized according to traditional exploration needs, and how to effectively transform them into training data that can be directly utilized by DL models is a direction worthy of in-depth research. The breakthrough in this research direction can not only enrich the dataset in the field of geological and mineral exploration, but also further promote the application and development of DL technology in this field. For the problem of generalization ability, the model performance can be improved by combining different types of networks to integrate their respective advantages. This process requires continuous experimentation and optimization. In addition, the increase in the amount of data will further boost the generalization ability of the model, laying the foundation for the application of the model in a wider range of scenarios [30,31,32,33].

3 DL methods for inversion

Inversion is an important concept in scientific research and engineering technology. It is widely used to recover or reconstruct original physical parameters or models from indirect measurements by inverting known observations to deduce the causes of the obtained results. In the field of mineral exploration, inversion is an important method for inversely determining underground structures and mineralization distributions based on geophysical observation data and analyzing the properties of the underground medium that causes these changes, such as its resistivity, magnetism, and elasticity, through surface or underground geophysical data. These physical properties are often closely related to the distributions of mineral resources [34,35,36,37,38].

Conventional inversion methods, such as OCCAM inversion and sharp boundary inversion (SBI) [39], provide detailed subsurface structure but face several challenges [40,41]. A major issue is their reliance on an initial physical model to reduce the probability of the objective function falling into local minima; however, they are still prone to local minima under fewer constraints [42]. In addition, these methods require a large amount of computational resources for successive iterations, leading to significant time costs. In some cases, the computational cost of classical inversion methods may be lower than that of model training, but their real-time performance does not meet the needs of some projects.

The emergence of DL provides a new direction for inversion. Since DL can approximate any nonlinear complex function, the observed characteristics can fit a function that has never been seen before, which is suitable for solving the inverse problem of mineral exploration, namely, the inversion problem. Moreover, DL-based inversion transfers the time cost to the training stage, and once the network is effectively trained, results can be obtained in a very short amount of time. Thus, this approach is more of a real-time strategy than the traditional method.

3.1 Basic applications of DL

In geophysical exploration scenarios, DL-based inversion predominantly utilizes CNNs [43], which are adept at generating high-dimensional structured outputs. Given that geophysical inversion entails structured output tasks, the selection scheme of CNNs is logical. At present, many inversion model variants are based on CNNs [44,45]. For example, Puzyrev used a fully convolutional network (FCN) for geomagnetic inversion and achieved good performance. Unlike the classic CNN, this FCN uses a deconvolution layer instead of a fully connected layer and classifies each pixel of an image such that each pixel provides detailed category information. One of the most common CNNs is based on the UNet framework and some improved variants of UNet [46,47,48]. As shown in Figure 2, the structure of UNet network consists of downsampling and upsampling paths for extracting high-level semantic features and restoring spatial resolution, respectively. In the downsampling process, UNet extracts multi-scale features of geophysical data through multiple convolution and pooling operations; in the upsampling process, decoding paths are utilized to gradually restore the features to the same resolution as the original data and to fuse different levels of features through jump connections. This structure gives UNet excellent multi-scale feature extraction capability, which is able to capture multi-level information from local details to overall trends in geophysical data, and is crucial for portraying complex geological formations and ore body morphology.

Figure 2

Basic architecture of UNet.

For example, Zhou et al. [49] used UNet to conduct intelligent 3D gravity inversion and combined shallow (low-level) feature maps with deep (high-level) feature maps through quick connections so that the final layer of the network simultaneously utilized feature information acquired from different levels to generate accurate and detailed morphological output results. Li et al. [9] used a UNet++ CNN to invert the 3D densities of gravity anomalies and conducted experiments in Australia. The results showed that both the generalization and antinoise capabilities of the model improved to a certain extent. In comparison, UNet++ has more flexibility and adaptability due to the use of UNet and has a more complex jump connection strategy. This helps to fuse features more efficiently during the decoding process. Liu et al. [43] designed a multihead CNN model based on the UNet framework for electromagnetic inversion to operate in multiscale environments.

Although many of the existing frameworks are based on CNNs, applications involving other networks are also available. The fundamental problem of conducting inversion based on transient electromagnetic data is time series processing. Bang et al. [50] used a recurrent neural network (RNN), which has certain sequence processing advantages, to invert and predict underground resistivity by using electromagnetic data. Compared with CNNs, this model could better process continuous data and consider the correlations between sequences and datasets. In addition, the long short-term memory (LSTM) network is a modified RNN and an improved model with recursive properties that effectively solves the gradient explosion and disappearance problems faced by RNNs in long series training tasks. For example, Fan et al. [51] used this network to conduct transient electromagnetic inversion and obtained good results. Some scholars have also applied different networks to magnetotelluric (MT) inversion to compare and verify the performance of various networks. For example, Rahmani et al. [52,53] compared the performance of the VAE, ResNet and UNet networks and reported that UNet was superior [53,54]. Generally, CNNs occupy a strong position in the current inversion hierarchy and have good performance. However, with working environment changes and the improvements exhibited by exploration methods, the performance of various networks cannot be determined by absolute differences, and it is necessary to reasonably select and improve these methods according to the type of given data and various prerequisites. Table 2 encapsulates the neural networks that have been commonly utilized in recent years, along with their distinctive characteristics.

Table 2

Networks and their characteristics

Year	Author	Neural network name	Network characteristics
2020	Liu et al.	Ersinvnet	A network based on CNN incorporates a depth weighting function and a smoothing constraint into the loss function
2021	Bang et al.	RNN	This method has advantages in terms of processing sequential data, accounting for the correlations of sequences and datasets
2021	Wu et al.	GAN	This is a semi-supervised learning approach that leverages limited labeled data and rich unlabeled data to mitigate sample size issues
2022	Fan et al.	LSTM	This is an improved RNN model that can flexibly adapt to the timing characteristics of network learning tasks
2023	Zhou et al.	UNet	This is an improved four-layer subsampling network based on a CNN
2024	Bai et al.	GMNet	This is an improved joint inversion network based on a CNN
2024	Li et al.	UNet++	This is an improved UNet network that improves the feature transfer and fusion processes by adding more skip connections
2024	Liu et al.	Transformer	The self-attention mechanism of this network is better able to process sequence data, which allows the network to efficiently capture long-distance dependencies in data
2024	Shi et al.	3D UNet	Based on UNet framework, a 3D convolution kernel is used to directly extract 3D spatial patterns from data, making 3D output models more consistent in terms of depth

3.2 Problems and strategies

Similarly, the application of DL to the inversion of geophysical observation data also faces challenges related to data scarcity and limited generalizability.

3.2.1 Data scarcity problem

In geophysical inversion tasks, prediction bias is a prevalent issue that often stems from the paucity of real data. The efficacy of DL-based inversion is contingent upon the complexity and diversity of the input training samples, and the scarcity of authentic data is inescapable. Currently, the primary avenue for acquiring more real samples is to expand the drilling depth and exploration range. However, the additional real data procured through intensified exploration did little in terms of enhancing DL models. Training datasets are typically assembled by creating synthetic underground models followed by performing forward modeling to generate corresponding data. Given the substantial computational demands of forward modeling, it is impractical to indefinitely produce datasets that encompass all possible scenarios, which may lead to significant discrepancies between the constructed synthetic models and actual subsurface conditions, thereby adversely affecting the generalization capabilities of these models [55,56].

3.2.1.1 Joint inversion strategies

An underground model has many physical properties, and different physical inversion methods can infer the target underground structure according to these properties. For example, electromagnetic data can be used to infer subsurface resistivity, and seismic and gravity data can be employed to infer subsurface velocity and density conditions [57,58]. Therefore, more appropriate methods should be selected for different exploration environments [59]. In recent years, with the continuous accumulation of geophysical data, underground information has become increasingly diverse. The inversion of a single physical attribute does not consider the correlations and constraints among multiple underground attributes. To effectively use these physical observation data and give full play to the feature extraction capabilities of DL networks, joint inversion can be adopted [60–63].

As shown in Figure 3, the joint inversion schematic illustrates the process of subsurface model reconstruction through the combined training of signals with different geophysical attributes. As the joint inversion combines the characteristics of multiple observation data, it is able to realize mutual constraints and complementary information between the data in the training process. When the error of the training results is controlled within a reasonable range, the final prediction results are output; otherwise, a new round of iterative training will be entered to further optimize the model performance by adjusting the neural network parameters.

Figure 3

Joint inversion process.

For example, Wang et al. [64] combined seismic data and electromagnetic data to conduct joint inversion. Seismic data can provide a high-resolution underground velocity model, whereas electromagnetic data, although they provide resistivity data with low resolutions, are sensitive to underground anomalies. Therefore, the combination of these two types of data enables them to effectively complement and constrain each other. Bai et al. [65] designed a neural network based on a CNN for the inversion of gravity data and magnetic data, which are two physical properties that are often mutually verified and constrained in traditional methods. Hu et al. [62] proposed a DL-enhanced framework for joint multiphysics field inversion that uses DL networks as a bridge connecting individual inversions to achieve information extraction and fusion. Guo et al. [59] proposed jointly inverting MT data and seismic data in combination with DL constraints. The key to this method is that it attempts to understand the interrelationships between the resistivity and velocity of the Earth's interior and their structural characteristics through DL models. Through this method, researchers are able to more accurately reconstruct the structure of the Earth’s interior. A better understanding of the relationship between resistivity and velocity is thus attained.

Data with different attributes can be combined, and multicomponent electromagnetic data with different sizes can also be used for joint inversion. Magnetic fields have multiscale characteristics; in other words, the magnetic fields observed at different altitudes can represent different scales. Shi et al. [66] combined multitype and multiscale magnetic tensor data and magnetic three-component data for inversion purposes, which helped the researchers conduct debugging between different scales and different types of magnetic data and reduced the number of solutions yielded by the inversion results. Some of the above joint inversion methods were carried out through pure DL, and some were assisted by DL to associate various independent physical observation data. Compared with independent physical attribute inversion, joint inversion has a higher data utilization rate and produces better inversion results. Joint inversion is an inevitable research direction for the future.

3.2.1.2 Self-supervised and semi-supervised learning strategies

As a result, mineral exploration analysts have begun to look at another DL strategy, self-supervised learning, as a way to solve the sample scarcity problem. Self-supervised learning can use unlabeled data to learn, extract the operation rules and main features of tasks, and then fine-tune the constructed model on a small amount of labeled data [67]. In this way, inversion can be carried out in mineral exploration scenarios with samples that lack labels and small amounts of subsurface real data. For example, Li et al. [68] proposed a self-supervised 3D gravity inversion method, which directly learns the data of the target exploration site through a closed loop consisting of an inversion model and a forward model so that the law of the mineral exploration data themselves can be utilized instead of relying on manually labeled training data. This method is suitable not only for 3D gravity inversion but also for other physical inversions. For example, Li et al. [69] subsequently applied this method to 3D magnetic field inversion with slight modifications. In the absence of real data, some researchers have used GANs to reduce the reliance of the existing methods on labeled samples. For example, Qiao et al. [70] designed a CycleGAN based on the inversion of gravity anomalies; this approach can generate data that are close to those of the initial sample with a small amount of prior information so that a small number of training samples can be used to obtain good prediction results. These experiments and studies have proven the feasibility of unsupervised learning and semisupervised learning (GANs are semisupervised learning techniques) for performing inversions when data are lacking [71,72]; this is also a direction for future research.

3.2.2 Generalizability problem

DL-based inversion methods are predominantly data driven strategies, and their performance is largely contingent upon the quality of the input training dataset. Consequently, these methods accurately complete inversion tasks when the given field data are within the range of the training data. However, when the field data deviate from the training dataset, significant errors can occur, and this phenomenon is indicative of limited generalizability.

3.2.2.1 Strategies that add constraints and prior information

A nonpure data-driven inversion method integrates physical constraints with prior information and expert knowledge into a neural network. Such constraints can enable the network to learn the inherent physical laws exhibited by the inversion process during training to more reasonably guide the network parameter updates [42]. For example, if the observed resistivity data have vertical variation characteristics, when the same anomaly is located at different vertical locations, the apparent resistivity data will present large differences, which will lead to ambiguity during the DL inversion process [73–76]. Liu et al. [77] incorporated layered information into their network to address this phenomenon, and close correlations were observed between the data of different strata and the geological anomalies present at the corresponding depths. Therefore, when analyzing apparent resistivity data, formation information can be regarded as an effective supplement to the depth information processed by a neural network. This approach enhances the ability of the utilized model to identify the details of underground structures, thereby improving the accuracy of geological anomaly detection. Ling et al. [78] introduced a combined data-driven and physically driven weighted loss function to make their network consider the depth factor, which alleviated the dependence of the network on the training set range to a certain extent. In addition, since the anomalous area of interest occupies only a small part of the total underground space, a model may be biased to fit more background data (because the amount of background data is larger), resulting in inaccurate anomalous body predictions. In this context, Jiao et al. [42] designed a multiconstrained UNet++ network for magnetic data inversion, in which the Dice coefficient and the previous fit constraint terms were integrated into the loss function. The core idea of the Dice coefficient is to guide the model to focus on the accurate location of the anomalous body source while avoiding the limitation of the local minimum value during the optimization process of the loss function. The constraint item of the previous item is used to calculate the difference between the magnetic field anomaly data obtained after simulating the 3D model predicted by the network in the forward direction and the actual observed data to ensure the consistency of the predicted results with the actual geological conditions. In this way, the model is able to more effectively capture the key features contained in the data, which improves its overall predictive performance. Effectively integrating physical constraints and expert knowledge into DL models and building hybrid architectures based on them by combining multiple networks not only helps to improve the generalization ability of the models, but also significantly improves the inversion accuracy. However, the current difficulty lies in how to effectively integrate this geologic knowledge and reasonably fuse it with DL methods. This area still needs further in-depth research and exploration [79,80].

3.2.2.2 Noise removal strategies

The noise faced by mineral exploration methods can be widely divided into internal noise and external noise. The so-called internal noise is inherent in system hardware and equipment, whereas external noise includes man-made noise, natural noise, and the noise induced by movements [81–83]. The advantages of introducing physical knowledge and constraints, which can improve the degree of generalization of a network to a certain extent and provided it with more antinoise properties than those possessed by a pure data-driven approach, are discussed above. However, it is often difficult to obtain satisfactory results when such complex noise interferes with geophysical measurements [44]. Therefore, denoising inversion data is an important prerequisite for accurately performing inversion. For noise processing, the traditional method involves addressing all types of noise in a step-by-step manner, but these methods require many resources, and some parameters need to be set by relevant personnel. Although all types of noise can be effectively processed, the experience and knowledge reserves of relevant staff have high requirements, and large amounts of computing resources are needed.

In the inversion process of geophysical data, DL models have demonstrated their unique advantages in noise removal. By utilizing a variety of DL architectures, including artificial neural networks, CNNs, and self-coders, we are able to effectively remove noise from exploration data. Compared with traditional methods, these techniques can reduce the influence of manual subjective judgment on the inversion results, thus improving the objectivity and accuracy of data processing [83,84]. Utilizing DL to denoise noisy data before conducting inversion is the most direct method. For example, Zhang et al. [85] designed a deep residual network by first using a small time window to scan an MT time series and then by using Gramian angular field technology to assist the noise detection process so that the input time series could be effectively divided into noise-free and noise-containing data segments to form a training set; this training set was then input into the network for training. Han et al. [82] built two RNNs for denoising: one network for screening out noisy data and another network for denoising. The results of denoising greatly improve the subsequent inversion process. DL-based denoising can also be combined with traditional methods. For example, Wang et al. [86] combined a deep residual denoising CNN with shift-invariant sparse coding (SISC, which is a sparse representation technology that is used to further process and optimize denoised data) for denoising. Specifically, the DL part is responsible for automatically learning noise features from the input data for preliminary denoising. The denoising results are then transmitted to the SISC module for further fine processing to improve the quality of the denoised data.

All of the above denoising processes use DL to denoise geophysical detection data before performing inversion. Another strategy is to inject noise into the given measurement data and use an inversion neural network to automatically denoise and obtain the inversion results. Compared with conducting processing before inversion, this is an end-to-end method with continuity and unity. For example, Liu et al. [44] proposed three different noise injection strategies, including the addition of synthetic relative noise following a Gaussian distribution, a multiwindow filtering scheme, and a data enhancement method based on two strategies to simulate the noise situation in real measurement data. Notably, both conducting noise processing before inversion and the noise injection task implemented during inversion are single-task learning networks in DL. Among the available noise removal methods, a type of network called the multitask learning network can carry out noise removal and other tasks simultaneously [87]. For example, Liu et al. [88] proposed a multitask learning neural network structure based on a shared-layer transformer, which can simultaneously denoise and invert aeromagnetic data and uses shared features to mutually constrain the denoising and inversion tasks. Zhang et al. [89] designed a multitask learning framework based on an improved UNet structure for gravity data inversion and denoising. Through its multitask learning framework, this network can perform data inversion while conducting denoising, effectively improving the accuracy of the inversion results obtained for noisy data. The multitask learning strategy can indeed improve the accuracy of inversion, but it should be noted that the weights selected for different tasks affect the final result. Multiple tasks can bring more constraints and information but also introduce some biases; thus, highly correlated tasks tend to improve, whereas poorly correlated tasks tend to cause this approach to backfire.

3.2.3 Wrap-up and discussion

The application and development of DL in geophysical inversion cannot be separated from the concerns of data quality and model generalization ability. To address these concerns, we improve data quality through joint inversion, noise removal, and self-supervised learning. Meanwhile, the generalization ability of the model is significantly enhanced by introducing physical constraints and a priori information. Although the full-scale application of DL in the field of geological exploration has not yet been realized, the reliability of its model prediction results is expected to gradually improve and converge to an acceptable level within the error range with the advancement of drilling technology and the continuous accumulation of geological data. However, this is not the ultimate goal. As a kind of “black box” technology, the current research mainly focuses on the application level, and its deep working principle still needs to be further explored and revealed [90–93].

4 Discussion and outlook

4.1 Discussion

The above text describes the development and challenges of ML and DL in recent years as well as their solutions. Generally speaking, the traditional geological exploration methods still have a certain advantage in terms of accuracy, but ML can generate results quickly after training, which is a significant improvement in the exploration process with high real-time requirements.

Deep exploration, especially deep mineral exploration, faces challenges that cannot be effectively addressed by traditional techniques, especially in high-dimensional data processing and noise suppression. While traditional exploration methods are often limited by simplifying assumptions and manual feature extraction, ML and DL techniques are better able to process complex, high-dimensional, and noisy geophysical data, automatically extract features and reveal subsurface resource distribution. In particular, CNNs and RNNs excel in noise processing and spatio-temporal data modeling. However, these techniques still face problems such as data scarcity and overfitting, especially DL models that require large amounts of labeled data. Therefore, how to optimize algorithms and datasets to improve the generalization ability of models remains a key challenge in deep exploration.

Although the intermediate process of ML is still a black box, it is expected that ML tools will be elevated from an auxiliary role to a central decision-making tool through the widespread application of interpretable techniques and transparent models in mineral exploration. Future research could focus on exploring the development of more efficient interpretive frameworks, physically constrained models that incorporate domain knowledge, and more accurate quantification methods for uncertainty. In this way, ML techniques can not only provide more accurate predictions but also meet the needs of real-world applications in terms of transparency and trust, providing comprehensive support for in-depth exploration.

In geophysical data inversion, ML and DL techniques have a promising application, but also face a series of challenges. These challenges mainly include noise interference, sparse sampling, high-dimensional nonlinearity, and insufficient data labeling. Meanwhile, geophysical data often come from multiple observation modalities (e.g., seismic, electromagnetic, etc.), and the fusion of multimodal data and the lack of physical constraints further increase the complexity of model application. To cope with these problems, researchers have proposed a variety of innovative approaches to enhance the robustness and practical application value of the models. For example, by developing noise robust models, self-supervised learning, and GANs, the data quality can be effectively improved and the limitation of insufficient data labeling can be alleviated. In addition, the integration of geophysical data from different sources (e.g., seismic and electromagnetic data) using a multimodal DL approach can capture the complementary properties of multi-source information in a more comprehensive way, thereby improving the inversion accuracy. To further enhance the physical consistency of the model, the researchers embedded geophysical laws into the model and utilized a physically guided inversion framework to significantly improve the accuracy and confidence of the predictions.

In addition, geologic data usually show sparse distribution and high acquisition cost, and models relying solely on data-driven models may perform poorly due to insufficient training data. In this context, embedding geological domain knowledge into the model becomes an effective complementary strategy. This approach avoids the learning of erroneous patterns by the model by reflecting the intrinsic laws of geologic mineralization. For example, geological knowledge can be utilized to generate key features (e.g., gravity anomaly gradients, fault distances, etc.) related to the mineralization process as inputs to the ML model; or, based on the experience of geological experts, higher weights can be assigned to specific features to enhance the influence of key variables.

Innovative network combinations and tuning can lead to significant enhancements in several ways. For example, although ML algorithms are usually categorized into two main types: supervised and unsupervised learning, these two are not used independently. Mineral exploration data (e.g., geophysical and geochemical data) are often highly spatially correlated, and geological features of different ore bodies may partially overlap. The traditional K-Means clustering algorithm is difficult to avoid the overlap problem due to its inherent characteristics. In this regard, some researchers used a combination of unsupervised learning clustering algorithm and neural network supervised learning to solve the overlap problem [94]. They use neural networks to train the data in the overlapping region separately, while avoiding interference with the classification of other valid data. In the field of geological and mineral exploration, researchers should flexibly adjust and combine different network structures according to the actual problems. This approach not only can deal with complex problems more effectively, but also provide the possibility of solving the “black box” problem in ML, thus promoting the further development of the field.

At the same time, our thinking cannot stop at improving neural networks and datasets; other developments in geology are likewise contributing to the application of DL in geological exploration. For example, research into the dynamic characteristic of silt and its effect on peak ground acceleration, while not directly improving the quality of the dataset or the structure of the network, provides an important theoretical basis for soil quality analysis [95]. This research has helped to identify areas where variations in the dynamic properties of soils have a greater impact on the likelihood of the presence of mineral resources, allowing for more precise delineation of exploration areas. These geological research advances provide a key theoretical basis for reliability design in geoengineering, helping to more accurately assess the safety of engineering structures under seismic action. In addition, this law provides important a priori knowledge for DL models, which can be used to optimize the initialization parameter settings of the model, so that the model can better adapt to the data distribution at the early stage of training, and improve the training efficiency and accuracy.

In the field of geological exploration, traditional methods and ML have their own advantages and limitations. Overall, traditional methods, based on explicit theoretical derivations and physical formulas, have certain advantages in the interpretability and accuracy of mineralization laws; while ML, relying on large-scale data-driven modeling capabilities, has shown great potential for rapid processing of complex data and real-time prediction. This complementarity not only allows the two to play different roles in the exploration process, but also provides an important opportunity to explore hybrid methods that combine them. Embedding traditional geological rules into a ML framework, especially in the combination of reinforcement learning and DL, allows models to incorporate physical laws and geological knowledge in the decision-making process. This hybrid strategy can find a balance between theoretical interpretability and data-driven predictive power, providing more efficient and accurate solutions for mineral exploration. Future research directions are not limited to a single path of traditional or ML methods, but should aim to develop hybrid methods that incorporate the advantages of both. In practice, traditional geological methods can help screen key features in a dataset and ensure that ML models focus on the most relevant information in mineral exploration. This fusion approach not only improves the efficiency and accuracy of the model, but also effectively avoids the waste of resources and result bias brought by blindly processing big data.

In the face of the complex challenges of geological exploration, relying solely on traditional methods or ML techniques is clearly not enough to cope with all the demands. In the future, through the development of more efficient interpretive frameworks, physical constraint models integrating domain knowledge, and more precise quantification of uncertainty, exploration technology will achieve higher transparency and trust. At the same time, the deepening of international cooperation and the strengthening of multidisciplinary cross-research will help to alleviate data scarcity and technical barriers, and promote a comprehensive revolution in mineral exploration methods. In this context, the in-depth combination of traditional methods and ML technology will become the core driving force for innovation in the exploration field, opening up broader possibilities for geoscientific research and practical applications.

4.2 Outlook

Overall, DL technology has achieved significant breakthroughs in the field of mineral resource exploration and is widely used in many aspects of the exploration process. Nevertheless, several challenges in this field still need to be further explored and solved. Some suggestions and reflections on the path forward are as follows.

4.2.1 Data processing

Geological data are increasingly used in ML, providing a valuable resource for model optimization and research. However, the issues of data quality and consistency remain key challenges in realizing effective applications. The collection of geological observation data usually relies on diverse measurement tools and methods, which leads to possible noise and systematic bias among data. At the same time, differences in the temporal and spatial resolution of different datasets make them difficult to integrate directly, adding to the complexity of data fusion. Especially in extreme environmental regions with harsh collection conditions, the difficulty of data collection is greatly increased, leading to the problem of scarce and incomplete datasets becoming more prominent. Therefore, there is an urgent need to strengthen international cooperation on a global scale to develop unified standards and formats for geological data collection and to promote the sharing of data resources. Such cooperation will not only help to solve the problems of data quality and consistency, but also lay a more solid foundation for the application of ML in the field of geology. Through these efforts, we can expect significant progress in the quality and usability of geological data, thus promoting the further development of related research and applications.

In terms of collaboration and strategy, knowledge-sharing, cross-disciplinary, cross-country mineral exploration databases must be built to link the data and results published by various researchers. This pattern of knowledge sharing provides researchers with a platform for data exchange so that each person's independent data can be linked to form a grand knowledge network.

The fundamental principle of DL is to analyze the relationships among various data to obtain final prediction results. Making good use of these data is an important prerequisite for accurately performing prediction, and the employed data processing scheme often affects the final results. At present, the geological exploration data used for DL training are mostly observational data in image formats, such as electromagnetic data and gravity data, and prospecting information is present in these images. However, in addition to these observation data in image formats, many other types of data, such as literature data and tabular data, have indicative prospecting roles. These data can be used together with observational data to complement each other. How to extract such text and icon data (in what way or in what format) is worth considering, and challenges are present.

Importantly, more data are not always better. While the rapid growth of big data provides a wealth of information resources, it can also introduce spam and invalid data. Therefore, the reliability and accuracy of mineral exploration information sources should be strictly judged and screened. The mixing of junk data not only fails to provide valuable information but also may adversely affect the output prediction results. Therefore, ensuring that the utilized data are reliable and relevant is an important prerequisite for obtaining a highly accurate final prediction.

4.2.2 Fusion of knowledge and data

DL methods have achieved remarkable results in mineral exploration by taking data as inputs and directly producing predictions. However, despite the successes demonstrated by DL models in practice, their internal learning mechanisms and decision-making bases are often opaque, so they are referred to as "black boxes." For this type of model, we should not be satisfied with the results obtained at the application level and should also deeply understand its working principles and realize the transformation from using DL to recognizing DL.

To improve the interpretability of DL models, researchers have begun to explore the integration of physical constraints and prior knowledge into network architectures. This approach, called a physical constraint network, enhances the explanatory power of the developed model in specific application scenarios by introducing domain knowledge. Although this type of research is still in its initial stage and the constraints introduced have been relatively limited, improving the interpretability and understanding of models is highly important. How to choose the correct knowledge and how to effectively integrate it remain the challenges of the current research.

As a structured semantic knowledge base, a knowledge graph expresses entities and their relationships in the real world with a graphical structure, which provides a new perspective for DL models. Recently, some researchers have attempted to combine knowledge graphs with DL, using the relationships and reasoning capabilities of the given graphs to guide the learning processes of neural networks. This method of fusing knowledge graphs not only provides rich semantic information for DL models but also enhances the generalizability and interpretability of these models, demonstrating great development potential and application prospects.

4.2.3 Correct understanding of the end goal

In the applications of DL techniques related to mineral prediction, scholars have introduced a variety of technical strategies for different scenarios, such as the integration of attention mechanisms and the fusion of different network architectures. During this process, the input data are used primarily as a basis for evaluating the performance of the constructed model, whereas the model itself is seen as a tool for achieving prediction goals. At present, many researchers are focusing on the optimization of models and the expansion of sample data.

Most importantly, however, we must recognize that models and data are only means for revealing underlying patterns, not the end goal of research. Our fundamental goal should be to reveal the basic principles of mineralization. DL models should be considered tools for progressively exploring the mechanisms of mineralization and the influences of various factors. In fact, the pursuit of model interpretability is essentially a way to explore metallogenic principles. Therefore, more attention should be given to how to understand the internal mechanism of the mineralization process through DL technology and how to apply these technologies to actual mineral explorations to accurately predict minerals. Research methods with principles as their ultimate goals will help promote the development of mineralization theory and its practical applications.

4.2.4 Intelligent 3D transformation

At present, the applications of DL technology in the field of mineral exploration are focused on specific processes, such as geophysical data inversion, geological mapping, and signal noise reduction. These applications are often independent and separate. How to integrate these parts to form a true sense of fully automatic intelligence is a potential direction for practical applications. The realization of this goal will greatly promote the innovation and progress of mineral exploration technology.

Moreover, the applications of MPM and inversion technology must also gradually change from 2D to 3D. At present, due to data acquisition limitations, research mainly focuses on the 2D level, and 3D geological models are constructed by analyzing multiple 2D profiles. However, the true shapes of underground geological structures are 3D, and 2D data have inherent limitations that can lead to inadequate resolutions and the absence of crucial minute details. With the increase in the amount of available geoscience data and the progress of related technology, future research should pay more attention to 3D data.

4.2.5 Ethics and the environment

The potential impacts of mineral exploration on the environment and communities have always been an important concern. As ML technology continues to advance, its application in mineral exploration offers new possibilities for reducing environmental damage. By combining ML with geologic data analysis, the accuracy of exploration can be significantly improved, thereby reducing the ecological damage caused by blind drilling. This technical means can not only locate the distribution of resources more accurately, but also optimize the exploration process and reduce unnecessary repetitive operations.

In addition, with advanced model optimization methods, ML can effectively integrate multi-source data (e.g., geophysical, geochemical, and remote sensing data) to maximize the extraction of useful information. This data-driven decision-making approach makes exploration activities more accurate and efficient, and improves resource utilization efficiency while reducing environmental impact. In the long run, this technological improvement lays the foundation for realizing sustainable development of mineral resources and provides a new solution for balancing mining activities with environmental protection.

We also always need to be mindful of the ethical and industry environment impacts of new technologies. At the data collection stage, we strictly ensure the accuracy and privacy of data, while emphasizing the transparency of data sharing. Ensure that laws governing the sharing of data and the independent intellectual property rights of each business are improved to ensure healthy competition in collaboration. Factors such as social acceptance, equitable distribution of resources, and community interests should also be considered to form a more comprehensive and rational decision support system. This can help balance economic gains with clear social responsibilities and ensure fairness and transparency in the development process. Mineral extraction often involves impacts on local communities, especially indigenous or poor groups. The introduction of ML techniques should assist in the development of mineral resources and ensure that it does not lead to injustice or deprivation of these groups, and that the economic benefits of mineral extraction are distributed equitably to all parties involved.

5 Conclusion

The swift advancement of artificial intelligence, coupled with the exponential growth in data availability, has ushered in innovative mineral exploration approaches. This is particularly evident in the realms of target prediction and geophysical data interpretation. The increasing applications of DL and ML in geological and mineral exploration scenarios have revolutionized the way we understand and analyze complex metallogenic systems. These advanced computational techniques are capable of discerning patterns within intricate geological data, thereby enhancing our understanding and filling gaps in our knowledge. They also serve to minimize subjective biases and significantly increase the efficiency of exploration efforts.

This study delves into the integration of DL and artificial intelligence within the field of mineral exploration. This study provides an overview of the latest mapping advancements for mineral exploration prospects and the techniques used for inverting geophysical data. The discussion encompasses the unique attributes and challenges associated with various neural network models, as well as the strategies employed to address these issues. The focus of this study extends beyond the technology itself, aiming to shed light on the future trajectory and conceptual evolution of the field. Researchers are encouraged to not only concentrate on refining and innovating neural network models but also consider the underlying principles that govern mineralization processes.

Acknowledgments

This work was partly supported by the WSGRI Engineering & Surveying Incorporation Limited.

Funding information: Authors state no funding involved.
Author contributions: Ke Liu: selected literature, constructed the framework, and drafted the manuscript. Xinhai Dun, Wen Yang, and Yan Zeng: assisted with literature-related work and summarized research findings. Yihang Guo: supervised progress, performed academic revisions, and finalized the manuscript.
Conflict of interest: The authors declare no competing interests.

References

[1] Lv QT, Zhang XP, Tang JT, Jin S, Liang LZ, Niu JJ, et al. Review on the advancement in technology and equipment of geophysical exploration for metallic deposits in China. Chin J Geophys. 2019;62:3629–64. 10.6038/cjg2019N0056.Suche in Google Scholar

[2] Guo Z, Xue G, Liu J, Wu X. Electromagnetic methods for mineral exploration in China: A review. Ore Geol Rev. 2020;118:103357. 10.1016/j.oregeorev.2020.103357.Suche in Google Scholar

[3] Xue G, Chen W, Cheng J, Liu S, Yu J, Lei K, et al. A review of electrical and electromagnetic methods for coal mine exploration in China. IEEE Access. 2019;7:177332–41. 10.1109/access.2019.2951774.Suche in Google Scholar

[4] Yu S, Ma J. Deep learning for geophysics: Current and future trends. Rev Geophys. 2021;59:2021RG000742. 10.1029/2021rg000742.Suche in Google Scholar

[5] Skirrow RG, Murr J, Schofield A, Huston DL, van der Wielen S, Czarnota K, et al. Mapping iron oxide Cu-Au (IOCG) mineral potential in Australia using a knowledge-driven mineral systems-based approach. Ore Geol Rev. 2019;113:103011. 10.1016/j.oregeorev.2019.103011.Suche in Google Scholar

[6] Mousavi SM, Beroza GC. Deep-learning seismology. Science. 2022;377:725. 10.1126/science.abm4470.Suche in Google Scholar PubMed

[7] Sun T, Li H, Wu K, Chen F, Zhu Z, Hu Z. Data-driven predictive modelling of mineral prospectivity using machine learning and deep learning methods: A case study from Southern Jiangxi Province, China. Minerals. 2020;10:102. 10.3390/min10020102.Suche in Google Scholar

[8] Hu Z, Liu S, Hu X, Fu L, Qu J, Wang H, et al. Inversion of magnetic data using deep neural networks. Phys Earth Planet Inter. 2021;311:106653. 10.1016/j.pepi.2021.106653.Suche in Google Scholar

[9] Li B, Lu B, An G, Ju P, Zhu W, Su Z. 3D density inversion of gravity anomalies based on U-Net++. Chin J Geophys. 2024;67(2):752–67. 10.6037/cjg2023Q0924.Suche in Google Scholar

[10] Li Q, Chen G, Luo L. Mineral prospectivity mapping using attention-based convolutional neural network. Ore Geol Rev. 2023;156:105381. 10.1016/j.oregeorev.2023.105381.Suche in Google Scholar

[11] Li Z, Xue L, Ran X, Li Y, Dong G, Li Y, et al. Intelligent prospect prediction method based on convolutional neural network: A case study of copper deposits in Longshou Mountain area. Gansu Prov J Jilin Univ (Earth Sci Ed). 2022;52(2):418–33. 10.13278/j.cnki.jjuese.20210081.Suche in Google Scholar

[12] Gao L, Huang Y, Zhang X, Liu Q, Chen Z. Prediction of prospecting target based on ResNet convolutional neural network. Appl Sci. 2022;12:11433. 10.3390/app122211433.Suche in Google Scholar

[13] Li C, Liu B, Xiao K, Kong Y, Wang L, Tang R, et al. Metallogenic prediction of the Zaozigou gold deposit using 3D geological and geochemical modeling. Minerals. 2023;13:1205. 10.3390/min13091205.Suche in Google Scholar

[14] Li X, Xue C, Chen Y, Yuan F, Li Y, Zheng C, et al. 3D convolutional neural network-based 3D mineral prospectivity modeling for targeting concealed mineralization within Chating area, middle-lower Yangtze River metallogenic belt, China. Ore Geol Rev. 2023;157:105444. 10.1016/j.oregeorev.2023.105444.Suche in Google Scholar

[15] Zhang B, Xu K, Khan U, Li X, Du L, Xu Z. A lightweight convolutional neural network with end-to-end learning for three-dimensional mineral prospectivity modeling: A case study of the Sanhetun Area, Heilongjiang Province, Northeastern China. Ore Geol Rev. 2023;163:105788. 10.1016/j.oregeorev.2023.105788.Suche in Google Scholar

[16] Liu Y, Wang X, Zhang Z, Deng F. Deep learning-based data augmentation for large-scale mineral image recognition and classification. Miner Eng. 2023;204:108411. 10.1016/j.mineng.2023.108411.Suche in Google Scholar

[17] Wu Y, Liu B, Gao Y, Li C, Tang R, Kong Y, et al. Mineral prospecting mapping with conditional generative adversarial network augmented data. Ore Geol Rev. 2023;163:105787. 10.1016/j.oregeorev.2023.105787.Suche in Google Scholar

[18] Xie M, Liu B, Wang L, Li C, Kong Y, Tang R. Auto encoder generative adversarial networks-based mineral prospectivity mapping in Lhasa area, Tibet. J Geochem Explor. 2023;255:107326. 10.1016/j.gexplo.2023.107326.Suche in Google Scholar

[19] Huang Y, Feng Q, Zhang W, Zhang L, Gao L. Prediction of prospecting target based on selective transfer network. Minerals. 2022;12:1112. 10.3390/min12091112.Suche in Google Scholar

[20] Li S, Liu C, Chen J. Mineral prospecting prediction via transfer learning based on geological big data: A case study of Huayuan, Hunan, China. Minerals. 2023;13:504. 10.3390/min13040504.Suche in Google Scholar

[21] Miao Q, Wang P, Zhao H, Li Z, Qi Y, Mao J, et al. Mineral prospectivity prediction based on self-supervised contrastive learning and geochemical data: A case study of the gold deposit in the Malanyu District, Hebei Province, China. Nat Resour Res. 2024;33:1377–91. 10.1007/s11053-024-10335-3.Suche in Google Scholar

[22] Ding K, Xue L, Ran X, Wang J, Yan Q. Siamese network-based prospecting prediction method: A case study from the Au deposit in the Chongli mineral concentrate area in Zhangjiakou, Hebei Province, China. Ore Geol Rev. 2022;148:105024. 10.1016/j.oregeorev.2022.105024.Suche in Google Scholar

[23] Gao L, Wang K, Zhang X, Wang C. Intelligent identification and prediction of mineral resources deposits based on deep learning. Sustainability. 2023;15:10269. 10.3390/su151310269.Suche in Google Scholar

[24] Xu K, Zhao S, Wu C, Zhang S, Yuan L, Yang C, et al. Manganese mineral prospectivity based on deep convolutional neural networks in Songtao of northeastern Guizhou. Earth Sci Inform. 2024;17:1681–97. 10.1007/s12145-024-01224-7.Suche in Google Scholar

[25] Liu Z, Yu S, Deng H, Jiang G, Wang R, Yang X, et al. 3D mineral prospectivity modeling in the Sanshandao goldfield, China, using the convolutional neural network with attention mechanism. Ore Geol Rev. 2024;164:105861. 10.1016/j.oregeorev.2023.105861.Suche in Google Scholar

[26] Yin B, Zuo R, Sun S. Mineral prospectivity mapping using deep self-attention model. Nat Resour Res. 2023;32(1):37–56. 10.1007/s11053-022-10142-8.Suche in Google Scholar

[27] Yang N, Zhang Z, Yang J, Hong Z, Shi J. A convolutional neural network of GoogLeNet applied in mineral prospectivity prediction based on multi-source geoinformation. Nat Resour Res. 2021;30:3905–23. 10.1007/s11053-021-09934-1.Suche in Google Scholar

[28] Ning C, Wu B, Wu B. Transformer and convolutional hybrid neural network for seismic impedance inversion. IEEE J Sel Top Appl Earth Observ Remote Sens. 2024;17:4436–49. 10.1109/jstars.2024.3358610.Suche in Google Scholar

[29] Li C, Xiao K, Sun L, Tang R, Dong X, Qiao B, et al. CNN-transformers for mineral prospectivity mapping in the Maodeng–Baiyinchagan area, Southern Great Xing'an Range. Ore Geol Rev. 2024;167:106007. 10.1016/j.oregeorev.2024.106007.Suche in Google Scholar

[30] Saffarvarkiani E, Passi K, Godwin A. Injury prediction for Canadian mineral exploration using machine learning. 2024 IEEE Conference on Cognitive and Computational Aspects of Situation Management (CogSIMA); 2024. 10.1109/CogSIMA61085.2024.10553924.Suche in Google Scholar

[31] Qiu Q, Tian M, Ma K, Tan YJ, Tao L, Xie Z. A question answering system based on mineral exploration ontology generation: A deep learning methodology. Ore Geol Rev. 2023;153:105294. 10.1016/j.oregeorev.2023.105294.Suche in Google Scholar

[32] Juliani C, Juliani E. Deep learning of terrain morphology and pattern discovery via network-based representational similarity analysis for deep-sea mineral exploration. Ore Geol Rev. 2021;129:103936. 10.1016/j.oregeorev.2020.103936.Suche in Google Scholar

[33] Dong J, Qiu Q, Xie Z, Ma K, Hu A, Wang H. Understanding table content for mineral exploration reports using deep learning and natural language processing. Ore Geol Rev. 2023;156:105383. 10.1016/j.oregeorev.2023.105383.Suche in Google Scholar

[34] Zhao G, Wang X, Liu J, Liang S, Li J. 3D gravity inversion in frequency domain based on mixed L1+L2 norm. Chinese. J Geophys. 2024;67(4):1627–40. 10.6038/cjg2023R0231.Suche in Google Scholar

[35] Xin C, Ye L, Huang H, Liu D, Pang Y, Fu R, et al. Comparison of detection effect between 2D and 3D inversion of high-density resistivity method in Panzhou coal mine disaster area. Chin J Eng Geophys. 2022;19:776–81. 10.3969/j.issn.1672-7940.2022.06.004.Suche in Google Scholar

[36] Zhang L, Zhang G, Liu Y, Fan Z. Deep learning for 3D inversion of gravity data. IEEE Trans Geosci Remote Sens. 2022;60:1–18. 10.1109/tgrs.2021.3110606.Suche in Google Scholar

[37] Huang R, Liu S, Qi R, Zhang Y. Deep learning 3D sparse inversion of gravity data. J Geophys Res Solid Earth. 2021;126:e2021JB022476. 10.1029/2021jb022476.Suche in Google Scholar

[38] Wang R, Wang C, Xu Z, Li H, Zhang Y, Li Q, et al. Prediction method of underground density abnormal body in gravity data based on convolutional neural network. In Proceedings of the 2021 20th International Conference on Ubiquitous Computing and Communications (IUCC/CIT/DSCI/SmartCNS); 2021. p. 190–5. 10.1109/IUCC-CIT-DSCI-smartCNS55181.2021.00041.Suche in Google Scholar

[39] Liao X, Shi Z, Zhang Z, Yan Q, Liu P. 2D inversion of magnetotelluric data using deep learning technology. Acta Geophys. 2022;70:1047–60. 10.1007/s11600-022-00773-z.Suche in Google Scholar

[40] Yuan C, Chen H, Deng J, Yu H, Feng M. Research on magnetotelluric 3D inversion guided by one-dimensional inversion results. Prog Geophys. 2023;38(6):2622–32. 10.6038/Pg2023HH0034.Suche in Google Scholar

[41] Fu X, Tan HD, Dong Y, Wang M. Application of supervised descent method for 2D magnetotelluric inversion and its application. Geophys Geochem Explor. 2024;48(1):175–84. 10.11720/wtyht.2024.1417.Suche in Google Scholar

[42] Jiao J, Zeng X, Liu H, Yu P, Lin T, Zhou S. Three-dimension inversion of magnetic data based on multi-constraint UNet++. Appl Sci. 2024;14:5730. 10.3390/app14135730.Suche in Google Scholar

[43] Liu X, Craven JA, Tschirhart V. Retrieval of subsurface resistivity from magnetotelluric data using a deep-learning-based inversion technique. Minerals. 2023;13:461. 10.3390/min13040461.Suche in Google Scholar

[44] Liu W, Wang H, Xi Z, Wang L. Physics-informed deep learning inversion with application to noisy magnetotelluric measurements. Remote Sens. 2023;16:62. 10.3390/rs16010062.Suche in Google Scholar

[45] Cheng Y, Fu LY. Nonlinear seismic inversion by physics-informed Caianiello convolutional neural networks for overpressure prediction of source rocks in the offshore Xihu depression, East China. J Pet Sci Eng. 2022;215:110654. 10.1016/j.petrol.2022.110654.Suche in Google Scholar

[46] Guo J, Li Y, Jessell MW, Giraud J, Li C, Wu L, et al. 3D geological structure inversion from Noddy-generated magnetic data using deep learning methods. Comput Geosci. 2021;149:104701. 10.1016/j.cageo.2021.104701.Suche in Google Scholar

[47] Puzyrev V. Deep learning electromagnetic inversion with convolutional neural networks. Geophys J Int. 2019;218:817–32. 10.1093/gji/ggz204.Suche in Google Scholar

[48] Li Y, Han LG, Zhou S, Lin T. Gravity data density interface inversion based on U-Net deep learning network. Chin J Geophys. 2023;66(1):401–11. 10.6037/cjg2022Q0362.Suche in Google Scholar

[49] Zhou X, Chen Z, Lv Y, Wang S. 3D gravity intelligent inversion by U-Net network with data augmentation. IEEE Trans Geosci Remote Sens. 2023;61:1–13. 10.1109/tgrs.2023.3241310.Suche in Google Scholar

[50] Bang M, Oh S, Noh K, Seol SJ, Byun J. Imaging subsurface orebodies with airborne electromagnetic data using a recurrent neural network. Geophysics. 2021;86:E407–19. 10.1190/GEO2020-0871.1.Suche in Google Scholar

[51] Fan T, Xue GQ, Li P, Yan B, Bao L, Song JQ, et al. TEM real-time inversion based on long-short term memory network. Chin J Geophys. 2022;65(9):3650–63. 10.6038/cjg2022P0572.Suche in Google Scholar

[52] Rahmani Jevinani M, Habibian Dehkordi B, Ferguson IJ, Rohban MH. Deep learning-based 1-D magnetotelluric inversion: Performance comparison of architectures. Earth Sci Inf. 2024;17:1663–77. 10.1007/s12145-024-01233-6.Suche in Google Scholar

[53] Liu Z, Chen H, Ren Z, Tang J, Xu Z, Chen Y, et al. Deep learning audio magnetotellurics inversion using residual-based deep convolution neural network. J Appl Geophys. 2021;188:104309. 10.1016/j.jappgeo.2021.104309.Suche in Google Scholar

[54] Deng F, Hu J, Wang X, Yu S, Zhang B, Li S, et al. Magnetotelluric deep learning forward modeling and its application in inversion. Remote Sens. 2023;15:3667. 10.3390/rs15143667.Suche in Google Scholar

[55] Wang X, Jiang P, Deng F, Wang S, Yang R, Yuan C. Three-dimensional magnetotelluric forward modeling through deep learning. IEEE Trans Geosci Remote Sens. 2024;62:1–13. 10.1109/tgrs.2024.3401587.Suche in Google Scholar

[56] Zhang ZH, Liao XL, Cao YY, Hou ZL, Fan XT, Xu ZX, et al. Joint gravity and gravity gradient inversion based on deep learning. Chin J Geophys. 2021;64:1435–52. 10.6083/cjg2021O0151.Suche in Google Scholar

[57] Jia Z, Li Y, Wang Y, Li Y, Jin S, Li Y, et al. Deep learning for 3-D magnetic inversion. IEEE Trans Geosci Remote Sens. 2023;61:1–10. 10.1109/tgrs.2023.3253888.Suche in Google Scholar

[58] Zhou Z, Wang J, Meng X, Fang Y. 3-D joint inversion of gravity and magnetic data based on a deep learning network with automatic recognition of structural similarity. IEEE Trans Geosci Remote Sens. 2023;61:1–16. 10.1109/tgrs.2023.3330988.Suche in Google Scholar

[59] Guo R, Yao HM, Li M, Ng MKP, Jiang L, Abubakar A. Joint inversion of audio-magnetotelluric and seismic travel time data with deep learning constraint. IEEE Trans Geosci Remote Sens. 2021;59:7982–95. 10.1109/tgrs.2020.3032743.Suche in Google Scholar

[60] Hu Y, Wei X, Wu X, Sun J, Huang Y, Chen J. Three-dimensional cooperative inversion of airborne magnetic and gravity gradient data using deep-learning techniques. Geophysics. 2024;89:WB67–79. 10.1190/GEO2023-0225.1.Suche in Google Scholar

[61] Jiao J, Dong S, Zhou S, Zeng Z, Lin T. 3-D gravity and magnetic joint inversion based on deep learning combined with measurement data constraint. IEEE Trans Geosci Remote Sens. 2024;62:1–14. 10.1109/tgrs.2023.3339303.Suche in Google Scholar

[62] Hu Y, Wei X, Wu X, Sun J, Chen J, Huang Y, et al. A deep learning-enhanced framework for multiphysics joint inversion. Geophysics. 2023;88:K13–26. 10.1190/geo2021-0589.1.Suche in Google Scholar

[63] Hu Y, Wei X, Wu X, Sun J, Huang Y, Chen J. Deep learning enhanced 3D joint inversion. In Proceedings of the IEEE URSI 2023; 2023. 10.23919/USNC-URSI54200.2023.10289413.Suche in Google Scholar

[64] Wang Y, Jia Z, Lu W. SEMI Net: Seismic-electromagnetic joint inversion network. IEEE Trans Geosci Remote Sens. 2024;62:1–15. 10.1109/tgrs.2024.3365818.Suche in Google Scholar

[65] Bai Z, Wang Y, Wang C, Yu C, Lukyanenko D, Stepanova I, et al. Joint gravity and magnetic inversion using CNNs’ deep learning. Remote Sens. 2024;16:7115. 10.3390/rs16071115.Suche in Google Scholar

[66] Shi X, Jia Z, Geng H, Liu S, Li Y. Deep learning inversion for multivariate magnetic data. IEEE Trans Geosci Remote Sens. 2024;62:1–10. 10.1109/tgrs.2023.3337413.Suche in Google Scholar

[67] Wu B, Meng D, Zhao H. Semi-supervised learning for seismic impedance inversion using generative adversarial networks. Remote Sens. 2021;13:509. 10.3390/rs13050909.Suche in Google Scholar

[68] Li Y, Jia Z, Lu W. Self-supervised deep learning for 3D gravity inversion. IEEE Trans Geosci Remote Sens. 2022;60:1–11. 10.1109/tgrs.2022.3225449.Suche in Google Scholar

[69] Li Y, Jia Z, Lu W, Song C. Self-supervised knowledge-driven method for 3-D magnetic inversion. IEEE Geosci Remote Sens Lett. 2024;21:1–5. 10.1109/lgrs.2024.3360307.Suche in Google Scholar

[70] Qiao SB, Li HP, Qi R, Zhang YJ, Xie SM. 3D gravity inversion using cycle-consistent generative adversarial network. Appl Geophys. 2024;21(4):1–13. 10.1007/s11770-024-1096-5.Suche in Google Scholar

[71] Marano GC, Rosso MM, Aloisio A, Cirrincione G. Generative adversarial networks review in earthquake-related engineering fields. Bull Earthq Eng. 2023;22:3511–62. 10.1007/s10518-023-01645-7.Suche in Google Scholar

[72] Meng D, Wu B, Liu N, Chen W. Semi-supervised deep learning seismic impedance inversion using generative adversarial networks. In Proceedings of the IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium; 2020. p. 1393–6. 10.1109/IGARSS39084.2020.9323119.Suche in Google Scholar

[73] Zhou H, Guo R, Li M, Yang F, Xu S, Chen M, et al. An intelligent MT data inversion method with seismic attribute enhancement. IEEE Trans Geosci Remote Sens. 2023;61:1–14. 10.1109/tgrs.2023.3245589.Suche in Google Scholar

[74] Liu W, Wang H, Xi Z, Zhang R. Smooth deep learning magnetotelluric inversion based on physics-informed Swin Transformer and multiwindow Savitzky–Golay filter. IEEE Trans Geosci Remote Sens. 2023;61:1–14. 10.1109/tgrs.2023.3304313.Suche in Google Scholar

[75] Wu G, Wei Y, Dong S, Zhang T, Yang C, Qin L, et al. Improved gravity inversion method based on deep learning with physical constraint and its application to the airborne gravity data in east Antarctica. Remote Sens. 2023;15:2–19. 10.3390/rs15204933.Suche in Google Scholar

[76] Wang Y, Wang Q, Lu W, Li H. Physics-constrained seismic impedance inversion based on deep learning. IEEE Geosci Remote Sens Lett. 2022;19:1–5. 10.1109/lgrs.2021.3072132.Suche in Google Scholar

[77] Liu B, Guo Q, Li S, Liu B, Ren Y, Pang Y, et al. Deep learning inversion of electrical resistivity data. IEEE Trans Geosci Remote Sens. 2020;58:5715–28. 10.1109/tgrs.2020.2969040.Suche in Google Scholar

[78] Ling W, Pan K, Zhang J, He D, Zhong X, Ren Z, et al. A 3-D magnetotelluric inversion method based on the joint data-driven and physics-driven deep learning technology. IEEE Trans Geosci Remote Sens. 2024;62:1–13. 10.1109/tgrs.2024.3369179.Suche in Google Scholar

[79] Su Y, Cao D, Liu S, Hou Z, Feng J. Seismic impedance inversion based on deep learning with geophysical constraints. Geoenergy Sci Eng. 2023;225:1–10. 10.1016/j.geoen.2023.211671.Suche in Google Scholar

[80] Zuo R, Xu Y. A physically constrained hybrid deep learning model to mine a geochemical data cube in support of mineral exploration. Comput Geosci. 2024;182:105490. 10.1016/j.cageo.2023.105490.Suche in Google Scholar

[81] Zhong T, Chen Y, Lu SP, Dong XT, Li Y, Yang BJ. New technology of seismic random noise suppression based on DMFF-Net. Chin J Geophys. 2022;65(11):4418–32. 10.6038/cjg2022P0613.Suche in Google Scholar

[82] Han Y, An ZG, Di QY, Wang ZX, Kang LL. Research on noise suppression of magnetotelluric signals based on recurrent neural networks. Chin J Geophys. 2023;66(10):4317–31. 10.6038/cjg2023R0123.Suche in Google Scholar

[83] Wu X, Xue G, He Y, Xue J. Removal of multisource noise in airborne electromagnetic data based on deep learning. Geophysics. 2020;85(6):B207–22. 10.1190/geo2019-0555.1.Suche in Google Scholar

[84] Caesary D, Cho A, Yu H, Joung I, Song S, Cho S, et al. Introduction to geophysical exploration data denoising using deep learning. Geophys Geophys Explor. 2020;23(3):117–30. 10.7582/GGE.2020.23.3.00117.Suche in Google Scholar

[85] Zhang L, Ren Z, Xiao X, Tang J, Li G. Identification and suppression of magnetotelluric noise via a deep residual network. Minerals. 2022;12:1–21. 10.3390/min12060766.Suche in Google Scholar

[86] Wang X, Bai X, Li G, Sun L, Ye H, Tong T. Noise attenuation for CSEM data via deep residual denoising convolutional neural network and shift-invariant sparse coding. Remote Sens. 2023;15:1–19. 10.3390/rs15184456.Suche in Google Scholar

[87] Lv M, Zhang Y, Liu S. Fast forward approximation and multitask inversion of gravity anomaly based on UNet3+. Geophys J Int. 2023;234:972–84. 10.1093/gji/ggad106.Suche in Google Scholar

[88] Liu Y, Zhang Y, Guo C, Zhang S, Kang H, Zhao Q. A multi-task learning network based on the Transformer network for airborne electromagnetic detection imaging and denoising. J Geophys Eng. 2024;21:1056–70. 10.1093/jge/gxae054.Suche in Google Scholar

[89] Zhang L, Zhang G, Fan Z, Ma J. A multitask deep learning for simultaneous denoising and inversion of 3-D gravity data. IEEE Trans Geosci Remote Sens. 2022;60:1–17. 10.1109/tgrs.2022.3220437.Suche in Google Scholar

[90] Li Q, Chen G, Wang D. Mineral prospectivity mapping using semi-supervised machine learning. Math Geosci. 2025;57:275–305. 10.1007/s11004-024-10161-6.Suche in Google Scholar

[91] Yang F, Zuo R, Xiong Y, Xu Y, Nie J, Zhang G. Dual-branch convolutional neural network and its post hoc interpretability for mapping mineral prospectivity. Math Geosci. 2024;56:1487–515. 10.1007/s11004-024-10137-6.Suche in Google Scholar

[92] Wan T, Li T, Kang X, Zhang R. The application of the supervised descent method in the inversion of 3D direct current resistivity data. Minerals. 2024;14:1095. 10.3390/min14111095.Suche in Google Scholar

[93] Zhang C, Zuo R. Incorporating geological knowledge into deep learning to enhance geochemical anomaly identification related to mineralization and interpretability. Math Geosci. 2024;56:1233–54. 10.1007/s11004-023-10133-2.Suche in Google Scholar

[94] Cui C, Thammano A. A novel classification model based on hybrid K-means and neural network for classification problems. HighTech Innov J. 2024;5(3):716–28. 10.28991/HIJ-2024-05-03-012.Suche in Google Scholar

[95] Li B, Li X. Study on the test error of silt dynamic characteristic and its influence on the peak ground acceleration. HighTech Innov J. 2023;4(1):65–74. 10.28991/HIJ-2023-04-01-05.Suche in Google Scholar

Received: 2024-10-17

Revised: 2024-12-27

Accepted: 2025-01-20

Published Online: 2025-11-15

This work is licensed under the Creative Commons Attribution 4.0 International License.

Artikel in diesem Heft

https://doi.org/10.1515/geo-2025-0765

Schlagwörter für diesen Artikel

deep learning; machine learning; mineral exploration; geophysical inversion

Creative Commons

BY 4.0