Anchor-controlled generative adversarial network for high-fidelity electromagnetic and structurally diverse metasurface design

Yunhui Zeng; Hongkun Cao; Xin Jin

doi:10.1515/nanoph-2025-0210

40% Rabatt

auf Fachbücher bei De Gruyter Brill *

Artikel Open Access

Anchor-controlled generative adversarial network for high-fidelity electromagnetic and structurally diverse metasurface design

Yunhui Zeng , Hongkun Cao und Xin Jin

Veröffentlicht/Copyright: 15. Juli 2025

Veröffentlicht von

Veröffentlichen auch Sie bei De Gruyter Brill

Manuskript einreichen Informationen für Autor*innen

Aus der Zeitschrift Nanophotonics Band 14 Heft 17

Abstract

Metasurfaces, capable of manipulating light at subwavelength scales, hold great potential for advancing optoelectronic applications. Generative models, particularly Generative Adversarial Networks (GANs), offer a promising approach for metasurface inverse design by efficiently navigating complex design spaces and capturing underlying data patterns. However, existing generative models struggle to achieve high electromagnetic fidelity and structural diversity. These challenges arise from the lack of explicit electromagnetic constraints during training, which hinders accurate structure-to-electromagnetic mapping, and the absence of mechanisms to handle one-to-many mappings dilemma, resulting in insufficient structural diversity. To address these issues, we propose the Anchor-controlled Generative Adversarial Network (AcGAN), a novel framework that improves both electromagnetic fidelity and structural diversity. To achieve high electromagnetic fidelity, AcGAN proposes the Spectral Overlap Coefficient (SOC) for precise spectral fidelity assessment and develops AnchorNet, which provides real-time physics-guided feedback on electromagnetic performance to refine the structure-to-electromagnetic mapping. To enhance structural diversity, AcGAN incorporates a cluster-guided controller that refines input processing and ensures multilevel spectral integration, guiding the generation process to explore multiple configurations. Empirical analysis shows that AcGAN reduces the Mean Squared Error (MSE) by 73 % compared to current state-of-the-art and significantly expands the design space to generate diverse metasurface architectures that meet precise spectral demands.

Keywords: metasurface design; generative model; high-fidelity; diverse design

1 Introduction

Metasurfaces constructed of two-dimensional artificial material structures at subwavelength scales have garnered significant attention for unparalleled ability to manipulate intrinsic properties of light, including spectrum [1], [2], amplitude [3], [4], phase [5], [6], polarization [7], [8], and wavefront [9], [10]. This extraordinary capability arises from the vast design flexibility afforded by the spatial and material configurations of meta-atoms, enabling functionalities far beyond those of natural materials. Leveraging this design flexibility, recent advancements in metasurface design have led to the realization of novel functionalities such as light field imaging [11], holographic display [12], perfect absorption [13], vortex beam generation [14], optical encryption [15], and optical communication [16], showcasing the potential of metasurfaces to revolutionize optical technologies.

Even though modern numerical methods allow for the calculation of the electromagnetic (EM) response of complex structures and diverse materials, the design of metasurfaces is still challenging owing to the nonintuitive and nonunique relationship between physical structures, material properties, and their EM responses [17]. Traditionally, metasurface design relies on physics-inspired methods and human expertise, including insights from analytical models, experience from previous designs, and scientific intuition. Techniques such as resonant phase control [18], propagation phase control [19], and geometric phase control [20], used independently or collectively [7], [21], are pivotal for precise phase response tailoring. However, these methods constrain the design space, limiting innovation primarily to meta-atom configurations, which highlights a related shortcoming: the fundamental theory underpinning metasurfaces is not yet well-established [22]. As design complexity increases, the traditional expert-knowledge–based paradigm becomes less effective [23]. Furthermore, the widely used trial-and-error method, combined with extensive scanning, is constrained by its limited optimization space and the time-consuming process of solving Maxwell’s equations [24].

Deep learning (DL), a subset of artificial intelligence (AI), has emerged as a transformative tool for metasurface design, effectively addressing the challenges posed by traditional methods [25], [26], [27], [28]. By mapping the complex relationships between metasurface parameters and their EM responses, DL facilitates direct design processes while significantly reducing reliance on computationally expensive simulations [29], [30], [31], [32]. Among various DL-driven approaches, Generative Adversarial Networks (GANs) [33], [34], [35], [36] stand out due to their capacity to learn intricate data distributions and generate diverse metasurface structures. This capability not only alleviates the limitations inherent in expert-knowledge–based paradigms – such as their restricted design space and dependence on trial-and-error methodologies – but also paves the way for enhanced design flexibility and innovation in metasurface design. However, GANs often generate outputs that resemble training data without precise control over specific characteristics. To address this, Conditional Generative Adversarial Networks (CGANs) [37], [38], [39], [40] address this limitation by introducing conditional inputs, enabling the generation of designs that can align more closely with predefined EM characteristics [41]. Unlike database retrieval methods that are limited to selecting existing structures, GANs learn the underlying data distribution and sample from a continuous latent space, enabling them to generate novel combinations of metasurface parameters that may not explicitly exist in the original dataset [42], [43]. Nonetheless, GAN-based methods still have two critical challenges to address: i) Limited Electromagnetic Fidelity: GAN-based methods typically focus on generating visually accurate structures but often lack explicit constraints to ensure high EM fidelity. This deficiency stems from the absence of direct feedback on EM performance during training, making it difficult for models to learn the complex mapping between metasurfaces and their EM responses. As a result, generated designs may align with the visual characteristics of the dataset but fail to meet the precise EM requirements. ii) Limited Structural Diversity: Metasurface design involves a one-to-many mapping dilemma, where multiple structures can produce the same EM response. However, GAN-based methods often generate solutions that resemble the most frequently observed configurations in the training dataset. This limitation arises from the lack of mechanisms that facilitate the exploration of diverse configurations capable of achieving the same EM targets, thus limiting the potential diversity of the generated metasurfaces. The resulting lack of structural diversity critically impairs the adaptability of designs to a range of application requirements and manufacturing constraints. This deficiency may obstruct the identification of optimal structures that could enhance performance or address specific functional demands, ultimately undermining the robustness and applicability of metasurfaces.

In this study, we focus on the complex task of designing free-form metasurface filters using AI to control and enhance spectral absorption, as demonstrated in Figure 1. To navigate the complex inverse design problem that balances both material and structural properties, we utilize an encoding strategy where key metasurface parameters – including refractive indices, plasma frequencies, and resonator geometries – are mapped into discrete “RGB” channels of color images, capturing a broad design space. To achieve high EM fidelity, our proposed Anchor-controlled Generative Adversarial Network (AcGAN) proposes the Spectral Overlap Coefficient (SOC), a novel metric developed to evaluate the alignment between the generated and target spectral responses, thereby ensuring precise control over the spectral characteristics of the metasurfaces. Furthermore, we develop AnchorNet, a predictive model embedded in the generative framework provides real-time feedback on EM performance during training. This feedback mechanism significantly improves the model’s ability to optimize the complex structure-to-EM mapping. For enhancing structural diversity, AcGAN proposes a cluster-guided controller that promotes the exploration of multiple valid configurations for any given spectral target, effectively addressing the one-to-many mapping dilemma inherent in metasurface design. Combined with our dynamic loss function, this approach shifts the focus from initial data-driven learning to a more balanced optimization of both spectral fidelity and structural diversity. These collective advancements empower AcGAN to not only bridge the gap between visual resemblance and functional EM performance but also establish a robust framework for designing metasurfaces that meet stringent requirements for high EM fidelity and structural diversity in advanced optoelectronic applications. Empirical analysis demonstrates that AcGAN significantly reduces the Mean Squared Error (MSE) by 73 % compared to current state-of-the-art GAN methods and markedly expands the design space to generate diverse metasurface architectures that meet precise spectral demands.

Figure 1:

AI-enhanced free-form absorptive metasurface inverse design schematic, illustrating the application of AI to optimize material (indicated by color variations), thickness (represented by height differences), and structural configuration (depicted through column arrangement) to precisely control spectral absorption profiles.

2 Methods

In this study, we address the complex task of designing free-form metasurface by proposing an advanced AI-driven framework named AcGAN, aiming to enhance both EM fidelity and structural diversity in metasurface designs. Figure 2 outlines the AcGAN architecture, which includes four essential components: controller, generator, discriminator, and AnchorNet, each uniquely contributing to enhance EM fidelity and structural diversity of the generated metasurfaces. The process starts with the pretrained AnchorNet predicting spectral properties, laying the groundwork for adversarial training. Initially, the discriminator is calibrated using precomputed control vectors that replace raw spectral data with structured inputs, streamlining the evaluation process. These inputs allow the discriminator to accurately assess the authenticity and spectral fidelity of the designs, ensuring they align with predefined criteria. Training then shifts focus to the generator, which is optimized through a specialized adversarial loss function to enhance its ability to produce structurally diverse and realistic metasurface designs. The training is iterative, with the generator and discriminator being refined alternately to ensure the designs meet the targeted spectral characteristics effectively. Detailed pseudocode is provided in Supporting information S1.

Figure 2:

The architecture of AcGAN for metasurface design. The controller manages data clustering and processing, enhancing the structural diversity of designs by enabling the generator to explore various configurations. The generator creates metasurface designs based on these organized data inputs. The discriminator assesses the designs for authenticity and spectral fidelity, ensuring they meet predefined performance standards. AnchorNet guides both the generator and discriminator, providing real-time feedback to improve electromagnetic fidelity.

Building on the AcGAN framework, our approach further explores the engineering of metasurface parameters within a defined design space M ∈ R d to align the spectral response S ∈ R m of the metasurface with specified target spectra. Employing EM simulation tools such as Lumerical FDTD for forward mapping F : M → S, we obtain reliable predictions of spectral responses for given metasurface. The crux of the inverse design problem is to determine an optimal design parameters m* that minimizes the discrepancy between the spectrum s _g of generated metasurface and the desired target spectrum s _t, represented by the optimization problem:

(1) m * = arg min m ∈ M Loss s t , s g

where s = F m , and Loss quantifies the distance between s _t and s _g. To ensure high electromagnetic fidelity in metasurface designs, we propose a novel spectral similarity metric named Spectral Overlap Coefficient (SOC), defined as follows:

(2) S O C = 1 − ∑ min s t , s g ∑ max s t , s g

where min s t , s g and max s t , s g are computed element-wise across the spectral vectors s _t and s _g. This formula quantitatively measures the extent of spectral overlap, providing a direct assessment of similarity that is especially useful for complex spectral features such as resonance peaks and specific absorption bands. An SOC value nearing zero signifies high similarity, which offers a direct and adaptable measure for spectral congruence, making it invaluable for evaluating and optimizing metasurface designs across diverse spectroscopic applications. SOC directly quantifies the extent of spectral overlap, offering a more granular and accurate measure of spectral congruence. This is particularly advantageous as it ensures a comprehensive alignment of all spectral features, critically assessing the match between peaks and troughs within the spectra. The adoption of SOC transforms our ability to design metasurfaces with high precision, aligning closely with specified EM requirements and surpassing the limitations of conventional design methodologies, as detailed in Supporting information S2. Additionally, the framework seeks to maximize the differences both between M _t and M _g, as well as among multiple M _g configurations, thereby promoting greater diversity in the metasurface designs.

In our inverse design framework, two types of metasurfaces are scrutinized, metal–insulator–metal (MIM) constructs characterized by broad Lorentzian absorption spectra arising from plasmonic resonances at the metal–dielectric interface [44]. These spectra are pivotal for applications demanding robust thermal emissivity and efficient photothermal energy conversion. Hybrid dielectric metasurfaces, wherein subwavelength cavity resonances yield Fano-resonant profiles, offering sharply defined spectral features optimal for discerning optical sensor technologies [45]. This bifurcation presents a complex challenge, necessitating a modeling approach capable of accommodating the significant spectral divergences inherent to each metasurface type. We encode two types of metasurfaces into standardized RGB images with dimensions of C × H × W, where C is the number of color channels, and H and W are the image height and width, respectively. For MIM structures, the red channel encodes the plasma frequency (ω _p) of the metallic resonators, the blue channel records the dielectric layer thickness (d, in nanometers), and the green channel is set to zero. For hybrid dielectric structures, the green channel encodes the dielectric refractive index (n), the blue channel again records the dielectric thickness (d), and the red channel is set to zero. In addition, the spectral response ranging from 4 to 12 μm is uniformly discretized into N points. This encoding strategy standardizes diverse metasurface configurations into a unified RGB representation, facilitating the training of AcGAN and enabling systematic learning across both metasurface types, as illustrated in Figure 3.

$Figure 3: Schematic diagram of coding and decoding process of AcGAN. (a) and (e), the MIM structure (3.2 × 3.2 μm2 unit cell) includes a 0.2 μm metal layer, a variable-height Al2O3 dielectric layer, and a freeform resonator of 0.1 μm height, while the hybrid structure (7.5 × 7.5 μm2 unit cell) features a 0.2 μm metal layer with a dielectric freeform resonator of unspecified height. (b) and (f) show the 3D-rendered metasurfaces: the MIM resonators are composed of gold, silver, or aluminum with plasma frequencies of 1.91 PHz, 2.32 PHz, or 3.57 PHz, and dielectric layers of 100, 200, or 300 nm thickness; the hybrid resonators are made of ZnSe, Si, or Ge with refractive indices of 2.41, 3.42, or 4.01, and dielectric layers of 500, 750, or 950 nm thickness. In (c) and (g), the structures are encoded into 64 × 64 × 3 RGB images. For MIM structures, the red channel encodes both the metal material properties and resonator geometry, with the green channel set to zero. For hybrid dielectric structures, the green channel encodes both the dielectric material properties and geometry, with the red channel set to zero. In both cases, the blue channel encodes the thickness of the dielectric layer. (d) and (h) present the final decoded images representing the letters “A” (MIM) and “I” (Hybrid).$

Figure 3:

Schematic diagram of coding and decoding process of AcGAN. (a) and (e), the MIM structure (3.2 × 3.2 μm² unit cell) includes a 0.2 μm metal layer, a variable-height Al₂O₃ dielectric layer, and a freeform resonator of 0.1 μm height, while the hybrid structure (7.5 × 7.5 μm² unit cell) features a 0.2 μm metal layer with a dielectric freeform resonator of unspecified height. (b) and (f) show the 3D-rendered metasurfaces: the MIM resonators are composed of gold, silver, or aluminum with plasma frequencies of 1.91 PHz, 2.32 PHz, or 3.57 PHz, and dielectric layers of 100, 200, or 300 nm thickness; the hybrid resonators are made of ZnSe, Si, or Ge with refractive indices of 2.41, 3.42, or 4.01, and dielectric layers of 500, 750, or 950 nm thickness. In (c) and (g), the structures are encoded into 64 × 64 × 3 RGB images. For MIM structures, the red channel encodes both the metal material properties and resonator geometry, with the green channel set to zero. For hybrid dielectric structures, the green channel encodes both the dielectric material properties and geometry, with the red channel set to zero. In both cases, the blue channel encodes the thickness of the dielectric layer. (d) and (h) present the final decoded images representing the letters “A” (MIM) and “I” (Hybrid).

Our framework employs an advanced controller mechanism to address the lack of structural diversity, which acts as an intelligent hub within our architecture, orchestrating the flow and preprocessing of spectral data through sophisticated clustering strategies. Specifically, we utilize the K-means clustering algorithm to segment the training spectral dataset S = s 1 , s 2 , … , s n where each s _i represents a unique spectral data point in R m . The algorithm partitions S into k distinct clusters, optimizing the following objective:

(3) min C ∑ i = 1 k ∑ s ∈ C i ‖ s − c i ‖ 2

where C = C 1 , C 2 , … , C k represents the set of clusters, and c _i is the centroid or cluster center of C _i, embodying the average spectral profile of the cluster. These centroids are then used as reference points to compute the SOC for given spectral input, resulting in a k-dimensional vector v = SOC s , c 1 , SOC s , c 2 , … , SOC s , c k . This vector quantitatively describes the input’s alignment with preidentified spectral categories, enriching the input representation with both detailed and contextual spectral information. The resultant vector v is then concatenated with the original spectral data s to form a comprehensive control vector u = s ; v , which is then input into the generator and discriminator. This enriched input empowers the generator to explore a wider design space, promoting the creation of diverse and functionally tailored metasurfaces. By integrating both detailed and aggregated spectral data, the controller ensures designs not only vary more broadly but also align closely with desired spectra, addressing the challenge of structural diversity problems.

Armed with the comprehensive control vector u – rich in both granularity and contextual insight – the generator is poised to harness these data for metasurface design. As depicted in Figure 4(a), the generator, equipped with a control vector u and a latent vector z ∼ N 0,1 (800-dimensional), uses deconvolutional layers to map enriched spectral inputs into spatial structures. The latent vector introduces stochasticity, enabling the model to generate diverse structural solutions for a given spectral target. These layers, coupled with a noise vector for randomness, facilitate a broad exploration of design spaces, which is crucial for achieving one-to-many mappings in metasurface designs. The process is refined through up-sampling, which ensures the preservation of essential spectral features, allowing the generator to produce diverse and functionally effective metasurfaces. The generator’s effectiveness is quantified by a loss function L _G, comprising three pivotal components:

(4) L G = γ L adv G + α L spectral + β L structural

Figure 4:

Detailed architectural overview of AcGAN components. (a) Generator: transforms control vectors u into metasurface structures M _g using deconvolutions; (b) Discriminator: assesses metasurface designs by applying convolutions to predict authenticity scores ranging from 0 to 1; (c) AnchorNet: predicts spectral responses s _g from metasurface by leveraging bottleneck layers.

The α and β represent the weights for the spectral and structural losses, respectively, and γ denotes the weight for the adversarial losses. The adversarial loss component is defined as: L adv G = − E z ∼ p Z z , u ∼ p U u log D G z , u , u where E z ∼ p Z z , u ∼ p U u denotes the expectation over the distributions of latent vectors z and control vectors u. Here, G z , u is the generator’s output for a given latent vector z and control vectors u, where D G z , u represents the discriminator’s assessment of how real or fake the generated metasurface. L spectral = SOC s g , s t measures the spectral similarity, ensuring the generated metasurface aligns with the target spectrum. The structural loss L _structural is calculated using the Structural Similarity Index (SSIM) [46] between the generated and referenced samples: L structural = SSIM M g , M r . This metric helps regularize the similarity between generated metasurfaces and reference structures, encouraging that different regions of the latent space z correspond to distinct metasurface designs. Parameters α and β strategically balance the spectral and structural loss components within the loss function, tailoring the generator’s output to meet specific operational demands while promoting structural diversity. This strategy guarantees that the generated designs not only effectively deceive the discriminator, demonstrating their realistic characteristics, but also accurately meet the targeted EM specifications and display significant structural diversity, thereby increasing their practical utility across various applications.

The discriminator, as depicted in Figure 4(b) is integral to the AcGAN framework, tasked primarily with validating the authenticity of the metasurface designs generated by the generator. It employs a sophisticated convolutional network to critically assess if the generated designs accurately reflect real metasurface EM properties. The discriminator assesses the quality of the synthesized designs using both learned features and heuristics from the training phase. It acts as the critical feedback component, thus guiding the generative process toward the production of metasurfaces with enhanced practical applicability. The discriminator’s functionality is meticulously evaluated through a composite loss function L _D:

(5) L D = γ L adv D + L mismatch + α L spectral

where the adversarial loss is expressed as: L adv D = E M r , u ∼ p data M r , u log D M r , u − E z ∼ p Z z , u ∼ p U u log 1 − D G z , u , u . This component compares the discriminator’s predictions for referenced metasurface data M r , u , sampled from the dataset, and generated data G z , u from the generator. The first term, E M r , u ∼ p data M r , u log D M r , u , encourages the discriminator to correctly identify real metasurfaces, maximizing the log probability of recognizing real designs. The second term, E z ∼ p Z z , u ∼ p U u log 1 − D G z , u , u , penalizes the discriminator for falsely classifying generated metasurfaces as real, pushing it to distinguish between authentic and generated samples effectively. The mismatch loss L _mismatch enhances the discriminator’s ability to detect inconsistencies between the referenced and generated metasurface, which is computed as:

(6) L mismatch = − E z ∼ p Z z , u ′ ∼ p U u ′ log 1 − D G z , u ′ , u

where u′ represents a control vector that mismatches the intended input conditions for z, encouraging the discriminator to penalize incorrect spectral-structure pairings. This loss is inspired by widely adopted practices in conditional GANs [47], [48], [49], where mismatch regularization improves conditional consistency. In our case, it encourages the generated metasurfaces to precisely match the given spectral targets, which is essential for reliable metasurface inverse design. The L _spectral here refers to the same spectral loss term defined in Eq. (4).

Building upon the intricate interplay between the generator and discriminator within our AcGAN framework, AnchorNet emerges as a pivotal advancement, depicted in Figure 4(c). It incorporates a tailored Bottleneck ResNet [50] architecture, where residual shortcuts deliver structural features from earlier layers directly to deeper ones, helping preserve geometric detail and improve the accuracy of spectral-response prediction. AnchorNet is finely tuned to minimize SOC s ̂ , s , where s ̂ is the predicted spectrum, and s is the ground truth spectrum obtained from EM simulations. An early stopping mechanism is integrated into the training protocol to halt the learning process after a predetermined number of epochs without improvement in training loss, ensuring computational efficiency and preventing overfitting. Integral to the AcGAN architecture, AnchorNet serves as a fast and reliable surrogate model specifically tailored to predict spectrally resolved responses of complex MIM and hybrid metasurfaces [51], [52], enabling efficient evaluation during training. Specifically developed to assess the EM performance of structures generated by AcGAN, AnchorNet surpasses the conventional visual assessment criteria employed in CGANs. AnchorNet is pretrained and its parameters are frozen during GAN training, following a decoupled scheme to enhance training stability and efficiency [52]. It focuses on aligning EM responses, enabling the optimization of metasurfaces for high EM fidelity, independent of visual resemblance to referenced designs.

The AcGAN framework integrates a carefully designed loss function that is crucial for balancing spectral precision and structural diversity in metasurface design. This loss function governs the interaction between the generator and discriminator to ensure that each design meets stringent spectral standards while exhibiting significant structural variation:

(7) L = min G m a x D L D , G = γ L adv + L mismatch + α L spectral + β L structural = γ E M r , u ∼ p data M r , u , u ∼ p U u log ⁡ D M r , u + E Z ∼ p Z z , u ∼ p U u log 1 − D G z , u , u − E z ∼ p Z z , u ′ ∼ p U u ′ log 1 − D G z , u ′ , u + α SOC s g , s t + β SSIM M g , M r

where adversarial loss L _adv enables the generator to improve in deceiving the discriminator by making generated metasurfaces more realistic, the mismatch loss L _mismatch ensures that the generated metasurface aligns with the control vectors, further enhancing design accuracy, the spectral loss ensures that the generated metasurfaces’ spectral responses match the target spectra, and the structural loss encourages structural diversity by comparing the generated metasurfaces to referenced designs. This formulation ensures that each generated design meets stringent spectral criteria while achieving structural diversity. AGAN generates key parameters such as ω _p, d and n, facilitating the exploration of new materials and structures.

3 Results

To validate the innovative contributions of our AcGAN framework, we conducted a series of experiments focusing on key performance metrics such as spectral fidelity, structural diversity, and computational efficiency. Specifically, we tested AcGAN’s ability to generate metasurface designs that meet precise spectral response criteria while overcoming the limitations of existing methods in structural diversity. By employing the novel SOC alongside traditional MSE, we quantitatively assessed the accuracy of the generated designs. Additionally, we explored AcGAN’s capacity to generate diverse metasurface configurations for the same target spectrum, leveraging its one-time training advantage for rapid and efficient design iterations. All the experiments were conducted on a computational setup of Intel Xeon E5-2680 CPU (2.50 GHz) and an NVIDIA GeForce RTX 3090 GPU with 24 GB of VRAM, operating Python 3.9.12 on the Ubuntu Linux platform.

We firstly evaluated AnchorNet’s performance within the AcGAN framework using a dataset of 18,768 metasurface structures, with one-third categorized as hybrid and two-thirds as MIM. To ensure thorough testing, 90 % of the dataset was allocated for training, and the remaining 10 % served as the test set; the detailed hyperparameter setting of AnchorNet is shown in Supporting information S3. As illustrated in Figure 5(a), SOC losses decreased progressively over 342 epochs, with training ceasing upon reaching an early stopping threshold of 30 consecutive epochs without validation loss improvement. The training consumed 136 min, with final SOC losses for training and testing converging to 0.0405 and 0.0807, respectively. After training, AnchorNet could predict the spectrum of metasurface in an average time of 3.6 × 10⁻⁴ s, reducing computation time to approximately 1/1,600,000th of the 560.48 s required by FDTD simulations, accelerating the evaluation process for metasurface inverse design, enabling faster iterations and enhancements.

$Figure 5: Performance evaluation and spectral predictions of AnchorNet. (a) Training and validation SOC loss curves with early stopping implementation. (b)–(c) SOC distributions: SOC distributions for hybrid (b) and MIM (c) metasurfaces indicating predictive accuracy. (d)–(i) Spectral comparisons for hybrid and MIM structures: displays best, mean, and worst result for hybrid (d)–(f) and MIM (g)–(i) metasurfaces, with insets showing respective metasurface. Red and green lines represent actual and predicted spectra of AnchorNet, respectively. Here, ω p denotes the plasma frequency of the metal (for MIM structures), n is the refractive index of the dielectric resonator (for hybrid structures), and d represents the dielectric layer thickness (for both structures).$

Figure 5:

Performance evaluation and spectral predictions of AnchorNet. (a) Training and validation SOC loss curves with early stopping implementation. (b)–(c) SOC distributions: SOC distributions for hybrid (b) and MIM (c) metasurfaces indicating predictive accuracy. (d)–(i) Spectral comparisons for hybrid and MIM structures: displays best, mean, and worst result for hybrid (d)–(f) and MIM (g)–(i) metasurfaces, with insets showing respective metasurface. Red and green lines represent actual and predicted spectra of AnchorNet, respectively. Here, ω _p denotes the plasma frequency of the metal (for MIM structures), n is the refractive index of the dielectric resonator (for hybrid structures), and d represents the dielectric layer thickness (for both structures).

The assessment of AnchorNet across the dataset revealed differences between the hybrid and MIM categories, as detailed in Figure 5(b) to (i). The SOC distribution in Figure 5(b) and (c) demonstrates that MIM structures are predominantly concentrated in lower SOC intervals, indicating higher prediction accuracy compared to Hybrid structures. Quantitatively, the average SOC for MIM structures is 0.0622, which is substantially lower than that of Hybrid structures (0.1346). This result reflects that the network yields more accurate spectral predictions for MIM structures, whose responses are typically characterized by smoother Lorentzian profiles. Figure 5(d)–(i) explicitly demonstrate AnchorNet’s spectral prediction capabilities by presenting instances with the minimum, mean (close to the average SOC), and maximum SOC metrics. Specifically, Figure 5(d)–(f) for hybrid structures and Figure 5(g)–(i) for MIM structures illustrate the accuracy of spectral predictions through comparative plots that juxtapose predicted spectra with ground truth spectra. The improved accuracy for MIM structures is due to their simpler Lorentzian profiles, which are smoother and more predictable than the complex, asymmetric Fano resonances of hybrid structures. These Fano resonances, with sharp variations from quantum interference, present significant predictive challenges. Additionally, the spectral comparison in Figure 5(e) and (h) highlights a key discrepancy: despite a higher MSE in Figure 5(h), the spectral alignment closely matches the ground truth, particularly around peak regions. In contrast, Figure 5(e) shows significant deviations at peak intensities and across broader spectral regions, highlighting MSE’s limitations as a reliable metric. SOC, by providing a more accurate measure of spectral similarity, proves superior to MSE in evaluating spectral fidelity. Inspired by Metric Learning [53], we evaluated Euclidean-based metrics (e.g., MSE and MAE) and SOC for spectral data dimensionality reduction. As detailed in Supporting information S4, our analysis using Principal Component Analysis (PCA) [54], Locally Linear Embedding (LLE) [55], t-Distributed Stochastic Neighbor Embedding (t-SNE) [56], and Autoencoders (AE) [57] revealed Euclidean-based metrics’ limitations in distinguishing classes. In contrast, SOC improved class separability, as shown in Figure S2 in Supporting information S4, demonstrating its effectiveness in preserving intrinsic spectral properties.

To evaluate the performance of our AcGAN method against existing metasurface inverse design techniques, Table 1 provides a comparison across key metrics including training and generation efficiency, MSE, and SOC. MSE is computed as the average squared difference between the generated spectral response s _g and the target spectrum s _t across all discretized spectral points:

(8) M S E = 1 N ∑ i = 1 N s g , i − s t , i 2

where N is the number of spectral points within the range of 4–12 μm. SOC is defined as:

(9) S O C = 1 − ∑ i = 1 N min s g , i , s t , i ∑ i = 1 N max s g , i , s t , i

Table 1:

Comparison of metasurface design methods based on training and design efficiency, MSE, and SOC: “Training time” indicates the time required for model training, while “Generation time” measures the time needed to design a metasurface that matches the target spectrum. MSE and SOC assess the spectral fidelity of the designed metasurface relative to the desired spectrum. “One-time training” indicates whether the model requires retraining for new spectral targets. All the experiments were conducted on a computational setup of Intel Xeon E5-2680 CPU (2.50 GHz) and an NVIDIA GeForce RTX 3090 GPU with 24 GB of VRAM, operating Python 3.9.12 on the Ubuntu Linux platform, with all methods benchmarked under identical hardware and software settings. Bold values denote the best performance for each metric.

Method	Training time	Generation time	MSE	SOC	One-time training
Physics-inspired	–	Days or months	–	–	No
GA [58]	–	3.89 h	2.120 × 10⁻²	0.564	No
PSO [59]	–	2.23 h	2.010 × 10⁻²	0.557	No
DE [60]	–	4.45 h	2.070 × 10⁻²	0.572	No
DNN [61]	13.4 h	6.1 × 10⁻⁴s	1.500 × 10⁻²	0.471	Yes
VAE [62]	16.8 h	6.4 × 10⁻⁴s	1.450 × 10⁻²	0.454	Yes
CGAN [35]	9.62 h	4.2 × 10⁻⁴s	4.151 × 10⁻³	0.274	Yes
AcGAN	4.16 h	4.2 × 10 ⁻⁴ s	1.120 × 10 ⁻³	0.139	Yes

Detailed hyperparameter settings and analysis for AcGAN are provided in Supporting information S5. The hyperparameter settings for comparative methods were adopted from their original papers, and all methods were tested on the same dataset. The results are based on a random selection of 100 spectral data points from the test set. For each data point, corresponding metasurface structures were generated and then simulated using Lumerical FDTD to obtain what can be considered the ground truth spectrum of the designed structures. To address the challenge of limited structural diversity and assess the robustness of designs generated by AcGAN, 256 distinct latent vectors were employed for each target spectrum to explore variations in design accuracy and consistency. Thanks to CUDA’s parallel computing capabilities, generating 256 metasurfaces takes nearly the same time as generating a single one. The design with the lowest SOC relative to the target spectrum is selected as the final design. Traditional physics-inspired methods, in contrast, demonstrate significant inefficiencies as they often require days to months to design a single metasurface and do not support one-time training. Heuristic algorithms like GA [58], PSO [59], and DE [60] are faster but achieve only moderate SOC and MSE, indicating suboptimal spectral fidelity. In contrast, AI-based techniques such as DNN [61], VAE [62], and CGAN [35] significantly reduce prediction latency to milliseconds while improving SOC and MSE. Among these, our AcGAN method excels by recording the fastest generation time of only 4.2 × 10⁻⁴ s, and achieving the lowest MSE (1.120 × 10⁻³) and SOC (0.139). Additionally, the one-time training feature of machine learning methods presents considerable advantages over the iterative, resource-intensive nature of traditional and heuristic approaches.

Figure 6 presents 9 representative cases from the test dataset, each showcasing the design generated by AcGAN with the lowest SOC relative to the target spectrum, highlighting the model’s ability to achieve high spectral fidelity. The spectra from FDTD simulations of metasurfaces designed by AcGAN closely align with the target spectra, underscoring AcGAN’s unprecedented efficiency and accuracy in generating metasurfaces precisely tailored to specific spectral requirements, a significant advancement over traditional methods. Moreover, the close match between the spectra from referenced metasurface structures and the target spectra predicted by AnchorNet emphasizes AnchorNet’s exceptional predictive capability, which is critical for ensuring high electromagnetic fidelity in the design process. Notably, AcGAN accurately designed MIM structures for Lorentzian spectra and hybrid structures for Fano spectra, successfully distinguishing between different physical mechanisms during training without confusion. This demonstrates AcGAN’s innovative capability to effectively differentiate between distinct metasurface types and their corresponding absorption spectra, showcasing its proficiency not only in achieving high design accuracy but also in understanding and applying different physical mechanisms – a significant advancement in metasurface design. AcGAN demonstrates a notable capacity to design metasurfaces that closely match target spectra while significantly diverging in structural dimensions, material properties, and dielectric thickness, as shown in Figures 6 and 7. Analysis of the three metasurfaces with the lowest SOC reveals that, although their absorption spectra closely match the target, their physical configurations vary significantly. For instance, material properties deviate by an average of 12.9 %, and dielectric thicknesses vary by 20.0 %, highlighting the model’s ability to achieve diverse designs beyond traditional visual constraints. In particular, Figure 7(b) highlights that design1’s thickness is reduced by 26.7 % compared to the referenced metasurface, simplifying the manufacturing process. Despite these variations, the average SSIM between the generated and referenced metasurfaces is 0.727, indicating substantial structural differences while maintaining functional integrity, as shown in Figure 7(b) and (c). While variations in the latent vector z introduce observable diversity in the generated structures, we did not observe a consistent or interpretable mapping between specific regions of the latent space and distinct metasurface features. This may suggest that z primarily serves to introduce structural variability rather than encoding explicit design semantics, which is consistent with previous observations in generative design literature [63], [64]. Due to the stochastic nature of deep learning-based generative methods, slight leakage between mutually exclusive channels may occasionally occur, for example in Figure 7(e). Although the leakage ratio is low (∼0.23 % per structure), the automatic removal of these pixels during decoding into metasurface structures alters the final design. While this does not significantly affect practical implementation, it represents a methodological limitation. To further mitigate such leakage, stronger constraints could be considered in future iterations of the model.

Figure 6:

Randomly selected spectra from the test dataset served as input targets for AcGAN. This figure presents a comparative analysis between the target spectra (solid green lines) and the corresponding spectra of metasurfaces designed by AcGAN after FDTD simulation (solid red lines). Dashed blue and yellow lines represent the spectra predicted by AnchorNet for referenced metasurface and AcGAN-designed metasurfaces, respectively. To the right of each plot, green-framed images depict reference metasurface structures with material and thickness parameters, while red-framed images show AcGAN-designed metasurfaces, annotated with material types and thicknesses (t in nanometers), and plasma frequency (ωP) values are given in PHz.

Figure 7:

Demonstrating AcGAN’s capability for design diversity. This figure compares target spectra (solid green lines) with three distinct spectra generated by AcGAN (solid blue, yellow, and purple lines), highlighting AcGAN’s ability to create multiple diverse metasurface designs from a single target specification. Adjacent to each spectral plot, images within green frames depict referenced metasurface structures with detailed material and thickness parameters. Corresponding images in blue, yellow, and purple frames showcase the various designs created by AcGAN, each annotated with specific material types, thicknesses, and plasma frequency values.

To further illustrate AcGAN’s ability to enhance design diversity, Figure 8 presents the near-field electric distributions in the XY plane for both MIM and hybrid metasurface structures. This visualization demonstrates AcGAN’s ability to not only match the target spectral responses but also innovate in the spatial arrangement of meta-atoms across various planes. The corresponding electric field variations in the XZ and YZ planes, which exhibit similarly diverse distributions, are discussed in Supporting information S6. The design versatility enabled by AcGAN allows a single imaging system to perform multiple functions, such as standard and polarimetric imaging, without requiring changes to the optical components. This adaptability significantly enhances the visualization of cellular or tissue structures across different depths and orientations. By generating metasurfaces with tailored EM functionalities, AcGAN expands the operational flexibility and efficiency of imaging systems, paving the way for broader applications.

Figure 8:

Near-field electric responses in the XY plane for MIM and hybrid metasurface: (a) MIM metasurface: showcases the near-field electric responses at various wavelengths (4 μm–12 μm) for MIM metasurface in Figure 7(f). The depth of the colors i reflects the magnitude of the near-field electric field strength. (b) Hybrid metasurface: presents the near-field electric responses at wavelengths from 4 μm to 12 μm for hybrid metasurface in Figure 7(i).

To assess AcGAN’s ability to handle arbitrarily defined spectral challenges, we explored four spectral types: Fano, Lorentzian, Gate, and Gaussian. These spectra were generated according to the details in Supporting information S7, ensuring no same data in the training dataset. For each type, we generated two spectra and simulated 256 metasurface structures per spectrum. The results are shown in Supporting information S8, each panel in Figure S5 contrasts the simulated spectra with the target, highlighting discrepancies in shaded areas and quantifying them with SOC and MSE values. The results show that while AcGAN closely approximates the true spectral characteristics for Fano and Lorentzian resonances Figure S5 a)-d), it exhibits deviations in Gate and Gaussian profiles, particularly at the spectral tails Figure S5 e)-h). This variance suggests AcGAN’s robust performance on spectra present in the database but highlights the need for better model generalization to accommodate theoretically defined but unrepresented spectral types in the training set.

4 Discussion

To further analyze the performance of AcGAN for a variety of spectral designs and their relation to the training dataset, we propose the “weighted distance” metric, which is defined as D w s = ∑ i = 1 k n i N ⋅ SOC s , c i , this metric quantifies the deviation of a spectrum s from the cluster centroids c 1 , c 2 , … , c k based on the training data. Here, n _i is the number of spectra in the i-th cluster, and N is the total number of spectra across all clusters. For empirical analysis, we generated 1,000 spectra for each of the four spectral types and calculated their respective D _w. Each spectrum was categorized into intervals of 0.05 in weighted distance, with 10 spectra sampled per interval to ensure uniform coverage. The results are shown in Supporting Information S9, Figure S6 illustrates the correlation between SOC and weighted distance, with ellipses highlighting the general distribution of SOC against weighted distance for each type. Notably, spectra with lower weighted distances typically achieved lower SOC, indicating closer approximation to the target spectral characteristics. This trend was especially pronounced for Lorentzian and Fano spectra. Conversely, the Gate and Gaussian spectra demonstrated lower SOC, underscoring potential challenges in generating these spectra types due to their underrepresentation in the training data. The observed variance in AcGAN’s performance across spectral types highlights the critical importance of training dataset diversity for model generalization. Expanding the dataset to include a broader spectrum of metasurface configurations could enhance the model’s capability to accurately generate designs for a wider array of spectral responses. Future research will focus on augmenting the dataset and refining model algorithms to improve performance across less represented spectra, thereby broadening the practical applications of the AcGAN framework in metasurface design.

To thoroughly evaluate the AcGAN’s robustness and adaptability to varying configurations, we executed a series of ablation studies, the detailed results of which are presented in Supporting information S10. Our ablation analyses underscored the pivotal roles of the cluster controller and AnchorNet’s integration within both the generator and discriminator. The presence of the cluster controller led to a notable decrease in MSE and SOC, indicating higher fidelity EM design. Similarly, enabling AnchorNet in both model components significantly enhanced the fidelity of generated metasurface designs. Adjustments in the adversarial training dynamics, particularly variations in the k-value, defined as the number of generator updates per discriminator update, revealed that a balanced approach is crucial for stable training and optimal model performance. The studies highlighted that a k-value of 2 was optimal, significantly reducing MSE and increasing SOC compared to other configurations. Moreover, the impact of initial spectral loss weighting, represented by the parameter α, was profound. Surprisingly, a lower weight (α = 0.1) unexpectedly yielded better performance, suggesting that an excessive initial emphasis on spectral fidelity might hinder the model’s ability to generalize across a broader design space. This finding points to the need for a balanced loss function that adequately emphasizes both spectral fidelity and adversarial robustness. Investigations into the effects of latent space dimensions and batch sizes further refined our understanding of model behavior. Optimal latent dimensions and smaller batch sizes tended to improve the model’s precision and stability, indicating that finer granularity in the generation process aids in capturing the nuances of metasurface designs. Collectively, these findings underscore the intricate interdependencies within the AcGAN architecture and highlight the need for careful calibration to maximize performance. These insights are crucial for refining the AcGAN framework to enhance its practical applicability across diverse metasurface design scenarios, thereby extending the capabilities of current computational photonic design methods.

5 Conclusions

AcGAN marks a significant advancement in the inverse design of metasurfaces, showcasing an unprecedented combination of high electromagnetic fidelity and extensive structural diversity. This innovative framework, through rigorous empirical testing, has proven to effectively minimize the MSE by 73 % compared to existing GAN approaches, demonstrating its robust capability to meet stringent spectral demands with enhanced precision. Crucially, AcGAN addresses the perennial challenges of electromagnetic fidelity and structural diversity that have impeded prior generative models. By integrating the SOC and AnchorNet, our framework not only assesses but significantly improves spectral fidelity, ensuring that each generated design adheres closely to the desired electromagnetic characteristics. This precision is vital for applications in complex optoelectronic systems, where exact spectral properties are critical for functionality. Furthermore, AcGAN innovatively incorporates a cluster-guided controller, which refines input processing and facilitates the exploration of diverse structural configurations. This feature is essential for overcoming the one-to-many mapping dilemma inherent in metasurface design, allowing for a broader range of functional possibilities within a single design process. The dynamic loss function, shifting focus from data-driven learning to optimizing spectral and structural outcomes, further underscores our method’s adaptability and efficiency.

Looking ahead, future enhancements to AcGAN will focus on several critical areas: 1. Optimizing AnchorNet: Enhancing its predictive accuracy, particularly for hybrid structures with complex Fano resonance profiles, which currently present significant challenges. 2. Enhancing encoding techniques: Expanding the range of design variables to include more intricate and functional metasurface configurations, thereby broadening the scope of metasurface applications. 3. Expanding dataset diversity: Incorporating a broader array of metasurface structures, especially free-form designs, to improve the model’s generalization capabilities and robustness. 4. Considering manufacturability and fabrication tolerances: Addressing potential challenges in manufacturing complex metasurface designs by ensuring that the generated structures are not only diverse but also feasible to produce with existing fabrication technologies. This will improve the practical applicability of AcGAN-generated designs in real-world scenarios. In addition, the proposed AcGAN framework can be naturally bridged with classical optimization strategies such as adjoint solvers or evolutionary algorithms [34], [65]. Such hybrid integration holds strong potential for enhancing spectral controllability and improving the robustness of inverse design solutions. Finally, combining AcGAN with classical mode-collapse mitigation strategies [66], [67] – such as architectural innovations [68], diversity-aware regularization [69] – may provide complementary benefits, offering a promising path to further enhance structural diversity.

AcGAN represents not only a methodological breakthrough but also a scalable and robust framework that significantly accelerates the design process in nanophotonic applications. This makes AcGAN a pivotal tool for advancing operational metasurfaces, meeting the evolving demands of optoelectronics and related industries.

Corresponding authors: Hongkun Cao, Peng Cheng Laboratory, Shenzhen 518055, China, E-mail: caohk@pcl.ac.cn; and Xin Jin, Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China, E-mail: jin.xin@sz.tsinghua.edu.cn

Funding source: National Natural Science Foundation of China

Award Identifier / Grant number: 62131011

Funding source: Major Key Project of PCL

Award Identifier / Grant number: PCL2023A10-3

Funding source: Shenzhen Science and Technology Innovation Program

Award Identifier / Grant number: JCYJ20241202123921029

Research funding: This work was supported in part by Natural Science Foundation of China under Grant 62131011, in part by Shenzhen Science and Technology Program under Grant JCYJ20241202123921029, and in part by the Major Key Project of PCL under Grant PCL2023A10-3.
Author contribution: YZ conceived the idea and implemented the inverse design algorithm. XJ and HC oversaw the research. The manuscript was written through the contributions of all authors. All authors have accepted responsibility for the entire content of this manuscript and consented to its submission to the journal, reviewed all the results, and approved the final version of the manuscript.
Conflict of interest: Authors state no conflict of interest.
Data availability statement: The data that support the findings of this study are available from the corresponding authors upon reasonable request.
Supporting information: Including the pseudocode of the training of AcGAN, literature review on spectral similarity evaluation metrics, detailed hyperparameter setting of AnchorNet, evaluating the impact of spectral similarity evaluation metrics on spectral data dimensionality reduction, detailed hyperparameter setting of AcGAN, diverse near-field electric responses of metasurfaces with similar absorption spectra designed by AcGAN, generation of four arbitrarily defined spectra, performance of AcGAN of arbitrarily defined spectrum, correlation between weighted distance and SOC for various spectral types, and ablation study.

References

[1] J. Chen, C. Qian, J. Zhang, Y. Jia, and H. Chen, “Correlating metasurface spectra with a generation-elimination framework,” Nat. Commun., vol. 14, no. 1, p. 4872, 2023, https://doi.org/10.1038/s41467-023-40619-w.Suche in Google Scholar PubMed PubMed Central

[2] A. S. Rana, M. Zubair, A. Danner, and M. Q. Mehmood, “Revisiting tantalum based nanostructures for efficient harvesting of solar radiation in STPV systems,” Nano Energy, vol. 80, p. 105520, 2021, https://doi.org/10.1016/j.nanoen.2020.105520.Suche in Google Scholar

[3] J. Y. Dai, J. Zhao, Q. Cheng, and T. J. Cui, “Independent control of harmonic amplitudes and phases via a time-domain digital coding metasurface,” Light Sci. Appl., vol. 7, no. 1, p. 90, 2016, https://doi.org/10.1038/s41377-018-0092-z.Suche in Google Scholar PubMed PubMed Central

[4] L. Wang, et al.., “Grayscale transparent metasurface holograms,” Optica, vol. 3, no. 12, p. 1504, 2016, https://doi.org/10.1364/OPTICA.3.001504.Suche in Google Scholar

[5] X. Chen, Y. Zhang, L. Huang, and S. Zhang, “Ultrathin metasurface laser beam shaper,” Adv. Opt. Mater., vol. 2, no. 10, pp. 978–982, 2014, https://doi.org/10.1002/adom.201400186.Suche in Google Scholar

[6] M. Khorasaninejad, W. T. Chen, R. C. Devlin, J. Oh, A. Y. Zhu, and F. Capasso, “Metalenses at visible wavelengths: Diffraction-limited focusing and subwavelength resolution imaging,” Science, vol. 352, no. 6290, pp. 1190–1194, 2016, https://doi.org/10.1126/science.aaf6644.Suche in Google Scholar PubMed

[7] J. P. Balthasar Mueller, N. A. Rubin, R. C. Devlin, B. Groever, and F. Capasso, “Metasurface polarization optics: Independent phase control of arbitrary orthogonal states of polarization,” Phys. Rev. Lett., vol. 118, no. 11, p. 113901, 2017, https://doi.org/10.1103/PhysRevLett.118.113901.Suche in Google Scholar PubMed

[8] A. Arbabi, Y. Horie, M. Bagheri, and A. Faraon, “Dielectric metasurfaces for complete control of phase and polarization with subwavelength spatial resolution and high transmission,” Nat. Nanotech., vol. 10, no. 11, pp. 937–943, 2015, https://doi.org/10.1038/nnano.2015.186.Suche in Google Scholar PubMed

[9] Z. Wang, T. Li, A. Soman, D. Mao, T. Kananen, and T. Gu, “On-chip wavefront shaping with dielectric metasurface,” Nat. Commun., vol. 10, no. 1, p. 3547, 2019, https://doi.org/10.1038/s41467-019-11578-y.Suche in Google Scholar PubMed PubMed Central

[10] Z. Jin, et al.., “Phyllotaxis-inspired nanosieves with multiplexed orbital angular momentum,” eLight, vol. 1, no. 1, p. 5, 2021, https://doi.org/10.1186/s43593-021-00005-9.Suche in Google Scholar

[11] R. J. Lin, et al.., “Achromatic metalens array for full-colour light-field imaging,” Nat. Nanotechnol., vol. 14, no. 3, pp. 227–231, 2019, https://doi.org/10.1038/s41565-018-0347-0.Suche in Google Scholar PubMed

[12] G. Kim, S. Kim, H. Kim, J. Lee, T. Badloe, and J. Rho, “Metasurface-empowered spectral and spatial light modulation for disruptive holographic displays,” Nanoscale, vol. 14, no. 12, pp. 4380–4410, 2022, https://doi.org/10.1039/D1NR07909C.Suche in Google Scholar

[13] Y. Liang, et al.., “Full-stokes polarization perfect absorption with diatomic metasurfaces,” Nano Lett., vol. 21, no. 2, pp. 1090–1095, 2021, https://doi.org/10.1021/acs.nanolett.0c04456.Suche in Google Scholar PubMed

[14] M.-H. Chen, B.-W. Chen, K.-L. Xu, and V.-C. Su, “Wide-angle optical metasurface for vortex beam generation,” Nanomaterials, vol. 13, no. 19, p. 19, 2016, https://doi.org/10.3390/nano13192680.Suche in Google Scholar PubMed PubMed Central

[15] Z. Yu, et al.., “High-security learning-based optical encryption assisted by disordered metasurface,” Nat. Commun., vol. 15, no. 1, p. 2607, 2024, https://doi.org/10.1038/s41467-024-46946-w.Suche in Google Scholar PubMed PubMed Central

[16] Y. Wu, et al.., “Tbps wide-field parallel optical wireless communications based on a metasurface beam splitter,” Nat. Commun., vol. 15, no. 1, p. 7744, 2024, https://doi.org/10.1038/s41467-024-52056-4.Suche in Google Scholar PubMed PubMed Central

[17] Z. Li, R. Pestourie, Z. Lin, S. G. Johnson, and F. Capasso, “Empowering metasurfaces with inverse design: Principles and applications,” ACS Photonics, vol. 9, no. 7, pp. 2178–2192, 2022, https://doi.org/10.1021/acsphotonics.1c01850.Suche in Google Scholar

[18] X. Zhang, Y. Liu, J. Han, Y. Kivshar, and Q. Song, “Chiral emission from resonant metasurfaces,” Science, vol. 377, no. 6611, pp. 1215–1218, 2022, https://doi.org/10.1126/science.abq7870.Suche in Google Scholar PubMed

[19] Z. Li, et al.., “Controlling propagation and coupling of waveguide modes using phase-gradient metasurfaces,” Nat. Nanotech., vol. 12, no. 7, pp. 675–683, 2017, https://doi.org/10.1038/nnano.2017.50.Suche in Google Scholar PubMed

[20] E. Maguid, I. Yulevich, M. Yannai, V. Kleiner, M. L. Brongersma, and E. Hasman, “Multifunctional interleaved geometric-phase dielectric metasurfaces,” Light Sci. Appl., vol. 6, no. 8, p. e17027, 2017, https://doi.org/10.1038/lsa.2017.27.Suche in Google Scholar PubMed PubMed Central

[21] S. Wang, et al.., “Broadband achromatic optical metasurface devices,” Nat. Commun., vol. 8, no. 1, p. 1, 2017, https://doi.org/10.1038/s41467-017-00166-7.Suche in Google Scholar PubMed PubMed Central

[22] Y. Fu, et al.., “Unleashing the potential: AI empowered advanced metasurface research,” Nanophotonics, vol. 13, no. 8, pp. 1239–1278, 2024, https://doi.org/10.1515/nanoph-2023-0759.Suche in Google Scholar PubMed PubMed Central

[23] S. So, J. Mun, J. Park, and J. Rho, “Revisiting the design strategies for metasurfaces: Fundamental physics, optimization, and beyond,” Adv. Mater., vol. 35, no. 43, p. 2206399, 2023, https://doi.org/10.1002/adma.202206399.Suche in Google Scholar PubMed

[24] W. T. Chen, et al.., “A broadband achromatic metalens for focusing and imaging in the visible,” Nat. Nanotech., vol. 13, no. 3, p. 3, 2018, https://doi.org/10.1038/s41565-017-0034-6.Suche in Google Scholar PubMed

[25] T. Qiu, et al.., “Deep learning: A rapid and efficient route to automatic metasurface design,” Adv. Sci., vol. 6, no. 12, p. 1900128, 2019, https://doi.org/10.1002/advs.201900128.Suche in Google Scholar PubMed PubMed Central

[26] C. Chen and G. X. Gu, “Generative deep neural networks for inverse materials design using backpropagation and active learning,” Adv. Sci., vol. 7, no. 5, p. 1902607, 2020, https://doi.org/10.1002/advs.201902607.Suche in Google Scholar PubMed PubMed Central

[27] J. Zhang, et al.., “Harnessing the missing spectral correlation for metasurface inverse design,” Adv. Sci., vol. 11, no. 33, 2024, Art no. 2308807. https://doi.org/10.1002/advs.202308807.Suche in Google Scholar PubMed PubMed Central

[28] Y. Gao, et al.., “Meta‐attention deep learning for smart development of metasurface sensors,” Adv. Sci., vol. 11, no. 42, p. 2405750, 2024, https://doi.org/10.1002/advs.202405750.Suche in Google Scholar PubMed PubMed Central

[29] M. K. Chen, X. Liu, Y. Sun, and D. P. Tsai, “Artificial intelligence in meta-optics,” Chem. Rev., vol. 122, no. 19, pp. 15356–15413, 2022, https://doi.org/10.1021/acs.chemrev.2c00012.Suche in Google Scholar PubMed PubMed Central

[30] Y. Yin, et al.., “Multi‐dimensional multiplexed metasurface holography by inverse design,” Adv. Mater., vol. 36, no. 21, p. 2312303, 2024, https://doi.org/10.1002/adma.202312303.Suche in Google Scholar PubMed

[31] L. Huang, et al.., “Broadband thermal imaging using meta-optics,” Nat. Commun., vol. 15, no. 1, p. 1662, 2024, https://doi.org/10.1038/s41467-024-45904-w.Suche in Google Scholar PubMed PubMed Central

[32] R. P. S, A. Jain, R. kumar, and A. Mitra, “AI‐Enabled inverse design and molecular identification using phase change metamaterial absorber,” Adv. Opt. Mater, vol. 13, no. 9, 2025, Art no. 2402407. https://doi.org/10.1002/adom.202402407.Suche in Google Scholar

[33] Z. Liu, D. Zhu, S. P. Rodrigues, K.-T. Lee, and W. Cai, “Generative model for the inverse design of metasurfaces,” Nano Lett., vol. 18, no. 10, pp. 6570–6576, 2018, https://doi.org/10.1021/acs.nanolett.8b03171.Suche in Google Scholar PubMed

[34] J. Jiang, D. Sell, S. Hoyer, J. Hickey, J. Yang, and J. A. Fan, “Free-form diffractive metagrating design based on generative adversarial networks,” ACS Nano, vol. 13, no. 8, pp. 8872–8878, 2019, https://doi.org/10.1021/acsnano.9b02371.Suche in Google Scholar PubMed

[35] C. Yeung, et al.., “Global inverse design across multiple photonic structure classes using generative deep learning,” Adv. Opt. Mater., vol. 9, no. 20, p. 2100548, 2021, https://doi.org/10.1002/adom.202100548.Suche in Google Scholar

[36] C. Yeung, B. Pham, R. Tsai, K. T. Fountaine, and A. P. Raman, “DeepAdjoint: An all-in-one photonic inverse design framework integrating data-driven machine learning with optimization algorithms,” ACS Photonics, vol. 10, no. 4, pp. 884–891, 2023. https://doi.org/10.1021/acsphotonics.2c00968.Suche in Google Scholar

[37] M. Mirza and S. Osindero, “Conditional generative adversarial nets,” arXiv: arXiv:1411.1784, 2014. http://arxiv.org/abs/1411.1784 Accessed Aug 23, 2022.Suche in Google Scholar

[38] S. So and J. Rho, “Designing nanophotonic structures using conditional deep convolutional generative adversarial networks,” Nanophotonics, vol. 8, no. 7, pp. 1255–1261, 2019, https://doi.org/10.1515/nanoph-2019-0117.Suche in Google Scholar

[39] S. An, et al.., “Multifunctional metasurface design with a generative adversarial network,” Adv. Opt. Mater., vol. 9, no. 5, p. 2001433, 2021, https://doi.org/10.1002/adom.202001433.Suche in Google Scholar

[40] X. Han, Z. Fan, Z. Liu, C. Li, and L. J. Guo, “Inverse design of metasurface optical filters using deep neural network with high degrees of freedom,” InfoMat, vol. 3, no. 4, pp. 432–442, 2021, https://doi.org/10.1002/inf2.12116.Suche in Google Scholar

[41] A. Baucour, M. Kim, and J. Shin, “Data-driven concurrent nanostructure optimization based on conditional generative adversarial networks,” Nanophotonics, vol. 11, no. 12, pp. 2865–2873, 2022, https://doi.org/10.1515/nanoph-2022-0005.Suche in Google Scholar PubMed PubMed Central

[42] C. Qian, I. Kaminer, and H. Chen, “A guidance to intelligent metamaterials and metamaterials intelligence,” Nat. Commun., vol. 16, no. 1, p. 1154, 2025, https://doi.org/10.1038/s41467-025-56122-3.Suche in Google Scholar PubMed PubMed Central

[43] W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photonics, vol. 15, no. 2, pp. 77–90, 2021, https://doi.org/10.1038/s41566-020-0685-y.Suche in Google Scholar

[44] A. Dorodnyy, S. M. Koepfli, A. Lochbaum, and J. Leuthold, “Design of CMOS-compatible metal–insulator–metal metasurfaces via extended equivalent-circuit analysis,” Sci. Rep., vol. 10, no. 1, p. 1, 2023, https://doi.org/10.1038/s41598-020-74849-5.Suche in Google Scholar PubMed PubMed Central

[45] F. Han, et al.., “Hybrid bilayer plasmonic metasurface efficiently manipulates visible light,” Sci. Adv., vol. 2, no. 1, p. e1501168, 2016, https://doi.org/10.1126/sciadv.1501168.Suche in Google Scholar PubMed PubMed Central

[46] Z. Wang, A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli, “Image quality assessment: From error visibility to structural similarity,” IEEE Trans. Image Process, vol. 13, no. 4, pp. 600–612, 2004, https://doi.org/10.1109/TIP.2003.819861.Suche in Google Scholar

[47] B. Li, X. Qi, T. Lukasiewicz, and P. H. S. Torr, “ManiGAN: Text-guided image manipulation,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 7877–7886.10.1109/CVPR42600.2020.00790Suche in Google Scholar

[48] H. Zhou, et al.., “LG-GAN: Label guided adversarial network for flexible targeted attack of point cloud based deep networks,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, pp. 10353–10362.10.1109/CVPR42600.2020.01037Suche in Google Scholar

[49] S. Reed, Z. Akata, X. Yan, L. Logeswaran, B. Schiele, and H. Lee, “Generative adversarial text to image synthesis,” in Proceedings of The 33rd International Conference on Machine Learning, PMLR, 2016, pp. 1060–1069. https://proceedings.mlr.press/v48/reed16.html Accessed Jun 03, 2025.Suche in Google Scholar

[50] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016, pp. 770–778.10.1109/CVPR.2016.90Suche in Google Scholar

[51] M. Frising, J. Bravo-Abad, and F. Prins, “Tackling multimodal device distributions in inverse photonic design using invertible neural networks,” Mach. Learn.: Sci. Technol., vol. 4, no. 2, p. 02LT02, 2023, https://doi.org/10.1088/2632-2153/acd619.Suche in Google Scholar

[52] D. Liu, Y. Tan, E. Khoram, and Z. Yu, “Training deep neural networks for the inverse design of nanophotonic structures,” ACS Photonics, vol. 5, no. 4, pp. 1365–1369, 2018, https://doi.org/10.1021/acsphotonics.7b01377.Suche in Google Scholar

[53] M. Zandehshahvar, et al.., “Metric learning: Harnessing the power of machine learning in nanophotonics,” ACS Photonics, vol. 10, no. 4, pp. 900–909, 2023, https://doi.org/10.1021/acsphotonics.2c01331.Suche in Google Scholar

[54] H. Abdi and L. J. Williams, “Principal component analysis,” WIREs Comput. Stats, vol. 2, no. 4, pp. 433–459, 2010, https://doi.org/10.1002/wics.101.Suche in Google Scholar

[55] S. T. Roweis and L. K. Saul, “Nonlinear dimensionality reduction by locally linear embedding,” Science, vol. 290, no. 5500, pp. 2323–2326, 2000, https://doi.org/10.1126/science.290.5500.2323.Suche in Google Scholar PubMed

[56] G. E. Hinton and S. Roweis, “Stochastic neighbor embedding,” in Advances in Neural Information Processing Systems, MIT Press, 2002. https://proceedings.neurips.cc/paper_files/paper/2002/hash/6150ccc6069bea6b5716254057a194ef-Abstract.html Accessed Oct 04, 2024.Suche in Google Scholar

[57] D. Bank, N. Koenigstein, and R. Giryes, “Autoencoders,” in Machine Learning for Data Science Handbook: Data Mining and Knowledge Discovery Handbook, L. Rokach, O. Maimon, and E. Shmueli, Eds., Cham, Springer International Publishing, 2023, pp. 353–374.10.1007/978-3-031-24628-9_16Suche in Google Scholar

[58] C. Liu, S. A. Maier, and G. Li, “Genetic-algorithm-aided meta-atom multiplication for improved absorption and coloration in nanophotonics,” Photon. Res., vol. 7, no. 7, pp. 1716–1722, 2020, https://doi.org/10.1021/acsphotonics.0c00266.Suche in Google Scholar

[59] J. R. Thompson, H. D. Nelson-Quillin, E. J. Coyle, J. P. Vernon, E. S. Harper, and M. S. Mills, “Particle swarm optimization of polymer-embedded broadband metasurface reflectors,” Opt. Express, vol. 29, no. 26, p. 43421, 2021, https://doi.org/10.1364/OE.444112.Suche in Google Scholar

[60] M. M. R. Elsawy, S. Lanteri, R. Duvigneau, G. Brière, M. S. Mohamed, and P. Genevet, “Global optimization of metasurface designs using statistical learning methods,” Sci. Rep., vol. 9, no. 1, p. 1, 2019, https://doi.org/10.1038/s41598-019-53878-9.Suche in Google Scholar PubMed PubMed Central

[61] X. Liu, et al.., “Compatible stealth metasurface for laser and infrared with radiative thermal engineering enabled by machine learning,” Adv. Funct. Mater., vol. 33, no. 11, p. 2212068, 2023, https://doi.org/10.1002/adfm.202212068.Suche in Google Scholar

[62] W. Ma, et al.., “Pushing the limits of functionality‐multiplexing capability in metasurface design based on statistical machine learning,” Adv. Mater., vol. 34, no. 16, p. 2110022, 2022, https://doi.org/10.1002/adma.202110022.Suche in Google Scholar PubMed

[63] X. Chen, Y. Duan, R. Houthooft, J. Schulman, I. Sutskever, and P. Abbeel, “InfoGAN: Interpretable representation learning by information maximizing generative adversarial nets,” in Advances in Neural Information Processing Systems, Curran Associates, Inc., 2016. https://proceedings.neurips.cc/paper_files/paper/2016/hash/7c9d0b1f96aebd7b5eca8c3edaa19ebb-Abstract.html Accessed Jun 05, 2025.Suche in Google Scholar

[64] Y. Shen, J. Gu, X. Tang, and B. Zhou, “Interpreting the latent space of GANs for semantic face editing,” in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, IEEE, 2020, pp. 9240–9249.10.1109/CVPR42600.2020.00926Suche in Google Scholar

[65] R. S. Hegde, “Photonics inverse design: Pairing deep neural networks with evolutionary algorithms,” IEEE J. Select. Topics Quantum Electron., vol. 26, no. 1, pp. 1–8, 2020, https://doi.org/10.1109/JSTQE.2019.2933796.Suche in Google Scholar

[66] D. P. Kingma, T. Salimans, R. Jozefowicz, X. Chen, I. Sutskever, and M. Welling, “Improved variational inference with inverse autoregressive flow,” in Advances in Neural Information Processing Systems, Curran Associates, Inc., 2016. https://proceedings.neurips.cc/paper/2016/hash/ddeebdeefdb7e7e7a697e1c3e3d8ef54-Abstract.html Accessed Jun 07, 2025.Suche in Google Scholar

[67] I. Goodfellow, “NIPS 2016 tutorial: Generative adversarial networks,” arXiv: arXiv:1701.00160, 2017. https://doi.org/10.48550/arXiv.1701.00160.Suche in Google Scholar

[68] A. Ghosh, V. Kulharia, V. Namboodiri, P. H. S. Torr, and P. K. Dokania, “Multi-agent diverse generative adversarial networks,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 8513–8521.10.1109/CVPR.2018.00888Suche in Google Scholar

[69] N. Kodali, J. Abernethy, J. Hays, and Z. Kira, “On convergence and stability of GANs,” arXiv: arXiv:1705.07215, 2017. https://doi.org/10.48550/arXiv.1705.07215.Suche in Google Scholar

Supplementary Material

This article contains supplementary material (https://doi.org/10.1515/nanoph-2025-0210).

Received: 2025-05-06

Accepted: 2025-06-24

Published Online: 2025-07-15

This work is licensed under the Creative Commons Attribution 4.0 International License.

Supplementary Material Details

Artikel in diesem Heft

https://doi.org/10.1515/nanoph-2025-0210

Schlagwörter für diesen Artikel

metasurface design; generative model; high-fidelity; diverse design

Creative Commons

BY 4.0

Anchor-controlled generative adversarial network for high-fidelity electromagnetic and structurally diverse metasurface design

Artikel

Abstract

1 Introduction

2 Methods

3 Results

4 Discussion

5 Conclusions

References

Supplementary Material

Zusatzmaterial

Artikel in diesem Heft

Artikel in diesem Heft

Artikel in diesem Heft