
Machine-learning-assisted photonic device development: a multiscale approach from theory to characterization

Yuheng Chen, Alexander Montes McNeil, Taehyuk Park, Blake A. Wilson, Vaishnavi Iyer, Michael Bezick, Jae-Ik Choi, Rohan Ojha, Pravin Mahendran, Daksh Kumar Singh, Geetika Chitturi, Peigang Chen, Trang Do, Alexander V. Kildishev, Vladimir M. Shalaev, Michael Moebius, Wenshan Cai, Yongmin Liu and Alexandra Boltasseva
Published/Copyright: July 3, 2025
From the journal Nanophotonics

Abstract

Photonic device development (PDD) has achieved remarkable success in designing and implementing new devices for controlling light across various wavelengths, scales, and applications, including telecommunications, imaging, sensing, and quantum information processing. PDD is an iterative, five-step process that consists of: (i) deriving device behavior from design parameters, (ii) simulating device performance, (iii) finding the optimal candidate designs from simulations, (iv) fabricating the optimal device, and (v) measuring device performance. Classically, all these steps involve Bayesian optimization, material science, control theory, and direct physics-driven numerical methods. However, many of these techniques are computationally intractable, monetarily costly, or difficult to implement at scale. In addition, PDD suffers from large optimization landscapes, uncertainties in structural or optical characterization, and difficulties in implementing robust fabrication processes. The advent of machine learning over the past decade, however, has provided novel, data-driven strategies for tackling these challenges, including surrogate estimators for speeding up computations, generative modeling for noisy measurement modeling and data augmentation, reinforcement learning for fabrication, and active learning for experimental physical discovery. In this review, we present a comprehensive perspective on these methods to enable machine-learning-assisted PDD (ML-PDD) for efficient design optimization with powerful generative models, fast simulation and characterization modeling under noisy measurements, and reinforcement learning for fabrication. This review will provide researchers from diverse backgrounds with valuable insights into this emerging topic, fostering interdisciplinary efforts to accelerate the development of complex photonic devices and systems.

1 Introduction

Photonics studies light–matter interactions that encompass phenomena ranging in scale from macroscopic propagation in optical fibers to nanoscale interactions in photonic crystals, plasmonics, metamaterials, and quantum dots. The precise control of light has been made possible by engineering photonic structures, leading to the development of photonic devices for various applications, including telecommunications [1], [2], imaging [3], sensing [4], [5], and quantum information processing [6]. Photonic device development (PDD) encompasses the design, fabrication, and characterization of photonic devices to manipulate these phenomena and achieve targeted electromagnetic responses, enabling remarkable technologies such as quantum sensors [7] and advances in medicine [8].

Although PDD is often a non-trivial task, it can be deconstructed into five major steps: (i) theory, (ii) simulation, (iii) design, (iv) fabrication and (v) characterization (Figure 1). In the theory step, a model, often derived from Maxwell’s equations with multiphysics coupling, is chosen to characterize the device. This model includes all the design parameters to be optimized, such as material distribution parameters, electromagnetic sources, etc. Then, a simulation is constructed to characterize the device from its design parameters using numerical methods, such as the finite-difference time-domain method [9], the finite element method [10], and rigorous coupled-wave analysis [11]. Following the simulation, the design step uses human intuition, gradient descent, or global optimization methods to propose several designs and evaluate their performance via a figure of merit (FOM) score derived from the simulation. Designs span a vast parameter search space that is intractable to search exhaustively, so the design step is often computationally expensive [12]. As such, a significant amount of time is spent generating designs and numerically evaluating their performance until a sufficiently optimal design is obtained for the final fabrication and characterization steps. However, fabrication processes are difficult to optimize, and designs are often finalized without fabrication tolerances in mind. In addition, even a well-fabricated design can be subject to noisy or imprecise measurements when the FOM of the fabricated device is evaluated experimentally.

Figure 1: 
Overview of the photonic device development (PDD) process from theory to characterization. This iterative process begins by deriving a system of equations that computes the optical response for each device using electromagnetic theory. Secondly, the design’s performance and material properties are simulated using either a numerical framework, such as finite-difference time-domain (FDTD) or rigorous coupled-wave analysis (RCWA), or a pre-trained discriminative model. Then, the material distribution and its properties are generated in the design step using generative architectures such as VAEs, GANs, diffusion models, and, recently, hybrid quantum-classical models. Machine-learning-assisted inverse design techniques use simulation results to predict more optimal designs until a design is ready for fabrication. The optimal design from the generative model is then used as an ideal design for fabrication, where reinforcement learning, material vaccination, and smoothing algorithms can improve the fabrication quality. Lastly, the fabricated design is characterized through several measurements. High-quality characterization measurements can be added to a material database, used for generative modeling and data augmentation, and utilized by active learning methods to choose better fabrication parameters. Completing the design cycle, physical discovery methods like hypothesis learning and symbolic regression provide surrogate models for understanding the physics of new phenomena from measurement data.

In the last decade, the field of machine learning (ML) has experienced unprecedented growth, with new generative techniques, models, and optimizers to help photonics researchers tackle these complex problems [13], [14], [15], [16], [17], [18]. ML techniques can learn from time-consuming simulations to propose new parameters and designs that improve fabrication quality, reduce characterization noise, and discover numerical representations of physical trends for new materials. These models enable a human designer to select between well-optimized designs and greatly reduce the volume of data and designs an experienced designer must sift through. With the introduction of material databases, reinforcement learning, self-driving labs, hypothesis learning, and several new generative models, many of these problems are being actively addressed with significant success.

As such, this review explores ML-assisted PDD (ML-PDD), or the use of ML throughout the PDD process. ML-PDD contextualizes the five steps of PDD in the same Bayesian framework as modern ML, providing a structure for future work to build on. Each section of this review is based on a step of ML-PDD and explores the application of emerging ML techniques both from recent photonics works and within the larger ML community, which may benefit future work. Section 2 explores the integration of ML into the development and implementation of EM theory, including symbolic regression to derive physical laws from experimental data, such as material property estimation, and how ML models enhance interpretability and provide physical insight to theory. Then, in Section 3, discriminative models are discussed as surrogate forward simulators, while generative models are shown to reduce the data requirements to train accurate simulation networks, thereby improving efficiency. Next, Section 4 explores ML generative design strategies for effectively exploring high-dimensional design spaces, including the use of generative models such as GANs, VAEs, and diffusion models. Topics such as dimensional compression, latent space engineering, and addressing one-to-many mapping problems are also covered. Hybrid quantum-classical models are introduced as emerging tools for tackling complex design challenges. Subsequently, Section 5 addresses the challenges introduced by real-world fabrication errors absent in simulations. ML proves an ideal candidate to model the complex stochastic process that arises during fabrication, enabling more robust designs. Following that, Section 6 views characterization as an ML inference task to deduce optical properties from experimental measurements. Challenges posed by limited training data are mitigated through physics-informed data augmentation techniques and active learning approaches, enhancing the reliability of ML models in extracting insights from sparse or noisy measurement data. Finally, Section 7 summarizes how the integration of ML can enhance each step of PDD and offers a comprehensive outlook on the future potential of ML-PDD.

We now introduce ML-PDD and its necessary ML background information. The authors recommend these sources as background material for a deeper understanding of the statistical framework and electromagnetic theory used in ML-PDD [19], [20], [21].

1.1 Background

Photonics is deterministic in the classical limit of Maxwell’s equations, but designing, fabricating, and measuring real materials can be a noisy, probabilistic process because of large design spaces, fabrication errors, and noisy measurements. Modern ML is especially powerful at learning, optimizing, and manipulating these probabilistic processes, since this is what it was originally built for [19], [22]. ML starts by assuming that the data $x \sim p_{\text{data}}(x)$ are drawn from a data distribution $p_{\text{data}}(x)$, e.g., measurement errors, material distributions, spectra, etc. Classically, generative models $p_\theta(x)$, e.g., unconditioned variational autoencoders (VAEs) [23], generative adversarial networks (GANs) [24], and diffusion models [25], aim to learn the joint distribution $p_{\text{data}}(x)$ of all random variables in the data by maximizing the log-likelihood

(1) $\arg\max_\theta \, \mathbb{E}_{x \sim p_{\text{data}}(x)}\left[\log p_\theta(x)\right]$,

with various methods, including direct analysis, expectation maximization, and gradient descent [19]. Generative models have found applications everywhere a random process needs to be modeled, e.g., simple design generators (Section 4), measurement noise (Section 6), and data augmentation (Section 6).
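As a toy illustration of the objective in Eq. (1), the following sketch fits a one-dimensional Gaussian generative model $p_\theta(x)$ to data by gradient ascent on the log-likelihood; the data, model, and hyperparameters are illustrative placeholders rather than an example drawn from any cited work.

```python
# Minimal sketch of Eq. (1): fit a simple generative model p_theta(x) by
# maximizing the log-likelihood of observed data with gradient descent.
# Here p_theta is a 1D Gaussian with a learnable mean and log-std; real
# ML-PDD generative models (VAEs, GANs, diffusion models) optimize variants
# of the same objective with far more expressive parameterizations.
import math
import torch

x_data = torch.randn(1000) * 0.3 + 1.5           # stand-in for measured data from p_data(x)

mu = torch.zeros(1, requires_grad=True)           # theta = (mu, log_sigma)
log_sigma = torch.zeros(1, requires_grad=True)
opt = torch.optim.Adam([mu, log_sigma], lr=1e-2)

for step in range(2000):
    sigma = log_sigma.exp()
    # Gaussian log-likelihood log p_theta(x), averaged over the dataset
    log_p = -0.5 * ((x_data - mu) / sigma) ** 2 - log_sigma - 0.5 * math.log(2 * math.pi)
    loss = -log_p.mean()                          # arg max log-likelihood = arg min NLL
    opt.zero_grad(); loss.backward(); opt.step()

print(mu.item(), log_sigma.exp().item())          # should approach 1.5 and 0.3
```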

One limitation of generative models is that they are inherently restricted to learning the joint distribution of the data [22]. For example, data $x$ and their labels $y$ have a joint distribution $p_{\text{data}}(x, y)$, such as measurements and designs [26], measurement errors and measurement parameters [27], or fabrication artifacts and ideal designs [28]. However, a frequent goal in ML-PDD is to train a discriminative model $p_\theta(y|x)$ to learn the conditional distribution $y \sim p_{\text{data}}(y|x)$ for classification, regression, and conditional generative tasks such as classifying material properties [29], characterizing structural defects [30], and predicting the optical responses [31] of generated devices. Discriminative modeling is widely used when only certain aspects of a variable are of interest, such as a subset of devices $x$ or performance metrics $y$. For example, if a generative model $p_\theta(x)$ generates designs in a subset, a discriminative model $p_\theta(y|x)$ can be trained to more accurately compute the labels $y$ for designs sampled in that subset. Generally, discriminative models perform better in practice, and much of modern ML, including large language models [32], diffusion models [25], [33], etc., involves some form of conditional distribution [22]. Another advantage of discriminative models is that they can encompass varying degrees of randomness because they are conditional. In other words, they are often treated as parameterized, deterministic functions $p_\theta(y|x) = \delta_y(\hat{y}_\phi(x))$, where $\hat{y}_\phi(x)$ is a multi-layer perceptron [34], transformer [35], or convolutional neural network [36]. Discriminative models have found uses throughout ML-PDD, including efficient forward modeling of optical systems, providing real-time predictions of device behavior without requiring full numerical simulations (Section 3), and conditional generation of designs from additional design criteria (Section 4).

Many processes in PDD have distributions that belong to a known distribution family [19], e.g., Gaussian noise belongs to the exponential family, but the parameters of the distribution are unknown, such as Gaussian noise $\mathcal{N}(\mu, \sigma)$ with unknown mean $\mu$ and variance $\sigma^2$. For these simpler distributions, it is more effective to learn the parameters of the distribution using maximum likelihood estimators, e.g., $x \sim \mathcal{N}(\hat{\mu}, \hat{\sigma})$ with $\hat{\mu} = \frac{1}{n}\sum_i x^{(i)}$ and $\hat{\sigma} = \sqrt{\frac{1}{n}\sum_i \left(x^{(i)} - \hat{\mu}\right)^2}$, than it is to use a large machine learning model [19]. Maximum likelihood estimation is, in spirit, the driving force behind training both generative and discriminative models. Each model has a set of parameters, e.g., $p_\theta$ has $\theta$, that has to be optimized to sample new data from the distribution $p_{\text{data}}$. However, for many tasks with larger, unknown distribution families, such as in inverse design (Section 4), increasingly complex models are required to capture the nuances of the data. As the complexity of the model increases, estimating the optimal parameters becomes increasingly difficult, hence the explosion of large-scale, specialized hardware and distributed techniques for training these large models [32]. Modern generative ML models and training techniques exploit large-scale compute [32], [37] and gradient-based optimization [38] to find the optimal parameters $\theta$ that optimize Eq. (1) and its many variants [19], [22]. As such, ML-PDD takes advantage of these techniques to tackle difficult problems in PDD.
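For a distribution from a known family, the closed-form estimators above require only a few lines of code; the following sketch, with a synthetic dataset standing in for repeated noisy measurements, computes the Gaussian maximum likelihood estimates directly.

```python
# Minimal sketch: closed-form maximum likelihood estimates of a Gaussian's
# parameters, as an alternative to training a large model when the
# distribution family is known. The data array is a placeholder.
import numpy as np

x = np.random.normal(loc=2.0, scale=0.5, size=10_000)    # e.g., repeated noisy measurements

mu_hat = x.mean()                                          # (1/n) * sum_i x^(i)
sigma_hat = np.sqrt(((x - mu_hat) ** 2).mean())            # MLE of the standard deviation

print(f"estimated N({mu_hat:.3f}, {sigma_hat:.3f})")
```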

1.2 ML-assisted photonic device development (ML-PDD) framework

ML-PDD begins by deriving the optical properties of a photonic device from its parameters (see Section 2). Each device is represented by a parameter vector $x \in \mathcal{X}$ in a design space $\mathcal{X}$ that holds unique information about the design, for example, material topology height maps $\mathcal{X} \subseteq [0,1]^{n \times n}$ [39], [40], spectra $\mathcal{X} \subseteq \mathbb{R}^n$ [41], voltage biases $\mathcal{X} \subseteq [0,5]^n$, etc. Each device $x$ has a corresponding optical response $y(x)$, usually derived from Maxwell’s equations and any multiphysics coupling. Often, there is an ideal optical response $y^*$ that the design is optimized to realize, e.g., an emission spectrum $y^* \in \mathbb{R}^n$ or a 2D phase image $y^* \in [0, 2\pi)^{n \times n}$. In the simulation step, as discussed in Section 3, the optical response $y(x)$ is approximated with a numerical model $\hat{y}(x)$ derived from computational electromagnetic techniques [42]. A deterministic model $\hat{y}_\phi(x)$ with parameters $\phi$, known as a surrogate estimator model (Section 3), is trained to approximate the numerical model $\hat{y}(x)$, speeding up the inference time by orders of magnitude at the cost of some accuracy. The surrogate estimator model is deterministic, but it can be augmented to model noisy measurements, for example Gaussian noisy measurements $y = \hat{y}_\phi(x) + \epsilon$ with $\epsilon \sim \mathcal{N}(\mu, \sigma)$. Taking the surrogate estimator model $\hat{y}_\phi(x)$ and the ideal optical response $y^*$, the following design step (Section 4) constructs a figure of merit (FOM) score function $f(y)$ to rank designs based on their optical response, e.g., $f(y) = \|y^* - y\|_l$ for some $l$-norm. Then, a generative model $x \sim p_\theta(x)$ is trained (see Section 4) to sample designs that maximize the FOM in expectation

(2) $\arg\max_\theta \, \mathbb{E}_{x \sim p_\theta(x)}\left[f\big(\hat{y}_\phi(x)\big)\right]$.

Often, the design step requires several iterations $t = 1, \ldots, T$ of sampling designs $x^{(t)} \sim p_{\theta^{(t)}}(x)$, evaluating their performance $f\big(\hat{y}_\phi(x^{(t)})\big)$, and retraining the generative model $\theta^{(t)} \to \theta^{(t+1)}$ before choosing a sufficiently optimal set of designs $X^* = \{x^{(t_i)}\}_{i=1}^{N}$ for fabrication. Given these optimized designs $X^*$, a probabilistic fabrication process $r_\eta(\chi|x)$ (see Section 5) with fabrication parameters $\eta$, e.g., alignment, tolerances, etc., takes an intended design $x \in X^*$ and fabricates a device $\chi$. Fabrication is a very difficult and time-consuming process that cannot fully realize a design $x$. Therefore, techniques such as reinforcement learning have been crucial in reducing randomness and improving the final quality of designs, as discussed in Section 5. Once the design is fabricated, $\chi \sim r_\eta(\chi|x)$, the optical response of the fabricated design is measured via a noisy measurement device $\upsilon \sim m_\rho(\upsilon|\chi)$ with noisy measurements $\upsilon$ and measurement parameters $\rho$, as discussed in Section 6. Naturally, to accommodate noisy measurements, the FOM is augmented to $\hat{f}$, which uses a finite sample of noisy measurements $\Upsilon = \{\upsilon^{(i)} \sim m_\rho(\upsilon|\chi)\}_{i=1}^{M}$. The overall objective of ML-PDD is to optimize the device performance in this noisy environment

(3) $\arg\max_{\theta, \eta, \rho} \, \mathbb{E}_{x \sim p_\theta(x)}\Big[\mathbb{E}_{\chi \sim r_\eta(\chi|x)}\big[\mathbb{E}_{\upsilon \sim m_\rho(\upsilon|\chi)}[\hat{f}(\Upsilon)]\big]\Big]$.

To make the ML-PDD objective tractable, assumptions are made about the fidelity of measurements and the optimality of designs and fabrication processes, and individual steps are isolated and optimized. For example, in inverse design, the numerical simulation FOM is often treated as the ground truth without considering fabrication processes $r_\eta(\chi|x)$ or noisy measurements $m_\rho(\upsilon|\chi)$. As such, the remainder of this review will explore each step both in isolation and coupled to other steps, broken down into theory (Section 2), simulation (Section 3), design (Section 4), fabrication (Section 5) and characterization (Section 6).
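To make the nested expectations of Eq. (3) concrete, the following sketch estimates the objective by Monte Carlo sampling; the design sampler, fabrication model, measurement model, and FOM are simple hypothetical placeholders for $p_\theta$, $r_\eta$, $m_\rho$, and $\hat{f}$.

```python
# Minimal sketch of the nested objective in Eq. (3), estimated by Monte Carlo
# sampling. All functions below are hypothetical placeholders standing in for a
# trained generator p_theta, a fabrication model r_eta, a measurement model
# m_rho, and a measurement-based FOM f_hat.
import numpy as np

rng = np.random.default_rng(0)

def sample_design(n_params=8):                    # x ~ p_theta(x)
    return rng.uniform(0.0, 1.0, size=n_params)

def fabricate(x):                                 # chi ~ r_eta(chi | x): parameter jitter
    return np.clip(x + rng.normal(0.0, 0.02, size=x.shape), 0.0, 1.0)

def measure(chi, n_meas=16):                      # Upsilon = {upsilon^(i) ~ m_rho(upsilon | chi)}
    true_response = chi.sum() / chi.size          # toy "optical response"
    return true_response + rng.normal(0.0, 0.01, size=n_meas)

def fom_hat(measurements, target=0.7):            # FOM estimated from a finite noisy sample
    return -abs(measurements.mean() - target)

# Outer Monte Carlo loop over designs, fabrications, and measurements
scores = [fom_hat(measure(fabricate(sample_design()))) for _ in range(64)]
print("estimated objective:", np.mean(scores))
```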

2 Theory

Classically, electromagnetic (EM) theory has been developed through experimental observations, theoretical reasoning, and mathematical synthesis [43], [44]. Nowadays, applying EM theory to investigate specific systems often involves large computational requirements while struggling to fully capture the richness of photonic phenomena [44]. ML offers a compelling alternative by enabling the discovery of hidden mathematical relationships [45], predicting material properties with remarkable accuracy [46], and solving intricate EM systems that defy conventional techniques [47], empowering researchers to rethink the discovery of governing physical laws and enhance the interpretability of results. This section explores developments in the derivation of the optical response $y(x)$, emphasizing the new role that ML may play in the development of future photonic EM theories: specifically, discovering new governing laws via symbolic regression and providing interpretable characterization of $y(x)$ through explainable AI.

2.1 Discovering governing laws

A central application of ML in EM theory is the discovery of governing equations and physical laws to model optical responses. Traditional approaches to understanding EM phenomena often involve laborious analytical derivations or heuristic methods informed by experimental observations [44]. ML models, particularly symbolic regression and neural networks, automate this process by identifying mathematical relationships within data [47]. A rapidly growing Python implementation known as PySR [48] combines symbolic regression with genetic programming to construct analytic expressions. Similar techniques have successfully rediscovered Maxwell’s equations by analyzing datasets of electric and magnetic field values in various scenarios [49]. Beyond rediscovering well-established laws, ML has been employed to hypothesize new governing equations in complex environments, such as through experimental physical discovery. For instance, ML has identified novel constitutive relationships that better describe the interaction of fields and materials in metamaterials and plasmas, where traditional models often struggle due to high degrees of complexity and anisotropy [50]. These breakthroughs enable more accurate predictions and a deeper understanding of exotic materials and devices. For example, neural networks trained on EM field distributions and boundary conditions can infer constitutive relations for materials [16], [51], such as the relationship between electric field intensity and polarization in nonlinear media of photonic devices [13].
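As a hedged illustration of how symbolic regression might be applied in this context, the sketch below uses the PySR interface to recover a simple Cauchy-like dispersion relation from synthetic data; the dataset, operator set, and hyperparameters are placeholders and do not reproduce any of the studies cited above.

```python
# Illustrative sketch of symbolic regression with PySR: recover a simple
# dispersion-like relation n(lambda) from synthetic data. The target law and
# all settings are toy assumptions chosen for the example.
import numpy as np
from pysr import PySRRegressor

wavelength = np.random.uniform(0.4, 1.6, size=(500, 1))     # um
n_index = 1.5 + 0.01 / wavelength[:, 0] ** 2                 # toy Cauchy-like law

model = PySRRegressor(
    niterations=40,
    binary_operators=["+", "-", "*", "/"],
    model_selection="best",
)
model.fit(wavelength, n_index)      # genetic-programming search for an analytic expression
print(model)                        # prints the Pareto front of candidate equations
```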

2.2 Enhancing interpretability and physical insight

One of the key challenges in ML-PDD, in general, is integrating EM theory into training to ensure that models are interpretable and aligned with physical principles [52], [53]. Advances in ML techniques, such as attention mechanisms and feature importance analysis [54], have made it possible to reveal explanations for models’ decisions and enhance our understanding of the underlying physics. For example, Yeung et al., in Figure 2(a)–(c), used a three-step explainable ML approach to uncover the relationship of specific regions of a nanophotonic structure to the presence of an absorption peak, leading to a better understanding of the behavior of complex nanophotonic devices and enabling more efficient and targeted design improvements [52].

Furthermore, by highlighting the most critical parameters, ML models could also provide actionable insights that guide iterative design processes, estimating important parameters such as permittivity, permeability, or conductivity by analyzing data from spectroscopy, reflectometry, or scattering experiments [55], [56]. For instance, Yesilyurt et al. employed a discriminative ML model to extract material refractive indices and loss coefficients from the spectral transmission and reflection data for the realistic design and fabrication of single material variable-index multilayer films in Figure 2(d) and (e) [29]. The classification of modes and band structures is also essential for understanding and optimizing photonic device performance. ML models can efficiently identify guided modes, radiative modes, or photonic band gaps based on structural and material inputs to enhance our understanding of photonics devices [57]. For example, Martinez-Manuel et al. applied support vector machines (SVMs) (Figure 2(f)) for unambiguous refractive index measurement of the fiber fundamental mode in Figure 2(g) [58]. Similarly, Li et al. utilized deep learning approaches to analyze photonic band structures of phononic crystals, identifying band gaps and key dispersion features [59].
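A minimal sketch of an SVM classifier in the spirit of the mode/refractive-index classification tasks described above is shown below; the features and labels are synthetic placeholders, not the fiber-sensor data of ref. [58].

```python
# Hedged sketch of an SVM classifier for spectral features; data are synthetic
# placeholders (e.g., sampled transmission values with two mode/index classes).
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X = np.random.randn(400, 16)                     # toy spectral feature vectors
y = (X[:, :4].sum(axis=1) > 0).astype(int)       # toy binary class labels

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=0)
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
clf.fit(X_train, y_train)
print("test accuracy:", clf.score(X_test, y_test))
```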

Figure 2: 
Machine learning applications in photonic theory. Three steps of explainable ML elucidating the behavior of nanophotonic structures: (a) converting 3D metal–dielectric–metal metamaterials into 2D representations. (b) Training a convolutional neural network (CNN) to predict the electromagnetic response. (c) Elucidating the underlying physics learned by explaining the relationships between structural features and predicted parameters to construct new designs with new target properties. Subfigures a, b, and c are adapted with permission from ref. [52]. Copyright 2020 American Chemical Society. (d) Network architecture for fabrication-in-loop NN-based inverse design of single-material multilayer optical stacks with continuously changing refractive index in simulated fabrication case. (e) Measured and experimentally retrieved layer parameters are integrated back into the optimization cycle. Subfigures d and e are adapted with permission from ref. [29]. Copyright 2023 the author(s), published by De Gruyter, Berlin/Boston, licensed under the Creative Commons Attribution 4.0 International License. (f) Flowchart of the SVM model and (g) diagram of the classification process in unambiguous refractive index measurement. Subfigures f and g adapted with permission from ref. [58]. Copyright 2022, IEEE.

Latent space engineering in generative models, such as autoencoders, offers another avenue for interpretability (see Section 4). These models encode high-dimensional data, like photonic design geometries or spectral responses, into compact representations that reveal patterns and relationships. For instance, clustering designs in latent space based on efficiency metrics can help researchers identify common features of high-performance devices [60].
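A minimal sketch of such latent-space clustering is given below, assuming encoded design vectors and their efficiencies are already available; here both are random placeholders standing in for encoder outputs and simulated FOMs.

```python
# Hedged sketch of latent-space clustering for interpretability: group encoded
# designs by k-means and compare the mean efficiency of each cluster.
import numpy as np
from sklearn.cluster import KMeans

latents = np.random.randn(500, 16)          # z = encoder(design) for 500 designs (placeholder)
efficiency = np.random.rand(500)            # corresponding FOM / efficiency values (placeholder)

labels = KMeans(n_clusters=5, n_init=10, random_state=0).fit_predict(latents)
for k in range(5):
    print(f"cluster {k}: mean efficiency {efficiency[labels == k].mean():.3f}")
# High-efficiency clusters can then be decoded and inspected for shared structural features.
```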

2.3 Summary and outlook

ML is transforming how we apply EM theory and materials science in PDD. As ML continues to evolve, its integration with foundational physics principles offers a promising avenue for breakthroughs in device performance, material characterization, and the exploration of new photonic device frontiers. However, achieving this vision requires addressing key research gaps, such as overcoming data scarcity through efficient learning methods and enhancing the robustness, interpretability, and real-time adaptability of ML models. By tackling these challenges, researchers can foster a synergistic ML-EM paradigm that drives innovation in photonics and unlocks unprecedented opportunities for scientific and technological advancement.

3 Simulation

Optical simulations computationally approximate the optical response $y(x)$ of each design $x$ via a numerical model $\hat{y}(x)$, including reflection, transmission, absorption, scattering, EM field distribution, chirality, and polarization. These simulations play a crucial role in studying light–matter interactions in photonic structures and in the design and optimization of photonic devices. In PDD, optical simulations predict the performance of designed structures and evaluate candidate structures during the iterative design process outlined in Section 4. Traditionally, the optical response of a structure has been obtained by numerically solving Maxwell’s equations. Since numerical simulations directly solve the governing equations, they are often treated as reliable, and their accuracy can be improved by refining material meshes, increasing the number of harmonics in the simulation, etc. [42]. However, obtaining accurate numerical solutions is computationally intensive, particularly for complicated structures and systems, and restricts the rate of design evaluation throughout the PDD.

In contrast, ML models can act as efficient and reliable surrogate forward simulators, providing a faster alternative to time-consuming numerical simulations. ML models treat the simulation process as a discriminative regression task, mapping input structures x to output characteristics y based on labeled datasets of input-output pairs. Once well-trained, these simulation ML models offer much faster computation speeds at the cost of some accuracy.

This section discusses how ML techniques provide a fast and reliable alternative to traditional full-wave calculations. The role of ML models in this section is categorized into two main aspects: first, how discriminative models enable computationally efficient predictions of device responses, i.e., training ML models as the approximator $\hat{y}_\phi(x)$, and second, how generative models reduce the data demands required for training ML simulator models, i.e., how to increase the quality and quantity of training data for $\hat{y}_\phi(x)$. These techniques demonstrate that ML-enabled, data-driven simulations are powerful and agile tools that accelerate the evaluation of complex photonic designs, making the design process more efficient.

3.1 Surrogate forward simulator models

ML-based surrogate estimators are discriminative and data-driven, involving an inference task that approximates the optical response $\hat{y}(x)$ of a device design $x$. Therefore, discriminative models that learn the complex relationship between input and output based on a training dataset of labeled input-output pairs are well-suited for simulation problems. Since the models generate data-driven predictions of the output response from the input structure using ML, they offer orders-of-magnitude speed-up over numerical simulations, which directly solve simulation problems. In the context of ML applications for simulations, both the input and output are typically represented as vectors. For optical devices with a low degree of freedom, such as multilayer films and periodic structures containing simple geometric patterns, the structures can be fully represented by a vector $x$ comprising a series of thicknesses or geometric parameters, as depicted in Figure 3(a) and (b). For high-degree-of-freedom structures, such as those containing freeform patterns, the structures can be represented using binary images, where materials are distinguished by ‘0’ and ‘1’, as shown in Figure 3(c) and (e). For the output, spectra, for example, can be sampled at a sufficiently fine rate to construct a vector representation of $\hat{y}(x)$. A variety of simulation problems in optics have been demonstrated using discriminative neural networks.

Figure 3: 
Optical simulations using deep learning methods. (a) A fully connected neural network predicts the scattering spectra of multilayered core–shell nanoparticles. Adapted with permission from ref. [31]. Copyright 2018 the authors, distributed under the Creative Commons Attribution-Non Commercial license. (b) A simple geometry of chiral metamaterials, consisting of two gold twisted split-ring resonators, is represented using a few geometric parameters. A neural network model predicts circular dichroism from the given geometry. Reprinted with permission from ref. [61]. Copyright 2018 American Chemical Society. (c) Freeform meta-atoms are represented as 64 × 64 binary images. The prediction of the real and imaginary components of the spectral responses is obtained through CNN. Adapted with permission from ref. [62]. Copyright 2020 Optical Society of America. (d) The architecture of the CcGAN, which generates synthetic spectra for data augmentation. The generated spectra were conditioned on the temperature label y. The performance of the ML temperature prediction model was significantly enhanced through data augmentation using the CcGAN. Adapted with permission from ref. [63]. Copyright 2023 Optica Publishing Group. (e) Self-supervised learning is applied in a VAE architecture. The encoder predicts reflection spectra as well as compresses the input structure into a latent vector, enabling the model to train with both labeled and unlabeled data. Adapted with permission from ref. [64]. Copyright 2019 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

The simulation of optical devices with a low degree of freedom, including layered structures [31], [65], [66] and patterns with primitive geometries [61], [67], can be conducted using the fundamental discriminative model, fully connected networks. Peurifoy et al. employed them to approximate light scattering from a dielectric spherical nanoparticle with alternating silica and titanium oxide shells [31]. The model achieved high precision in predicting scattering spectra and generalized well to the samples that were not seen during the training, enabling it to serve as a surrogate model for forward simulation of such structures. These forward simulation networks play a crucial role in inverse design by being integrated with inverse design networks that map the optical responses to design parameters. For layered structures, these networks have facilitated the inverse design of core–shell nanoparticles to achieve specific electric and magnetic dipole extinction spectra [66], as well as thin-film structures of alternating SiO2 and Si3N4 for target transmission spectra [65]. Similarly, meta-atoms with primitive geometries, represented by vectors of a few geometric parameters, can also be simulated and designed using neural networks. For instance, deep learning-based simulations and inverse designs have been demonstrated for chiral metamaterials composed of two twisted gold split-ring resonators separated by dielectric spacers [61], as well as metasurfaces with H-shaped gold nanostructure on top of ITO-covered glass [67].
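The following sketch shows the basic structure of such a fully connected forward surrogate, mapping a small vector of design parameters to a sampled spectrum; the layer sizes, placeholder dataset, and training settings are illustrative assumptions rather than the configurations used in the cited works.

```python
# Minimal sketch of a fully connected forward-surrogate network: design
# parameters (e.g., layer thicknesses) in, sampled spectrum out.
import torch
import torch.nn as nn

n_params, n_wavelengths = 8, 200
model = nn.Sequential(
    nn.Linear(n_params, 256), nn.ReLU(),
    nn.Linear(256, 256), nn.ReLU(),
    nn.Linear(256, n_wavelengths),            # predicted spectrum, i.e., \hat{y}_phi(x)
)

# Placeholder dataset: (x, y) pairs that would come from full-wave simulations
x_train = torch.rand(4096, n_params)
y_train = torch.rand(4096, n_wavelengths)

opt = torch.optim.Adam(model.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()
for epoch in range(50):
    pred = model(x_train)
    loss = loss_fn(pred, y_train)             # regression onto simulated spectra
    opt.zero_grad(); loss.backward(); opt.step()
```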

For the simulation of optical structures with a high degree of freedom, more advanced models are required to ensure accurate predictions. Convolutional neural networks (CNNs), which utilize convolution operations with kernels to extract local features and capture spatial hierarchies in input data, are well-suited for simulating optical devices with complex geometries. Freeform patterns, represented as images, benefit from CNNs due to their ability to efficiently capture local correlations within the image, making them a desirable choice for such simulations. An et al. simulated the spectral responses of dielectric metasurfaces, where quasi-freeform meta-atoms were represented by 64 × 64 pixel images, using a CNN [62]. The trained CNN simulator not only demonstrated high precision in predicting the responses of quasi-freeform patterns in the test dataset but also generalized well to circle- and ring-shaped meta-atoms, which were entirely different geometries from those in the training dataset. Wiecha et al. successfully simulated the internal fields of arbitrary 3D nanostructures using a CNN in a U-net-like architecture with residual connections [68]. This sophisticated model enabled accurate simulations that reproduced complex physical effects in both plasmonic and dielectric nanostructures. Furthermore, an integrated architecture combining a ResNet-style CNN with a recurrent neural network (RNN) demonstrated a good match with numerical simulations of the absorption spectra of periodic structures consisting of silver on top of a glass substrate [69]. In this setup, the CNN extracted spatial features, while the RNN predicted the spectra based on the output from the preceding CNN.
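A comparable CNN-based surrogate for image-represented freeform patterns might look like the following sketch; the architecture, input size, and spectral sampling are illustrative assumptions.

```python
# Minimal sketch of a CNN surrogate mapping a 64 x 64 binary meta-atom image to
# a sampled spectral response.
import torch
import torch.nn as nn

class SpectrumCNN(nn.Module):
    def __init__(self, n_wavelengths=200):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),    # 64 -> 32
            nn.Conv2d(16, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 32 -> 16
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2),   # 16 -> 8
        )
        self.head = nn.Sequential(nn.Flatten(), nn.Linear(64 * 8 * 8, 512),
                                  nn.ReLU(), nn.Linear(512, n_wavelengths))

    def forward(self, x):                     # x: (batch, 1, 64, 64) binary pattern
        return self.head(self.features(x))

model = SpectrumCNN()
patterns = (torch.rand(32, 1, 64, 64) > 0.5).float()    # placeholder binary designs
spectra = model(patterns)                                # predicted responses
print(spectra.shape)                                     # torch.Size([32, 200])
```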

Recently, instead of data-driven simulations that learn functions $x \mapsto y$, which map input vectors to output vectors, neural operators, which learn operators $x(r) \mapsto y(r)$ that map between functions, have gained significant attention. Neural operator architectures consist of multiple layers of linear integral operators followed by nonlinear activations [70]. Neural operators have been applied to learn solution operators for partial differential equations (PDEs), including Darcy flow, Burgers’ equation, and the Navier–Stokes equations, enabling significantly faster simulation than conventional numerical solvers [70], [71]. Moreover, neural operator models generalize well across different levels of discretization, as they learn mappings between continuous functions, whereas traditional vector-to-vector neural networks are heavily dependent on the discretization scheme [72]. The idea of using neural operators to solve PDEs has also been extended to Maxwell’s equations for electromagnetic simulations. Gu et al. proposed the NeurOLight framework, which combines a PDE encoder with a neural-operator-based backbone to build a surrogate forward solver for simulating multi-mode interference photonic devices [73]. NeurOLight enables efficient simulation of a family of parametric photonic devices, rather than solving only a single instance or one conditioned on fixed parameters.

3.2 Data augmentation using generative models

While the forward simulation problem is inherently discriminative from a machine learning perspective, generative models can play a role in enhancing the data efficiency of simulator network training. Since the training of simulator networks is data-intensive, achieving highly accurate simulator networks requires a large amount of labeled data – input structure and output optical response pairs – obtained through computationally expensive numerical simulations. Reducing dependency on data can be achieved by incorporating known optical knowledge [74], [75], [76] and employing machine learning techniques such as transfer learning [77], [78], [79] and generative models [63], [64], [80], [81]. Physical knowledge serves as a form of regularization during the training of simulator networks, helping the network find valid solutions with less data. Transfer learning is a technique where a model trained on a source task with a fairly large dataset is used to initialize the trainable parameters for a target task. This approach enables faster convergence and improved performance on the target task, even when the target dataset is relatively small, by leveraging the knowledge learned from the source task. Furthermore, generative models alleviate the burden of data collection by augmenting datasets through their generative capabilities and enabling self-supervised learning within encoder-decoder architectures.

Generative models enable efficient data-driven simulation by generating synthetic pseudo-labeled samples, which support semi-supervised learning for simulator networks by leveraging the generated data. Kim et al. employed a denoising diffusion probabilistic model (DDPM) to enhance the performance of a simulator network designed to predict the transmission spectra of photonic crystal waveguide unit cells [80]. The DDPM was trained on images of photonic crystal patterns, allowing it to generate unlabeled new patterns later labeled by a pretrained CNN simulator network. Since these newly generated patterns from the DDPM were produced by learning the probabilistic distribution of the training data, the pseudo-labeling process achieved a high-fidelity approximation using the pretrained simulator. Additionally, Zhu et al. introduced a continuous conditional GAN (CcGAN) to generate synthetic spectra of a self-interference microring resonator sensor at different operating temperatures [63]. As illustrated in Figure 3(d), the CcGAN was conditioned on the temperature, and the corresponding generated spectra were combined with the original small training dataset for data augmentation. By leveraging the augmented data with synthetic pseudo-labeled samples for training ML forward prediction networks, the prediction accuracy improved, and the error distribution shifted toward lower losses.
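The pseudo-labeling workflow behind these augmentation schemes can be sketched as follows, with untrained placeholder networks standing in for a trained generator (e.g., a DDPM or CcGAN) and a pretrained surrogate simulator.

```python
# Hedged sketch of generative data augmentation via pseudo-labeling: a trained
# generator proposes new structures, a pretrained surrogate labels them, and the
# synthetic pairs are merged with the original dataset. Both models below are
# random placeholders, not trained networks.
import torch
import torch.nn as nn

generator = nn.Sequential(nn.Linear(32, 256), nn.ReLU(), nn.Linear(256, 64 * 64), nn.Sigmoid())
simulator = nn.Sequential(nn.Linear(64 * 64, 256), nn.ReLU(), nn.Linear(256, 200))

# Original (small) labeled dataset from numerical simulations
x_real = torch.rand(512, 64 * 64)
y_real = torch.rand(512, 200)

# Synthetic structures and their pseudo-labels
with torch.no_grad():
    x_synth = generator(torch.randn(2048, 32))
    y_pseudo = simulator(x_synth)

x_aug = torch.cat([x_real, x_synth])       # augmented training set for a new or
y_aug = torch.cat([y_real, y_pseudo])      # fine-tuned simulator network
print(x_aug.shape, y_aug.shape)
```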

Another approach to achieving efficient data-driven optical simulation using generative models is the application of self-supervised learning in encoder-decoder architectures [64], [81]. Ma et al. utilized a variational autoencoder (VAE), where the encoder not only compresses the input nanophotonic structure into a latent vector but also predicts the corresponding spectral response. This setup enables the self-supervised learning strategy to improve the performance of nanophotonic structure characterization [81]. As illustrated in Figure 3(e), in this VAE architecture, an additional loss term for prediction accuracy was introduced, along with the reconstruction error and KL divergence. During training, both structures from readily available labeled data obtained from numerical simulations and dynamically generated structures were fed online into the encoder. The ground truth spectra were used to calculate the loss for labeled samples, while the generated structures were pseudo-labeled by the encoder itself. This approach allowed the training process to be self-supervised by the encoder. The total loss, which included the prediction error, was backpropagated through the entire encoder-decoder structure, leading to a further reduction in prediction loss compared to a fully supervised counterpart, thereby improving the simulation capability of the model.

3.3 Physics-inspired constraints in model training

A key advancement in ML modeling for photonics is the incorporation of physics-theory-based constraints into model training. For instance, by embedding Maxwell’s equations as regularization terms or constraints, ML models can ensure that their outputs adhere to fundamental physical laws [82]. This not only reduces the reliance on large datasets but also improves the interpretability and reliability of the ML models for both simulation and design of photonic devices. In particular, surrogate simulator models $\hat{y}_\phi(x)$ are trained by minimizing a loss function dependent on the parameters $\phi$. By incorporating physical constraints into the loss function, the surrogate estimator $\hat{y}_\phi(x)$ becomes more interpretable and physics-informed, and thereby better able to approximate the optical response $\hat{y}(x)$.
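A hedged sketch of such physics-informed training is shown below: a data-fitting loss is combined with a penalty on the residual of a governing equation. A one-dimensional Helmholtz residual is used purely as a stand-in for Maxwell-based constraints, and the network, collocation points, and weighting are illustrative assumptions.

```python
# Hedged sketch of a physics-informed loss: data term + penalty on the residual
# of a toy 1D Helmholtz equation u'' + k^2 u = 0 (stand-in for Maxwell-based terms).
import math
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(1, 64), nn.Tanh(), nn.Linear(64, 64), nn.Tanh(), nn.Linear(64, 1))
k = 2.0 * math.pi                                           # wavenumber of the toy problem

def physics_residual(z):
    z = z.requires_grad_(True)
    u = model(z)
    du = torch.autograd.grad(u.sum(), z, create_graph=True)[0]
    d2u = torch.autograd.grad(du.sum(), z, create_graph=True)[0]
    return d2u + (k ** 2) * u                               # Helmholtz residual

z_data = torch.rand(64, 1); u_data = torch.sin(k * z_data)  # sparse "measurements"
z_col = torch.rand(256, 1)                                  # collocation points

opt = torch.optim.Adam(model.parameters(), lr=1e-3)
for step in range(2000):
    loss = nn.functional.mse_loss(model(z_data), u_data) \
         + 0.1 * physics_residual(z_col).pow(2).mean()      # physics regularization
    opt.zero_grad(); loss.backward(); opt.step()
```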

Physical constraints play a pivotal role in reducing the effort required for data collection in training models. In PDD, assuming symmetry, e.g., rotational or mirror symmetry about an axis, simplifies the design space and computational requirements. For instance, simulations can be restricted to a quarter or half of the unit cell by applying symmetric boundary conditions along the axes, depending on the type of symmetry involved. Symmetry also ensures continuity and connectivity at the boundaries between neighboring unit cells, which is critical in the design of gratings. Additionally, symmetry is often leveraged to augment training datasets and regularize machine learning models, ensuring improved performance and data efficiency. For example, in periodic structures, optical principles dictate that the spectrum is invariant under transformations such as translation, flipping, and 180° rotation [12], [83]. Furthermore, 90° or 270° rotations of the pattern induce cross-polarization, effectively swapping the x and y polarization components while maintaining the overall spectral properties. These symmetries enable data augmentation for simulator networks that predict optical characteristics by including structure–characteristics pairs induced by physics constraints, as sketched below [84], [85]. This approach is not limited to discriminative simulator models but also applies to generative models for design, significantly reducing the reliance on computationally expensive numerical simulations or topology optimizations [12], [83].
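The sketch below applies these symmetry rules to a single simulated unit cell, generating several additional training pairs; the arrays are placeholders, and the polarization-swap convention follows the discussion above.

```python
# Minimal sketch of symmetry-based data augmentation for a periodic-structure
# dataset: flips and 180-degree rotations keep the spectrum unchanged, while
# 90/270-degree rotations swap the x- and y-polarized responses.
import numpy as np

def augment(pattern, spec_x, spec_y):
    """pattern: (H, W) binary unit cell; spec_x/spec_y: spectra per polarization."""
    samples = [(pattern, spec_x, spec_y)]
    # Spectrum-preserving symmetries
    samples.append((np.flipud(pattern), spec_x, spec_y))
    samples.append((np.fliplr(pattern), spec_x, spec_y))
    samples.append((np.rot90(pattern, 2), spec_x, spec_y))
    # Cross-polarizing symmetries: 90 and 270 degree rotations swap polarizations
    samples.append((np.rot90(pattern, 1), spec_y, spec_x))
    samples.append((np.rot90(pattern, 3), spec_y, spec_x))
    return samples

cell = (np.random.rand(64, 64) > 0.5).astype(float)       # placeholder unit cell
sx, sy = np.random.rand(200), np.random.rand(200)         # placeholder spectra
print(len(augment(cell, sx, sy)), "training samples from one simulation")
```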

Incorporating physical knowledge about resonances [74], [75] and electrical field distributions [76], [86] as constraints into the model training process serves as a form of regularization, further enhancing data efficiency. In optics, integrating Maxwell’s equations as a loss function has proven particularly effective for solving inverse problems. Applications include tasks such as permittivity retrieval [87], [88], designing invisible cloaking devices [87], and optimizing meta-lenses [89], [90], all achieved with significantly reduced data requirements. These methods highlight the importance of embedding physical principles to improve the robustness and efficiency of machine learning models in photonics.

In addition, fabricability imposes significant physical constraints, requiring the elimination of intricate design features that are infeasible to fabricate and ensuring robustness against fabrication errors [12], [83], [91], [92], as we will discuss in detail in Section 5.

3.4 Summary and outlook

In this section, we explored how ML simulator models can perform simulations much faster than numerical approaches by predicting the optical response of photonic devices in a data-driven manner, rather than directly solving equations. However, these models require training on a large amount of labeled data, which must inevitably be obtained from time-consuming numerical simulations to achieve sufficiently accurate results. Generative ML models mitigate the burden of data collection through self-supervised learning, leveraging generative techniques to synthesize labeled data, while physical constraints contribute through data augmentation and regularization during model training. Therefore, well-trained simulator models, combined with effective data generation strategies, can expedite the iterative performance evaluation of intermediate designs, enabling computationally efficient PDD. With fast performance evaluation of devices in place, how to effectively explore the large design space and identify optimal designs will be discussed in detail in Section 4.

4 Design

Generating designs $x \sim p_\theta(x)$ in PDD requires sampling complicated, sparse, and high-dimensional design spaces $\mathcal{X}$ to achieve specific optical responses $y^* = \hat{y}(x)$, such as spectra, bandwidth, or polarization. Many of the classical techniques, such as adjoint optimization, physics-inspired optimization, or even evolutionary algorithms, are iterative and directly optimize Eq. (2) by first randomly proposing a design $x^{(0)} \sim p(x^{(0)})$ and then iteratively proposing new designs $p(x^{(t+1)}|x^{(t)}, t)$ until the FOM $f(\hat{y}^{(t)})$, where $\hat{y}^{(t)} = \hat{y}_\phi(x^{(t)})$, saturates at a locally optimal value. Many classical design methods use gradient-based optimizers such as adjoint optimization [93], [94] and gradient descent [15], [95], and evaluate the FOM using the computationally expensive numerical model $\hat{y}(x)$. As such, classical methods are computationally expensive, slow, and prone to getting stuck in local minima, where the change in FOM $\Delta f(y^{(t)})/\Delta t \to 0$ between design proposals decreases rapidly with $t$. On top of that, the design generation process requires a delicate balance between performance optimization and adherence to constraints, e.g., physical, geometric, and fabrication constraints.

To address this delicate balance, generative models such as generative adversarial networks (GANs), variational autoencoders (VAEs), and diffusion models, along with reinforcement learning (RL) and hybrid quantum-classical models, have rapidly proven especially effective at tackling these problems. These models are uniquely suited for exploring vast and sensitive design spaces, enabling the creation of innovative and highly customized device configurations. They can rapidly generate solutions and learn complex relationships between design parameters and performance metrics such as size, material properties, and operational robustness, allowing for the automated discovery of non-intuitive solutions. Additionally, generative models allow for the probabilistic modeling of design spaces, making it possible to predict and mitigate variations or uncertainties in performance. By accelerating the exploration of design possibilities and enhancing the precision of results, ML technologies, especially generative models, are reshaping the photonics design process, driving innovation, and enabling the creation of cutting-edge optical devices. In particular, the one-to-many mapping problem is significant in photonics design because a single desired optical response can often correspond to multiple distinct designs, making traditional optimization approaches inefficient and prone to converging on suboptimal solutions. Probabilistic modeling facilitated by generative models enables the exploration of diverse, non-intuitive solutions that satisfy the same performance criteria. ML-driven optimization techniques are growing in popularity for their ability to reduce the dimensionality of the design space [96], [97], facilitate the exploration of new designs [64], [98], [99], perform global optimization of photonic devices [12], [83], [100], address the one-to-many mapping problem in inverse design [101], [102], and explore new physics in highly complex photonic structures [103]. Before applying any of these models, the FOM must be constructed carefully.

4.1 Constructing the figure of merit (FOM)

A figure of merit (FOM) $f(y) \in \mathbb{R}$ is a scalar measure of device performance that condenses high-dimensional optical response labels $y$ into a single metric to facilitate optimal device design. A higher FOM value for an optical response $y_i$ compared to another response $y_j$, i.e., $f(y_i) > f(y_j)$, indicates that $y_i$ is "closer" to the optimal response $y^*$.

A common approach to defining FOMs for deterministic optical responses is to use a normalized L-norm loss against the optimal response $y^*$, i.e., $\|y - y^*\|_l$ [104]. In simpler cases, when the response has to be maximized to some finite value, such as a reflection or transmission coefficient or a normalized power efficiency, the FOM can be directly defined as $f(y) \equiv y$. Deterministic formulations are the most common, since any finite sampling can be represented with a sufficiently large vector $y$.

Conversely, when the responses exhibit inherent variability due to noise, probabilistic metrics such as the KL divergence [85], Rényi divergence [105], or Jensen–Shannon divergence [106] provide more robust alternatives by quantifying the statistical distance between observed and ideal optical response distributions, thereby capturing deviations that would otherwise not be apparent in deterministic methods. Furthermore, neural networks have recently provided alternative loss functions to better guide the optimization process, such as KID, FID, and perceptual losses [33], [104]. While underutilized in PDD, these methods can effectively serve as learned FOMs, quantifying how closely a given response aligns with an optimal target. These networks are trained on large classification datasets, and the loss is computed as an L-norm distance between activations in the deeper layers of the network, which encode important structural similarities. Neural-network-based losses align well with human intuition about differences in data; however, they can be much slower than the aforementioned losses.
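As a minimal illustration, the sketch below implements two of the FOM choices discussed above, a deterministic L2-norm distance to a target spectrum and a KL divergence between spectra normalized as distributions; the target and candidate responses are synthetic placeholders.

```python
# Minimal sketch of two FOM choices: (i) negative L2 distance to a target
# spectrum and (ii) negative KL divergence between normalized responses.
import numpy as np

def fom_l2(y, y_target):
    return -np.linalg.norm(y - y_target)                 # higher is better

def fom_kl(y, y_target, eps=1e-12):
    p = y_target / y_target.sum()                        # treat spectra as distributions
    q = y / y.sum()
    return -np.sum(p * np.log((p + eps) / (q + eps)))    # negative KL divergence

wavelengths = np.linspace(0.4, 0.8, 200)
y_star = np.exp(-((wavelengths - 0.60) / 0.02) ** 2)     # ideal narrowband response
y_sim = np.exp(-((wavelengths - 0.61) / 0.03) ** 2)      # candidate device response
print(fom_l2(y_sim, y_star), fom_kl(y_sim, y_star))
```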

4.2 Latent optimization

The design space for PDD problems $\mathcal{X}$ is frequently exponentially large with a sparse distribution of useful designs; for example, the space of useful material height maps $\mathcal{X} \subseteq [0,1]^{n \times n}$ usually has rounded features, axial symmetries, and minimum feature widths, all of which are sparsely distributed within $[0,1]^{n \times n}$. In this case, directly optimizing the designs through adjoint methods is too costly and slow, as most variations are suboptimal and step sizes are small but computationally expensive. Instead, designs can be compressed into lower-dimensional latent feature vectors $z \in \mathcal{Z}$ using an encoder $q_\theta(z|x)$, such that searching the associated latent space is more tractable using adjoint, gradient, and global optimization techniques. To use this more efficient design space, we train a decoder $p_\theta(x|z)$ and modify our design objective in Eq. (2) to employ latent optimization methods [12], [104] to generate the optimal design

(4) $\mathbb{E}_{z \sim q_\theta(z)}\left[\mathbb{E}_{x \sim p_\theta(x|z)}\big[f\big(\hat{y}_\phi(x)\big)\big]\right]$.

Here, the choice of optimizer $q_\theta(z)$ and decoder $p_\theta(x|z)$ is especially important, as they dictate the performance. While the design space $\mathcal{X}$ is typically sparse, the latent space $\mathcal{Z}$ is very dense, making it more efficient to explore. For example, introducing continuous variations on the optical response $\hat{y}_\phi(x^{(0)} + \delta x^{(t)})$ by perturbing latent vectors $z^{(t)} \leftarrow z^{(0)} + \delta z^{(t)}$ allows exploration of the design space $x^{(t)} \sim p_\theta(x^{(t)}|z^{(t)})$. By adding subtle but impactful perturbations to latent representations of complex structures, global optimization [39], evolutionary and genetic algorithms [85], differential evolution [12], and even ML-assisted optimization [104] can efficiently explore the latent space and offer alternative designs outside of human intuition [103], [104], [107]. Typically, the optimizer $q_\theta(z)$ is chosen such that it is efficient to sample and is coupled to the FOM by a latent surrogate model $E_\theta(z)$ that is trained to predict the FOM $f(\hat{y}_\phi(x))$ for $x \sim p_\theta(x|z)$. Recent work by the authors has demonstrated that simple L-norm energy-matching losses, e.g., $\|E_\theta(z) - f(\hat{y}_\phi(x))\|_l$, are difficult to train and too restrictive for optimization. Naturally, the neighboring correlations between FOMs are more important to capture in the surrogate model than exactly matching the FOM values. Therefore, by using a Pearson correlation loss instead of an L-norm, the optimizer can more efficiently sample $E_\theta(z)$. The optimizer itself is often inspired by combinatorial optimization, such as simulated annealing [39], global optimizers [12], recurrent neural networks [104], and even quantum samplers (Section 4.4) [107].
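The following sketch illustrates the correlation-based surrogate training described above: a small network $E_\theta(z)$ is fit with a Pearson correlation loss so that it ranks latent vectors by FOM rather than matching FOM values exactly. The networks, batch of latent vectors, and FOM values are illustrative placeholders.

```python
# Hedged sketch of a latent surrogate E_theta(z) trained with a Pearson
# correlation loss: the surrogate only needs to rank latent vectors by FOM.
import torch
import torch.nn as nn

latent_dim = 16
surrogate = nn.Sequential(nn.Linear(latent_dim, 64), nn.ReLU(), nn.Linear(64, 1))

def pearson_loss(pred, target):
    pred, target = pred.flatten(), target.flatten()
    pred_c = pred - pred.mean()
    target_c = target - target.mean()
    corr = (pred_c * target_c).sum() / (pred_c.norm() * target_c.norm() + 1e-8)
    return 1.0 - corr                                # minimizing this maximizes correlation

# Placeholder batch: latent vectors z and the FOMs of their decoded designs
z_batch = torch.randn(256, latent_dim)
fom_batch = torch.randn(256, 1)                      # f(y_hat_phi(x)), x ~ p_theta(x|z)

opt = torch.optim.Adam(surrogate.parameters(), lr=1e-3)
for step in range(500):
    loss = pearson_loss(surrogate(z_batch), fom_batch)
    opt.zero_grad(); loss.backward(); opt.step()

# The trained surrogate can then guide a latent-space optimizer (e.g., simulated
# annealing or differential evolution) toward promising regions of Z.
```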

4.2.1 Variational autoencoders (VAEs)

Variational autoencoders (VAEs) are a type of unsupervised generative model that compresses high-dimensional designs $x \in \mathbb{R}^n$ into low-dimensional latent vectors $z \in \mathcal{Z} \subseteq \mathbb{R}^d$, $d < n$ [23]. VAEs are often applied to denoising, compression, and optimization applications, where the feature-rich latent vectors $z$ represent designs $x$ without the redundancies, noise, or symmetries that cannot be removed efficiently otherwise [108]. The basic VAE workflow in ML-PDD begins with a set of optimized designs generated by classical methods, e.g., topology-optimized material distributions. Then, each design is compressed via an encoder $q_\theta(z|x)$ and eventually decompressed into the original design via a decoder $x \sim p_\theta(x|z)$. The loss function is a two-term loss derived from the variational evidence lower bound

(5) $D_{\mathrm{KL}}\!\left(q_\theta(z|x)\,\|\,q(z)\right) - \mathbb{E}_{z \sim q_\theta(z|x)}\!\left[\log p_\theta(x|z)\right],$

where q(z) is a prior over z, $q_\theta(z|x)$ is the encoder, and $p_\theta(x|z)$ is the decoder. The first term, $D_{\mathrm{KL}}$, is the Kullback–Leibler (KL) divergence, which regularizes the encoder's distribution toward the prior q(z); the second term is the reconstruction loss, which biases the generated design to resemble the data design [23].
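For readers unfamiliar with this loss, the following minimal PyTorch sketch implements Eq. (5) for binary material maps, assuming a standard Gaussian prior q(z) = N(0, I) and a Bernoulli decoder; the layer sizes are illustrative only.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VAE(nn.Module):
    """Minimal VAE: encoder q_theta(z|x) is a diagonal Gaussian, decoder p_theta(x|z) is Bernoulli."""
    def __init__(self, n=64 * 64, d=16):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(n, 256), nn.ReLU())
        self.mu, self.logvar = nn.Linear(256, d), nn.Linear(256, d)
        self.dec = nn.Sequential(nn.Linear(d, 256), nn.ReLU(), nn.Linear(256, n))

    def forward(self, x):
        h = self.enc(x)
        mu, logvar = self.mu(h), self.logvar(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * logvar)  # reparameterization trick
        return self.dec(z), mu, logvar

def vae_loss(x, x_logits, mu, logvar, beta=1.0):
    # Reconstruction term: -E_q[log p_theta(x|z)] for binary material maps
    recon = F.binary_cross_entropy_with_logits(x_logits, x, reduction="sum")
    # Closed-form KL(q_theta(z|x) || N(0, I)) for a Gaussian prior
    kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return recon + beta * kl  # beta weighs the prior's influence (beta-VAE)
```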

A significant advantage of using VAEs over diffusion models is their structured latent space Z given by optimizing the evidence lower bound in Eq. (5). When a VAE is properly trained [109], the decoder p θ (x|z) efficiently constructs designs x given latent vectors z that are distributed according to the prior q(z). This dense distribution of latent vectors around the prior is what enables latent optimization to be so successful because small perturbations around the prior have a large impact on the design.

However, the trade-off between the reconstruction loss and the KL divergence term often results in suboptimal generative performance, particularly for fine structural features. Moreover, poor prior assumptions q(z), such as a single Gaussian distribution [23], can lead to weak regularization [104], blurry reconstructions [107], and difficulties in capturing multi-modal distributions for multi-objective problems. For more advanced photonic design problems, such as nonlinear devices [110], plasmonic devices [111], or multi-objective devices [112], these issues become more pronounced and harder to overcome. Several advanced strategies have been developed to address these deficiencies.

First, conditional VAEs (cVAEs) incorporate conditional labels into the latent space as latent features, guiding the decoder to generate specific properties on demand, such as optical, structural, and material requirements that must be incorporated into the inverse design process. cVAEs are effective because the latent conditional label is itself feature-rich and carries substantial information for the decoder. Secondly, cVAEs are well suited to one-to-many mapping challenges: by conditioning the model on desired outputs, they can produce diverse solutions that meet the same optical response criteria. This flexibility is valuable for exploring designs that satisfy multiple requirements, as cVAEs generate solutions within specific constraints. Thirdly, β-VAEs weigh the KL divergence with a scalar β to balance the influence of the prior during training.

For example, in Figure 4(a), Kumar et al. implemented a constrained β-VAE model to optimize dielectric multilayer structures using genetic algorithms, allowing for multi-solution inverse design [101]. In another study by Lin et al., the authors examined eigenmodes for the inverse design of 2D BIC structures by using a β-VAE for encoding and decoding geometries, and two CNNs for forward simulation and inverse design (Figure 4(b)) [102]. By exploring the latent space of the β-VAE, the authors contributed to understanding protected/unprotected modes in complex geometries, enhancing future band engineering capabilities in photonic structures.

Figure 4: 
Variational autoencoder (VAE) and generative adversarial network (GAN) models in photonic design applications. (a) Architecture of GA-β-VAE for dielectric multilayer structures inverse design. Adapted with permission from ref. [101]. Copyright 2024 Optica Publishing Group. (b) A DNN fusion model comprised of β-VAE and CNN1-z-CNN2 for inverse design and forward prediction, respectively, for bound states in the continuum (BICs) in freeform structures. Adapted with permission from ref. [102]. Copyright 2021 Chinese Laser Press. (c) Schematic of the conditional global topology optimization networks for metagrating generation. Adapted with permission from ref. [91]. Copyright 2019 American Chemical Society. (d) The workflow of the cGAN-based methodology for the inverse design of multifunctional microwave metasurfaces. Adapted with permission from ref. [113]. Copyright 2022 the authors. Advanced Photonics Research published by Wiley-VCH GmbH.

Moreover, using attention mechanisms allows the model to focus on small details over multiple passes [35], [40]. For instance, transformer-based architectures [35] leverage self-attention mechanisms to model long-range dependencies between data points, which is crucial for capturing complex interactions in photonic structures [114].

Lastly, rather than relying on the KL divergence to regularize the encoder, an adversarial discriminator mechanism can force the latent space to align with the prior, thus improving generative quality [12], [83]. Despite these improvements, VAEs have, outside of latent optimization, largely been superseded by a more general framework built around this adversarial mechanism: generative adversarial networks (GANs) [115].

4.2.2 Generative adversarial networks (GANs)

Generative adversarial networks (GANs) attempt to address the regularization issues of VAEs by eliminating the KL divergence loss entirely. GANs consist of two main components: a generator $x \sim p_\theta(x)$ that produces designs, and a discriminator $d(x) \in [0, 1]$ that evaluates a design's "authenticity" against the training dataset [24], i.e., the probability that x was drawn from the data rather than constructed by the generator, with d(x) = 1 indicating real data. The discriminator is also trained, but we omit its parameters for brevity. The adversarial training process iteratively refines the generator to produce increasingly realistic outputs that fool the discriminator, while the discriminator tries to classify data accurately to overcome the generator. The GAN optimization objective can thus be expressed as the minimax problem:

(6) $\min_{\theta}\,\max_{d}\;\mathbb{E}_{x \sim p_{\mathrm{data}}(x)}\!\left[\log d(x)\right] + \mathbb{E}_{x \sim p_\theta(x)}\!\left[\log\!\left(1 - d(x)\right)\right].$

In contrast, VAEs minimize a combination of reconstruction loss and KL divergence. The KL divergence term regularizes the latent space, while GANs eliminate this term entirely, relying instead on the adversarial loss to guide the generation process. This difference in loss functions addresses the regularization issues in VAEs, but introduces challenges in balancing the generator and discriminator.
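The sketch below illustrates one alternating update of the minimax objective in Eq. (6) in PyTorch; the `generator` and `discriminator` modules are assumed to exist, with the discriminator outputting a probability in [0, 1].

```python
import torch
import torch.nn.functional as F

def gan_training_step(generator, discriminator, g_opt, d_opt, real_designs, latent_dim=64):
    """One alternating update of Eq. (6): the discriminator maximizes the objective,
    the generator minimizes it (via the standard non-saturating form)."""
    batch = real_designs.size(0)
    z = torch.randn(batch, latent_dim)
    fake_designs = generator(z)

    # Discriminator step: push d(x_real) toward 1 and d(x_fake) toward 0
    d_opt.zero_grad()
    d_real = discriminator(real_designs)
    d_fake = discriminator(fake_designs.detach())
    d_loss = F.binary_cross_entropy(d_real, torch.ones_like(d_real)) + \
             F.binary_cross_entropy(d_fake, torch.zeros_like(d_fake))
    d_loss.backward()
    d_opt.step()

    # Generator step: fool the discriminator, i.e. push d(G(z)) toward 1
    g_opt.zero_grad()
    d_fake = discriminator(fake_designs)
    g_loss = F.binary_cross_entropy(d_fake, torch.ones_like(d_fake))
    g_loss.backward()
    g_opt.step()
    return d_loss.item(), g_loss.item()
```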

GANs are particularly valuable for developing device architectures [15], optimizing layouts [116], enhancing imaging resolution [117], and solving inverse design problems [118], [119], [120]. They excel at searching large design spaces for configurations with desired optical properties, such as high-efficiency metasurfaces and low-loss waveguides [121], [122], since they can learn complex mappings from design parameters to performance metrics and synthesize new designs that match the underlying data distributions, enabling efficient exploration of vast design spaces. In inverse design, GANs offer a rapid alternative to traditional iterative methods.

However, GANs are prone to several challenges during training. Mode collapse occurs when the generator produces limited variations of outputs, failing to capture the diversity of the data distribution [123]. Training instability refers to the difficulty in achieving a stable balance between the generator and discriminator, often leading to oscillating or divergent loss functions [124]. To address challenges in training instability and mode collapse, advanced GAN variants such as Wasserstein GANs (WGANs) have been introduced. The Wasserstein distance measures the cost of transforming the distribution of generated data into the distribution of real data. Unlike the original GAN loss, which relies on binary cross-entropy, the Wasserstein distance provides a more meaningful gradient when the distributions of real and generated data are far apart [125]. WGANs use this distance to improve the quality and diversity of generated designs, particularly in complex applications where structural nuances are critical [126]. For instance, Jiang et al. presented a global optimizer based on the WGAN model, as shown in Figure 4(c), which can output ensembles of highly efficient topology-optimized metasurfaces operating across a range of parameters with efficiencies better than the best devices produced by adjoint-based topology optimization, while requiring less computational cost [91].

Although GANs do not inherently impose structured control over the latent space, extensions such as conditional GANs (cGANs), similar to cVAEs, enable targeted design generation by incorporating specific constraints into the model, which are also adaptable to complex optimization challenges in photonics [127]. This capability allows GANs to address the one-to-many mapping problem, producing diverse designs that yield similar optical responses. For instance, Kiani et al. used a specialized cGAN [113], where the generator produces designs based on set conditions, such as specific optical properties (Figure 4(d)). They employed Gramian Angular Fields to transform optical property data into images, representing data correlations effectively. This technique allows the creation of varied metasurface structures with similar electromagnetic functionality, handling the one-to-many problem well.

GANs also benefit from hybrid strategies that integrate their generative strengths with other models’ latent space control by designing hybrid frameworks that combine adversarial loss for realism with reconstruction or noise-based loss for structured latent space and diverse sample generation. For instance, combining GANs with VAEs [128] or diffusion models [129] enhances their ability to produce stable, high-quality designs. Diffusion models, in particular, provide noise-based generation processes that complement GANs, enabling diverse and realistic designs [11]. This integration could be particularly effective for photonic applications requiring high fidelity and adaptability in the generated outputs.

4.2.3 Diffusion models

While GANs can generate high-quality results, they still suffer from training instability due to their sensitivity to hyperparameters and to the balance of power between the generator and discriminator. VAEs, in turn, have difficulty balancing the terms of the evidence lower bound, and their generative performance is often lacking. A class of models that has recently gained popularity is denoising diffusion models. Diffusion models are far more stable to train and more robust to hyperparameter changes than GANs, and they have been shown to possess strong generative capabilities and superior distribution coverage [130], outperforming GANs on generative metrics such as the Fréchet inception distance (FID). Diffusion models have found applications in various domains, from computer vision to bioinformatics [131], [132]. However, they are often more computationally expensive during training and sampling than GANs and VAEs, requiring many more steps to generate an image [130].

A diffusion model uses a Markov chain $q(x^{(t+1)}|x^{(t)})$ to gradually add noise, e.g., Gaussian noise $x^{(t)} + \epsilon^{(t)}$ with $\epsilon^{(t)} \sim \mathcal{N}(\mu_t, \sigma_t)$, to the training data $x^{(0)} \sim p_{\mathrm{data}}(x)$ [133], so that the designs converge to Gaussian noise over time [25], [134]. A neural network $q_\theta(x^{(t-1)}|x^{(t)})$ is trained to reverse the noising process by iteratively predicting the previous image given the current image across all time steps. To sample from the diffusion model, isotropic Gaussian noise is generated at the final time step T and the neural network iteratively generates the previous image for T − 1 steps until a novel image is produced [25], [135]. To improve on the iterative denoising process, Ho et al. [25] introduced the denoising diffusion probabilistic model (DDPM), which parameterizes the neural network to predict the original noise $\epsilon_\theta^{(t)}$ instead of the previous image, as shown in Figure 5(a). For image applications, DDPMs typically use U-Nets, deep convolutional networks with a spatial-dimension bottleneck and skip connections to preserve information [25], [136], [137]. Recently, latent diffusion models have been introduced, which perform the diffusion process within the latent space Z of a pre-trained autoencoder, reducing training and sampling cost because the expensive diffusion process operates on a lower-dimensional representation [33].
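The following sketch shows the core DDPM training step in PyTorch, assuming a linear noise schedule and a hypothetical `eps_model(x_t, t)` (typically a U-Net) that predicts the added noise.

```python
import torch
import torch.nn.functional as F

# Linear noise schedule (assumed); alpha_bars is the cumulative product used in DDPM
T = 1000
betas = torch.linspace(1e-4, 0.02, T)
alpha_bars = torch.cumprod(1.0 - betas, dim=0)

def ddpm_training_step(eps_model, x0, optimizer):
    """One DDPM step: corrupt clean designs x0 to x_t and train eps_model to predict the added noise."""
    batch = x0.size(0)
    t = torch.randint(0, T, (batch,))                      # random timestep per sample
    eps = torch.randn_like(x0)                             # the noise to be predicted
    a_bar = alpha_bars[t].view(batch, *([1] * (x0.dim() - 1)))
    x_t = a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * eps   # forward (noising) process
    loss = F.mse_loss(eps_model(x_t, t), eps)              # predict epsilon, not x_{t-1}
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```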

Figure 5: 
Diffusion model applications in photonic design. (a) Illustration of reverse process, where the neural network learns how to denoise across the Markov chain to generate novel images. Adapted with permission from ref. [25]. Copyright 2020 Neural Information Processing Systems Foundation, Inc. (b) Training architecture of conditional diffusion model, where a conditional and unconditional U-Net are mixed to produce new inverse designs through the denoising process. Adapted from ref. [138]. Copyright 2023 the author(s), published by De Gruyter, Berlin/Boston, licensed under the Creative Commons Attribution 4.0 International License. (c) Illustration of semi-supervised training process. Adapted from ref. [80]. Copyright 2024 the author(s). Laser & Photonics Reviews published by Wiley-VCH GmbH, licensed under a Creative Commons Attribution-Non Commercial License.

Diffusion models have been applied to a variety of tasks in nanophotonics, such as metasurface design, classification, and image deconvolution [80], [138], [139], [140], [141]. Zhang et al. [138] conditionally generated dielectric metasurfaces with a conditional diffusion model trained on an all-dielectric freeform unit-cell dataset, producing high-quality designs guided by S-parameters. Using a neural network to predict the spectral response, they found that their diffusion model outperforms SLMGAN, WGAN-GP, and cVAE on frequency-response accuracy, achieving 43 %, 48 %, and 54 % improvements, respectively. Their framework is shown in Figure 5(b). Kim et al. [80] sought to improve a classification task with a semi-supervised learning strategy, facilitated by a diffusion model, that exploits a large amount of unlabeled data together with a small amount of labeled data, comparing their approach against a supervised learning scheme on a photonic crystal waveguide dataset. Their strategy, shown in Figure 5(c), first trains a "teacher model" to predict the spectral response of the generated dataset, then trains a DDPM to expand the original dataset, to which the "teacher model" affixes labels. Finally, a "student model" with the same architecture as the teacher is trained on this expanded dataset. The study finds that this semi-supervised strategy improves the average training loss of the student classification model by up to 102.8 % compared to the teacher model. Since DDPMs are fundamentally denoisers, applying them outside of design tasks is also worthwhile. Chakravarthula et al. [139] used a diffusion model to improve the optical quality of images captured from an on-sensor metalens array, a challenging problem that is especially prevalent at smaller wavelengths. The authors introduce a flat array of metalenses to increase the field of view and implement a diffusion model to recover an image from the on-sensor metalens array. The image-recovery process takes a captured image and alternates between inverse filtering, diffusion sampling steps, and merging steps. Their study finds that this probabilistic deconvolution method outperforms existing traditional and more recent machine-learning deconvolution methods across various metrics.

Diffusion models also provide an alternative approach to tackle one-to-many mappings in transforming random noise into structured designs. Zhu et al. demonstrated this method for meta-atom design [140], using gradual noise-based generation specific to diffusion models to create multiple designs meeting the same specifications by varying the initial noise conditions. This allows the model to produce diverse solutions for the same performance criteria, which is ideal for cases where similar spectral responses arise from different structures.

In the most recent studies, diffusion-model-based multi-modal machine learning frameworks bridge natural language processing and computer vision by combining models from both domains. Sun et al. created a multi-modal framework that incorporates the contrastive language-image pre-training (CLIP) and stable diffusion (SD) models to predict the photonic modes of a given structure from textual structural information [141]. The CLIP model evaluates how similar a word is to an image, and the SD model performs the same diffusion process but in the latent space of pre-trained autoencoders. The authors report that their method shows signs of stabilization across the aHash, FID, and CLIP score metrics after only 6 training epochs. Furthermore, it achieves an approximately 2-fold speedup over conventional techniques based on Maxwell's equations.

4.3 Reinforcement learning

Reinforcement learning (RL) algorithms excel when design choices build on one another, making them well-suited for photonic design problems where each step can, in turn, drastically alter the optical response. They recently gained popularity for their ability to beat humans at chess, Go, and other popular games [142]. RL stands apart from other ML techniques because it uses a goal-driven, trial-and-error approach, where the RL agent learns by interacting with an environment and receiving feedback (rewards or penalties) based on its actions [143]. RL agents generate their datasets in real time instead of relying on large datasets at the outset of the training process. This reduces the number of simulations needed to meet specific performance benchmarks. Their autonomy and adaptability make RL uniquely suited to streamline photonic device design while minimizing the computational overhead required for training datasets.

4.3.1 Reinforcement learning applied to PDD

RL enables inverse design by tasking the agent with constructing a device environment x (comprised of different materials and geometries) that optimizes the optical response, for example the transmission spectrum of a metasurface [144], [145]. Figure 6(a) shows an example in which the device parameters x = {NT, L, AT, D} are the dimensions of the Si unit cells on a Si3N4 substrate [69]. The agent iteratively adjusts these parameters to achieve the desired optical response, using feedback from a simulation $\hat{r}(y|x)$ to generate positive and negative rewards. The agent's actions adjust the geometry or materials in the design environment x and, over time, the RL model converges on optimal configurations. We simplify the RL training process to the following steps:

  1. Initialize environment: Create the input parameters $x^{(0)}$ for the policy function $\pi(x^{(0)})$. For example, in Figure 6(a), this means initially assigning values to the dimensions in the unit cell. The agent could also select from a list of materials, although that is not included in this example.

  2. Action selection: The agent uses its policy $\pi(x^{(t)})$ to determine the next action $a_{t+1} \sim \pi(x^{(t)})$, choosing from a range of probabilistically generated actions based on the current state $x^{(t)}$. The agent balances exploration (trying new actions) and exploitation (repeating effective actions) as it refines the design.

  3. Simulate the environment: Each action's outcome $x^{(t+1)} \sim a_{t+1}(x^{(t)})$ is evaluated by simulating the optical properties $\hat{y}_\phi(x^{(t+1)})$; see Section 3. This simulation provides feedback on how well each action advances the design toward the target outcome.

  4. Process rewards: The RL model estimates rewards (or penalties) for each action based on how closely the resulting design meets the target outcome. These rewards (or penalties) are then used to update the agent's policy in preparation for the next step.

  5. Repeat: Let the FOM $f(y^{(t)})$ rank each generated design $x^{(t)}$ by its optical response $y^{(t)} = \hat{y}_\phi(x^{(t)})$, for example using a loss function designed to minimize deviation from a target spectrum [146]. Iterate until the FOM $f(y^{(t)})$ reaches a target threshold signifying that the optimization is complete.

Figure 6: 
Reinforcement learning for photonic design. (a) Example of reinforcement learning (RL) agent applied to the optimization of color generation using dielectric metasurfaces. Adapted with permission from ref. [69]. Copyright 2019 Optical Society of America under the terms of the OSA Open Access Publishing Agreement. (b) Training process for RL algorithm for the design of 1-dimensional photonic crystals. The agent uses the transfer matrix method (TMM) to simulate the spectrum of the design created by the sequence generator. Adapted with permission from ref. [146]. Copyright 2021 the author(s). Published by IOP Publishing Ltd. (c) Result of the RL algorithm implemented in (b). The system outperforms the state-of-the-art memetic algorithm for the emissive power of light bulbs in the visible regime. Adapted with permission from ref. [146]. Copyright 2021 the author(s). Published by IOP Publishing Ltd. (d) Silicon on Insulator (SOI) diffraction grating structure designed with RL algorithm. The RL model (PHORCED) outperforms GLOnet, a comparable discriminative neural network trained with supervised learning. Both graphs in this subfigure adapted with permission from ref. [147]. Copyright 2021 the author(s). Published by De Gruyter, Berlin/Boston.

The architectures discussed previously in this review update their parameters through some form of gradient descent on a loss function, such as Eq. (5). RL techniques must likewise specify a method for updating the agent's policy; however, unlike the previously discussed architectures, this update is not necessarily derived from a loss function. Instead, RL relies on feedback in the form of rewards (or penalties) to adjust the policy, based on the agent's progress toward a specified goal. This distinction has a significant implication: RL does not require the reward calculation to be differentiable, which means external simulations, such as 3D field solvers, can be incorporated directly into the optimization process. This flexibility expands the range of problems RL can address and allows researchers to integrate diverse computational tools into the training loop.
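The skeleton below summarizes the five-step loop above in Python; every name (`policy`, `simulate_optical_response`, `figure_of_merit`, `update_policy`) is a placeholder, and the simulator can be any non-differentiable black box such as an external FDTD run.

```python
def run_rl_design_loop(policy, simulate_optical_response, figure_of_merit,
                       update_policy, n_iterations=100, target_fom=0.95):
    """Skeleton of the five-step RL design loop. The simulator is treated as a
    black box, so it does not need to be differentiable."""
    x = policy.initial_design()                      # 1. initialize environment
    for _ in range(n_iterations):
        action = policy.select_action(x)             # 2. exploration vs. exploitation
        x = action(x)                                # apply the chosen geometry/material change
        y = simulate_optical_response(x)             # 3. black-box simulation of y_hat(x)
        reward = figure_of_merit(y)                  # 4. reward from the FOM
        update_policy(policy, x, action, reward)     #    policy update (e.g. DQN or PPO)
        if reward >= target_fom:                     # 5. stop once the target is reached
            break
    return x
```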

4.3.2 Reinforcement learning architectures

The two main policy update techniques in RL are called Q-learning, which is a value-based approach, and policy gradients (PG), which is a policy-based technique. Both methods offer distinct benefits for inverse design.

The Q-learning method, first popularized by the deep Q-network (DQN) model, maintains an action-value function (Q-values) for each state-action pair and evaluates them based on expected future rewards [148]. The Q-values are updated using the Bellman equation, gradually improving them by comparing predicted and realized future rewards. The key to this technique is that it connects the state of the structure to the action that should be taken, making Q-learning well suited to environments with discrete parameter spaces.
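As a minimal illustration (not the DQN architecture itself), the tabular Bellman update below shows how a Q-value is nudged toward the reward plus the discounted best future Q-value; the state and action indices are hypothetical.

```python
import numpy as np

def q_learning_update(Q, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """Tabular Bellman update: move Q(s, a) toward reward + gamma * max_a' Q(s', a').
    Suitable when design parameters take a discrete set of values."""
    best_next = np.max(Q[next_state])
    Q[state, action] += alpha * (reward + gamma * best_next - Q[state, action])
    return Q

# Hypothetical example: 10 discrete device states, 4 allowed parameter changes
Q = np.zeros((10, 4))
Q = q_learning_update(Q, state=3, action=1, reward=0.7, next_state=4)
```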

Sajedian et al. use this approach to find the optimal materials for maximizing the efficiency of a metasurface hologram [69]. They construct Q-values that map actions to changes of the material, forming state-action pairs. They demonstrated the algorithm on a three-layer structure pre-designed to generate a hologram and increased the hologram's efficiency by 17 % using their RL algorithm. In another example, Badloe et al. designed an ultra-broadband perfect absorber based on moth-eye structures. They move beyond a discrete parameter space by using a double deep Q-network (DDQN) architecture that considers the height, width, and period of each nanostructure [149]. The key to this network is that it uses two networks, one to select actions and one to evaluate them. Their approach converges quickly across a range of materials, with over 90 % average absorption between 400 and 1,600 nm for each design.

Alternatively, policy gradient (PG) methods directly update the agent's policy by adjusting its parameters to maximize rewards. Wang et al. demonstrate a PG technique for the design of one-dimensional photonic crystals [146]. They chose a gated recurrent unit (GRU) architecture for their policy because it naturally represents the sequential structure of multi-layer stacks. Figure 6(b) shows a block diagram of their model. During each iteration, the GRU selects a series of materials for the structure, the result is computed with the transfer matrix method (TMM), and the network is updated using proximal policy optimization (PPO). PPO updates the policy by maximizing the expected reward, which here corresponds to how closely the generated optical spectrum matches the target spectrum. They designed an ultra-wideband absorber that increased efficiency from 95.37 % to 97.64 % between 400 and 2,000 nm for a 5-layer system. They also designed a 42-layer incandescent light bulb filter that achieved an enhancement factor of 16.60, an 8.5 % improvement over state-of-the-art techniques.
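PPO adds a clipped surrogate objective on top of the basic policy-gradient idea; the sketch below shows the simpler REINFORCE form of the update, with hypothetical rollout data (`log_probs`, `rewards`) collected from simulated designs.

```python
import torch

def reinforce_update(optimizer, log_probs, rewards, baseline=None):
    """REINFORCE-style policy gradient: increase the log-probability of actions
    in proportion to the (baseline-subtracted) reward they produced.
    `log_probs` are the action log-probabilities recorded during a rollout."""
    rewards = torch.as_tensor(rewards, dtype=torch.float32)
    if baseline is None:
        baseline = rewards.mean()                    # simple variance-reduction baseline
    advantages = rewards - baseline
    loss = -(torch.stack(log_probs) * advantages).sum()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```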

Several other papers highlight the versatility of policy gradient methods for the inverse design of photonic structures. Hooten et al. propose PHORCED (PHotonic Optimization using REINFORCE Criteria for Enhanced Design), which they use to increase the efficiency of a grating coupler [147]. PHORCED is a good example of how RL can reduce the training time and training data required by other ML techniques. Figure 6(d) shows the performance improvement over a comparable deep neural network. Li et al. demonstrated a combination of Q-learning and policy gradient techniques for the inverse design of photonic crystals for nanoscale laser cavities [150]. They found a crystal that achieved a Q-factor over 50 million, more than two orders of magnitude better than state-of-the-art techniques.

RL has emerged as a powerful new technique for the inverse design of photonic structures. Unlike the other techniques discussed in this paper, it does not require a continuous function between the rewards it calculates and the updates to its policy. This frees researchers to incorporate discontinuous analysis, such as FDTD solvers or even real-world feedback, into the training loop [151], [152]. Policy gradient methods are useful for continuous, high-dimensional action spaces, whereas Q-learning excels at discrete action spaces, offering efficient solutions for problems with clearly defined actions. However, this freedom makes RL algorithms significantly more challenging to design, often requiring entire models to be built from scratch for a given problem; the resulting model is tied to a specific application and may not generalize as well as other ML techniques. RL can also be computationally intensive, often requiring substantial training time and resources to converge compared with the other techniques [151]. While RL presents computational challenges, its novelty in photonic design suggests significant potential for future applications as researchers continue to explore and refine its capabilities.

4.4 Quantum generative models

Quantum information science is a rapidly growing field, with recent breakthrough demonstrations accelerating enthusiasm for its potential [153], [154]. Quantum-enhanced sampling and optimization algorithms are speculated to yield at least quadratic speed-ups [155] for combinatorial optimization; thus, there is increasing interest in applying quantum algorithms to optimize designs in PDD. The core idea is that designs x are embedded in quantum states $|x\rangle$ and sampled using a quantum device $q_\theta(x) = |\langle x|U_\theta|\psi_0\rangle|^2$, where $U_\theta$ is a unitary acting on the initial state $|\psi_0\rangle$. Quantum sampling devices can be accessed commercially through D-Wave [156], IBM [157], and Quantinuum [158], to name a few. To sample designs using a quantum device, the pure states $|\psi\rangle$ of the device's native Hamiltonian $\hat{H}$ must encode the design x. Likewise, the FOM is often encoded onto the low-energy states of a surrogate Hamiltonian $\hat{H}$ such that the energy $\langle x|\hat{H}|x\rangle$ is anti-correlated with the FOM $f(\hat{y}_\phi(x))$. The quantum device then runs an algorithm, such as quantum-enhanced MCMC [107], [159], quantum annealing [156], or QAOA [155], to sample low-energy states of $\hat{H}$ and thereby produce designs that optimize the FOM [39], [107] (Figure 7(a)). Initial work using this scheme for PDD addressed problems whose designs could be naturally encoded in pure states of the surrogate Hamiltonian. For example, in Figure 7(c), Inoue et al. optimized photonic-crystal surface-emitting lasers [160]. The authors observe that non-uniform spatial distributions would be useful for accommodating varying lattice constants or hole shapes in photonic crystal devices [161]. These parameters are formulated into a Hamiltonian sampling problem using a factorization machine and sampled by D-Wave's quantum solvers. Similarly, Kitai et al. [162] explored the design of metamaterials for radiative cooling applications using quantum sampling and a factorization machine. Properties such as compositional inhomogeneity are formulated into a minimization problem for the quantum annealer's Hamiltonian, which identified candidates with high FOMs. The resulting structures were evaluated through rigorous coupled-wave analysis for their radiative properties, and the results were iteratively used to refine the factorization machine model. Direct mapping of large problems increases the complexity of the system Hamiltonian, making it susceptible to noise and errors while also exacerbating local-minima trapping and significantly lengthening sampling times. To reduce this complexity, Ye et al. explored a hybrid quantum-classical strategy using mixed-integer linear programming (MILP), which optimizes material layouts within a continuous domain using linear equations with continuous and discrete variables based on generalized Benders decomposition (GBD) [163], [164]. This approach optimizes goals such as structural compliance and heat-transfer efficiency while managing constraints such as local displacement [165]. GBD decomposes the original PDD problem into a sequence of MILPs and uses material layouts from previous iterations to determine each new iteration, leading to faster convergence than classic direct optimization methods. The material is represented as a binary variable, and the formulations were tested on D-Wave's systems; the GBD implementations (GBD splitting-direct and GBD splitting-CQM) predictably converged faster, taking 234.12 and 111.97 s, respectively [166].
Linear programming reduces iterations to reach optimal solutions, yielding sharper designs with fewer grey areas. Iterative refinements provide direct mappings, individual optimal solutions, faster convergence, and simpler implementation on quantum computers, extending to other continuous optimization problems. Despite the efficiency gains from MILP, the need for robust feature extraction and dimensional reduction persists.

For other PDD problems, such as freeform material distribution optimization, the designs cannot be embedded directly into pure states because there are too few qubits, the design space is too sparse, or the FOM mapping onto the native Hamiltonian is unknown. Sparse design spaces in PDD especially complicate the optimization landscape, leading to inefficient use of quantum resources, as many sampled quantum states may correspond to sub-optimal solutions. This further motivates the use of autoencoders: in particular, binary variational autoencoders are useful for representing continuous optimization problems by traversing a latent space [167]. To address these challenges, discriminative models based on encoders compress designs onto pure states $|z\rangle$ that can be sampled natively [107]. These frameworks typically begin with a classical pre-processing step, where the PDD designs x are encoded $q_\theta(z|x)$ onto a "native" representation z, sampled via the quantum device $q_\theta(z) = |\langle z|U_\theta|\psi_0\rangle|^2$, and then decoded $x \sim p_\theta(x|z)$ [107], [168]. The bVAE-QUBO framework proposed by Wilson et al. optimizes dielectric, free-form diffractive meta-gratings for beam steering [39]. As shown in Figure 7(b), the framework begins by compressing input designs into a binary latent space with a corresponding quadratic unconstrained binary optimization (QUBO) formalism. Quantum annealing models optical behaviors as state transitions, solving them as combinatorial problems where optimal configurations represent solutions. Statistical mechanics models enhance this approach by encoding metamaterial features such as refractive indices and conductivity levels. Using D-Wave's Leap hybrid solvers and simulated annealing, bVAE-QUBO achieves 96.5 % and 96.7 % efficiencies, respectively. The framework was also applied to optimize the efficiency of a thermal emitter for TPV cells, maximizing power generation and minimizing radiation losses. The bVAE-QUBO framework uses a reduced-dimension space for faster processing and traversal, ensuring feature-rich encoding through one-to-one mapping and generating a probabilistic distribution over all possible topologies. Notably, these techniques have been applied to freeform metamaterial unit-cell design for radiative cooling, thermophotovoltaic energy recapture [39], diffractive meta-gratings [39], and even improving future quantum devices [107]. Generally, they have shown marked improvements beneficial in fields such as thermophotovoltaics, incandescent light sources, biosensing, microbolometers, and drying furnaces [162].
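To make the sampling step concrete, the sketch below uses classical simulated annealing as a stand-in for the quantum sampler: it draws low-energy binary latent vectors from a hypothetical QUBO matrix whose energy is assumed to be anti-correlated with the FOM. In the actual frameworks, the same QUBO is submitted to a quantum annealer or hybrid solver, and the resulting latent vector is decoded back to a design with $p_\theta(x|z)$.

```python
import numpy as np

def qubo_energy(z, Q):
    """Energy z^T Q z of a binary latent vector z; the surrogate is trained so that
    low energy corresponds to a high figure of merit."""
    return z @ Q @ z

def simulated_annealing(Q, n_steps=5000, T0=2.0):
    """Classical stand-in for a quantum sampler: draw low-energy binary latent vectors."""
    rng = np.random.default_rng(0)
    d = Q.shape[0]
    z = rng.integers(0, 2, d)
    energy = qubo_energy(z, Q)
    for step in range(n_steps):
        T = T0 * (1 - step / n_steps) + 1e-3           # linear cooling schedule
        i = rng.integers(d)
        z_new = z.copy()
        z_new[i] ^= 1                                  # flip one latent bit
        e_new = qubo_energy(z_new, Q)
        if e_new < energy or rng.random() < np.exp((energy - e_new) / T):
            z, energy = z_new, e_new
    return z, energy

# Hypothetical 16-bit latent QUBO fitted to surrogate FOM data
Q = np.random.default_rng(1).normal(size=(16, 16))
Q = (Q + Q.T) / 2
z_opt, e_opt = simulated_annealing(Q)   # decode z_opt back to a design with p_theta(x|z)
```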

Figure 7: 
Quantum generative models in photonic design applications. (a) Simulated annealing versus quantum annealing. Simulated annealing aims to traverse a large solution space and converge at the global minimum as the solution. Quantum annealing works in a similar manner; it encodes the problem into a quantum system where the system’s qubits represent binary variables. By evolving the system’s Hamiltonian, all possible states of the qubits are explored simultaneously. Quantum annealing image adapted with permission from ref. [169]. Copyright D-wave systems. (b) bVAE-QUBO framework. Adapted with permission from ref. [39]. Copyright 2021 author(s). Published under an exclusive license by AIP Publishing. (c) Photon-crystal surface emitting lasers (PCSELs) using 3D RCWA, QUBO, and FMs. Adapted with permission from ref. [160]. Copyright 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement.

4.5 Summary and outlook

ML is transforming photonic design by enabling efficient exploration of complex, high-dimensional spaces to optimize device performance. Advanced generative models like VAEs, GANs, diffusion models, and RL streamline the design process by balancing innovation, precision, and practical constraints. These tools enable breakthroughs in photonic systems by facilitating rapid iteration, addressing multi-modal challenges, and uncovering new physical phenomena. Emerging quantum/hybrid quantum-classical approaches further extend these capabilities, driving innovation in advanced photonic applications with unprecedented efficiency and scalability.

5 Fabrication

Fabrication bridges the gap between simulation and experimental demonstration and presents a critical, unique challenge in PDD. It is a complex, often iterative and time-consuming process that involves balancing stochastic sources of error, including material inconsistencies, fabrication defects, and processing-induced variations [170], [171]. This issue is exacerbated in the design of metamaterials for nanophotonics, where structures must achieve sub-wavelength precision while often extending over dimensions spanning hundreds of wavelengths. Even minimal deviations, on the order of nanometers, can substantially degrade device performance, impacting key FOM metrics such as efficiency, bandwidth, and functional robustness [172]. Fortunately, the emergence of machine learning in nanophotonics enables precise, data-driven improvements to the complex, probabilistic nature of fabrication processes. In PDD, fabrication is a stochastic process $r_\eta(\chi|x)$ with parameters $\eta$, where $\chi$ is the fabricated design and x is the ideal design. Notably, unlike other probabilistic models such as $p_\theta$, where θ is variational and can be trained via machine learning, the fabrication process's dependence on its parameters η (e.g., table alignments, machine settings) cannot always be written down in a form that allows direct optimization. An ideal fabrication process $r_\eta(\chi|x)$ reduces the variation, i.e., the entropy, in the fabricated designs given an ideal design; the optimal fabrication process would be deterministic, fabricating every design perfectly every time. To address fabrication robustness and minimize design variability in PDD, we first discuss classical, non-ML approaches such as smoothing and vaccination. We then explore more advanced ML-driven methods, emphasizing discriminative models and reinforcement learning techniques.

5.1 Improving reliability with stochastic processes

Simple yet effective stochastic methods, such as smoothing, can significantly enhance fabrication reliability. Smoothing typically employs Gaussian blurring ($x \to \tilde{x}$) on optimized structures generated by methods like topology optimization (Figure 8(a)) [173], [174]. This pre-fabrication step mitigates vulnerability to small-scale defects by reducing the complexity of the fragile or unintuitive geometries frequently generated during optimization [175]. Figure 8(c) and (d) demonstrate the effect of smoothing on a cylindrical metalens designed via topology optimization, illustrating improved manufacturability and consistency between simulated and fabricated results. Smoothing techniques have been successfully applied to various applications, such as improving waveguide mode matching and enhancing robustness against defects through structural optimization [176].
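A minimal smoothing step of this kind can be written in a few lines; the sketch below blurs and re-binarizes a hypothetical binary permittivity map with SciPy, with the blur radius and threshold chosen for illustration.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def smooth_and_binarize(design, sigma_px=2.0, threshold=0.5):
    """Pre-fabrication smoothing: Gaussian-blur a topology-optimized material map
    and re-binarize it, removing features smaller than roughly the blur radius."""
    blurred = gaussian_filter(design.astype(float), sigma=sigma_px)
    return (blurred > threshold).astype(float)

# Hypothetical 256x256 binary permittivity map from topology optimization
design = (np.random.default_rng(0).random((256, 256)) > 0.5).astype(float)
robust_design = smooth_and_binarize(design, sigma_px=3.0)
```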

Figure 8: 
Improving the fabrication of photonic devices with machine learning. (a) Inclusion of fabrication defects into the inverse design process. (b) Reinforcement learning (RL) uses a fab-in-the-loop process to optimize nanophotonic design over multiple iterations of the algorithm. (c) Design of cylindrical silicon metalens using topological optimization. This example shows the result of the topological optimization without taking into account the fabrication process. (d) The same design as (c) except that the topological optimization accounts for the fabrication process available to the designers of the system. Subfigures c and d are adapted with permission from ref. [177]. Copyright 2021 Optical Society of America. (e) Result of the fab-in-the-loop process applied to a grating coupler design. (f) Comparison between a traditional grating coupler and a grating coupler designed with the fab-in-the-loop RL algorithm. Subfigures a, b, e, and f are adapted with permission from ref. [151]. Copyright 2023 the authors, distributed under the Creative Commons License, AIP Publishing.

Vaccination is a similar technique that shifts the focus from the probabilistic incorporation of fabrication errors to adjusting misalignments common during experimental demonstration of free space optical designs, i.e., refining the fabrication parameters η to reduce error. Mengu et al. demonstrated this technique on their deep diffractive neural network (D2NN) platform [178]. Their inverse design process created a series of metasurfaces in which each unit cell was trained like a neuron in an artificial neural network. Each neuron’s precise control and individual freedom left the platform vulnerable to misalignments on the order of a single wavelength. With the introduction of vaccination, the researchers introduced random misalignments in the training process, resulting in a robust experimental demonstration at up to four wavelengths. This technique is another promising example of how careful incorporation of noise during the training process can be generally helpful in the realization of nanophotonic devices [179].
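In practice, vaccination amounts to injecting random misalignments into each training sample; the sketch below shows one hypothetical implementation using small random shifts and rotations, with misalignment magnitudes chosen for illustration rather than taken from ref. [178].

```python
import numpy as np
from scipy.ndimage import shift, rotate

def vaccinate(design, rng, max_shift_px=2.0, max_rot_deg=1.0):
    """Inject a random misalignment (sub-pixel shift plus small rotation) into a training
    sample so that the learned device tolerates the same errors at measurement time."""
    dx, dy = rng.uniform(-max_shift_px, max_shift_px, size=2)
    angle = rng.uniform(-max_rot_deg, max_rot_deg)
    perturbed = shift(design, (dy, dx), order=1, mode="nearest")
    return rotate(perturbed, angle, reshape=False, order=1, mode="nearest")

rng = np.random.default_rng(0)
# Each training epoch sees a differently misaligned copy of the nominal design
nominal = np.zeros((128, 128)); nominal[48:80, 48:80] = 1.0
vaccinated_sample = vaccinate(nominal, rng)
```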

5.2 Correcting variations with discriminative models

While these simple stochastic processes are effective, researchers have sought more advanced techniques to protect against fabrication variations by adjusting the fabrication parameters η. As we have demonstrated throughout this review, a natural next step to the modeling of complex stochastic processes is to use highly effective discriminative models which can model complex, nonlinear fabrication processes. A common way to improve fabrication via discriminative modeling is to predict the optimal parameters η*(x) to improve the fabrication process r η * ( χ | x ) for any design x. For example, Gostimirovic et al. presented a feed-forward CNN to automatically correct layout distortions, which ensured that the resulting experimental demonstrations were robust against fabrication defects [180]. They trained their model on a small set of scanning electron microscopy images, enabling it to predict and then fix local variations such as corner rounding, bridging of narrow features, and over-etching in convex corners. Their approach requires no modifications to the existing fabrication process or proprietary foundry data, making it well-suited to a wide variety of photonic applications.

Alternatively, by predicting the error ϵ(χ|η, x) in the fabricated designs χ, adjustments can be made to the fabrication parameters η to improve robustness. In later work, the same group leveraged a deep CNN to predict nanoscale deviations from the original layout, revealing how proximity effects and other lithographic or etch-related artifacts degrade fidelity [181]. In a similar approach, Liu et al. use a tandem architecture of deep neural networks to address disparities between their design simulations and optical response [65]. They address inverse design for problems like thin-film filters and multiwavelength phase modulators. They combine the simulation, design, and fabrication steps of the PDD into a single feedback loop using multiple machine learning approaches. First, a discriminative model is trained to map a device layout to its optical behavior, then they combined the input design with a model that generates candidate layouts for a specified target performance. Although they set out to optimize their inverse design pipeline, their strategy naturally enhances robustness against manufacturing defects because the forward model is trained on fabrication data and the inverse network consistently generates layouts less prone to fabrication-induced errors. The resulting layouts avoid, for example, unrealistically thin layers that could cause over-etching or dimension loss.

5.3 Reinforcement learning in fabrication

RL is a strong candidate for mitigating fabrication errors because of its naturally iterative and adaptive nature, which incorporates real-time feedback from the fabrication process. This makes it well-suited to handle the complexities and uncertainties inherent in manufacturing at the nanoscale. One of the critical advantages of RL is its ability to model the fabrication process as part of the environment in which the learning agent operates. Witt et al. demonstrated a fab-in-the-loop RL algorithm that automates adjustments to the design based on measured results (Figure 8(b)) [151]. The algorithm iteratively adjusted the design based on measured performance after each fabrication cycle; specifically, the RL agent received feedback from insertion-loss measurements and constructed rewards from the results. After only six iterations of this process, they reduced the insertion loss of a crystal grating coupler from 8.8 to 3.24 dB. In an alternative approach, Park et al. created a physics-informed RL model whose reward system encourages the feasibility of the fabricated devices [152]. Figure 8(e) and (f) highlight the potential of RL for scalable, robust, and innovative solutions in the general design of nanophotonic systems. RL provides a very general framework, so incorporating fabrication variations into the reward process can be as simple or as complicated as the experimental demonstration requires.

5.4 Summary and outlook

These fabrication-oriented methods showcase the growing capability of machine learning to address the randomness and complexity of real-world manufacturing conditions. Basic smoothing strategies provide fast, practical ways to refine the intricate designs that sometimes result from inverse design techniques. Approaches like vaccination can ensure robustness against stochastic effects, such as misalignment, that arise during experimental demonstration. Discriminative networks offer even stronger protection against fabrication variations, but they require datasets that can be time-consuming to generate. Finally, RL provides a closed-loop approach that can include fabrication results within the design process. Taken together, these techniques highlight a compelling trend: as the inverse design of photonic structures pushes for tighter tolerances and higher complexity, machine learning offers a protective framework to ensure the manufacturability of those designs.

6 Characterization

Once a design $\chi \sim r_\eta(\chi|x)$ has been fabricated, its FOM f(y) is inferred as $\hat{f}(\Upsilon)$ from a finite sample of noisy optical response measurements $\Upsilon = \{\upsilon^{(i)} \sim m_\rho(\upsilon|\chi)\}_{i=1}^{M}$, e.g., refractive index [182], electronegativity [183], absorption [184], chirality [185], etc., taken with a measurement device $m_\rho(\upsilon|\chi)$. When measurement data can be rapidly collected, such as when simulations accurately mimic noisy measurements or when several measurements can be collected from a few fabricated devices, surrogate estimators can often significantly speed up characterization by reducing the number of samples required to compute the FOM; a notable recent example is denoising backscattering in SEM [186]. Moreover, the AI-driven SEM of ref. [187], shown in Figure 9(c), combines quasi-random initial measurements with a supervised, deep-neural-network-based dynamic sampling approach that identifies the unmeasured points which, when added to the dataset, most enhance the fidelity and quality of the reconstructed image; each unmeasured point is represented as a feature vector whose elements are determined by the measurement state in its surrounding neighborhood. Substantial work is also being done on measuring thermal conductivity and corrosion degradation [188], [189], [190], [191], super-resolution imaging [192], authenticating the source of semiconductor devices [40], automating grain-size measurements using GANs [193], and detecting defects and cracks [30], [48], [194], [195] using symbolic regression. Known as short crack symbolic regression (SCSR) in Figure 9(d), this knowledge-based method consists of three phases: data collection, domain-knowledge-guided machine learning, and model extension. Symbolic regression randomly generates an initial population of candidate expressions and identifies individuals with high fitness, which evolve through crossover to produce new offspring, as shown in the tree diagrams in Figure 9. Notably, orders-of-magnitude speed-ups can be realized when characterization tasks require less accuracy, such as detecting single-photon emitters [196] or tampering [40]. However, machine learning models are often limited by the quality and quantity of training data, the latter being a common problem for materials science research and characterization [186], [187], [197], [198], [199], [200]. In modern machine learning settings, there are often hundreds of thousands to millions of samples for training [32]. Recent advances in rapid fabrication techniques, such as combinatorial spread libraries [201], [202], human-automation workflows [203], and self-driving labs, enable larger data collection to improve characterization and experimental physical exploration. However, many of these methods are still in their infancy, and even the more mature combinatorial spread libraries [201] can only supplement this need with hundreds of samples a day when the measurements of interest are local, for example in scanning probe microscopies. In addition, the combinatorial search space is often exponentially large and intractable to search exactly [204]. To combat this low-sample problem, recent breakthroughs employ the following techniques, which we explore further in this section: (i) generative modeling for pretraining and (ii) active learning techniques using reinforcement learning and Bayesian optimization [200], [205], [206], [207], [208].

Figure 9: 
Machine learning applications in photonic device characterization. (a) Automated discovery using scanning probe microscopy. With the ubiquity of combinatorial spread libraries and scanning probe microscopy, active learning methods are required to acquire and interpret these spectroscopic measurements. The corresponding machine learning frameworks for automated SPM include FerroBOT and Feature-discovery, which follow predefined rules for decision-making. Reprinted with permission from ref. [197]. Copyright 2022 American Chemical Society. (b) Nanoparticle distance matrix characterization and discrimination using data augmentation and generative modeling. Adapted with permission from ref. [40]. Copyright 2024 SPIE Digital Library, Creative Commons Attribution 4.0 International License. (c) AI driven SEM. Adapted with permission from ref. [187]. Copyright 2023, UChicago Argonne, LLC, Operator of Argonne National Laboratory. (d) Symbolic regression in crack growth defects (SCSR). Adapted with permission from ref. [30]. Copyright 2024, the author(s) under exclusive license to The Korean Institute of Metals and Materials.

6.1 Physics-informed generative data augmentation for pre-training

Two powerful techniques for increasing training data in machine learning are data augmentation [209] and generative modeling [19]. Typically, data augmentation increases the number of training samples by applying random transformations to the training data, e.g., additive Gaussian noise, image manipulation, etc. If the augmentation is performed in a physics-informed manner, e.g., by applying generative modeling to learn the hyperparameters of the noise sources present during characterization, the generative models can then be used to pretrain a model on simulation data. For example, in recent work by Wilson et al. [40], shown in Figure 9(b), a set of 40 dark-field microscopy images was augmented into a 10,000-image dataset by applying a pre-trained segmentation model to extract segmented images of gold nanoparticles spread uniformly on silicon. This model, along with a labeled clustering algorithm, is used to calculate the distance matrix and nanoparticle radii for adversarial-tampering and natural-degradation sample types. The discriminator network is then trained by randomly selecting a synthetic tampering type based on the tampering Bernoulli distribution. The distribution of the nanoparticles was shown to be uniform, and the background Gaussian noise was measured. Given the segmented nanoparticle images and the measured background noise distribution, an arbitrary number of synthetic samples can mimic the characterization measurements by reproducing a uniform distribution of nanoparticles, each randomly augmented with stretching, rotation, etc., and adding background noise. These methods work well when the physical model is well known but its hyperparameters must be learned. When the physical model is unknown and several candidates are available, we instead turn to alternative Bayesian methods.
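The sketch below captures the spirit of such physics-informed augmentation in a highly simplified form: nanoparticles are approximated as Gaussian spots placed uniformly at random, and measured background noise is added; all hyperparameters are hypothetical, and the actual workflow in ref. [40] uses segmented particles from real dark-field images.

```python
import numpy as np

def synthesize_darkfield_image(rng, size=256, n_particles=40,
                               radius_mean=3.0, radius_std=0.5, noise_std=0.05):
    """Generate a synthetic measurement: nanoparticles placed uniformly at random
    (matching the measured spatial distribution) plus Gaussian background noise."""
    img = np.zeros((size, size))
    yy, xx = np.mgrid[0:size, 0:size]
    for _ in range(n_particles):
        cx, cy = rng.uniform(0, size, size=2)         # uniform particle positions
        r = max(rng.normal(radius_mean, radius_std), 1.0)
        img += np.exp(-((xx - cx) ** 2 + (yy - cy) ** 2) / (2 * r ** 2))
    img += rng.normal(0.0, noise_std, img.shape)      # measured background noise model
    return np.clip(img, 0.0, None)

rng = np.random.default_rng(0)
augmented_dataset = np.stack([synthesize_darkfield_image(rng) for _ in range(100)])
```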

6.2 Experimental physical discovery

The aim of automated physical discovery [210] is to infer an analytical model for a set of measurements [197], [199], [200] through Gaussian processes, active learning, and symbolic regression [30], [48], [195], [211], [212], [213], [214]. These approaches are typically employed when measurement data are sparse and time-consuming to collect and the physical model is not known a priori. For example, hypothesis learning, a variant of active learning that combines reinforcement learning and Bayesian optimization, starts from a set of candidate physical models ŷ^(i), such as Gaussian priors or analytic models with preconfigured hyperparameters, initialized with random parameter values. Each candidate is then evaluated using Bayesian inference with the objective of reducing predictive uncertainty. Following a greedy policy, the model with the minimal uncertainty is used to propose the next sample point ϕ^(t+1), and all candidates are updated with the new characterization measurement. Active learning thereby enables algorithms to prioritize and identify critical structural or design features by iteratively refining their understanding based on user-defined parameters or feedback, such as key microstructural elements selected from operator-defined signal aspects. Hypothesis learning has further been used to optimize functions governed by physical or competing models, for example tuning microscope resolution or exploring ferroelectric domain growth [197] (Figure 9(a)). While hypothesis learning and other active learning methods have been demonstrated on small examples, these methods will need to be augmented and improved to address larger fabrication-scale priors as automated physical discovery matures. Along the same lines, there are still few examples of active learning methods that integrate modern symbolic regression libraries for purely analytic modeling.
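The loop below is a minimal sketch of this kind of competing-model active learning, not the hypothesis-learning implementation of ref. [197]: a few candidate analytic models ŷ^(i) are refit to the accumulated measurements, reweighted by a Gaussian likelihood, and the next sample point ϕ^(t+1) is chosen greedily where the candidates disagree most (a simple stand-in for the Bayesian uncertainty criterion). The measurement function, candidate forms, and noise level are assumptions.

```python
import numpy as np
from scipy.optimize import curve_fit

rng = np.random.default_rng(1)

def measure(phi, sigma=0.05):
    """Stand-in for a characterization measurement (hidden truth: quadratic)."""
    return 0.3 * phi**2 + 0.1 * phi + rng.normal(0.0, sigma)

candidates = {  # competing physical hypotheses y_hat^(i)
    "linear": lambda phi, a, b: a * phi + b,
    "quadratic": lambda phi, a, b, c: a * phi**2 + b * phi + c,
    "exponential": lambda phi, a, b: a * np.exp(b * phi),
}

grid = np.linspace(-2.0, 2.0, 101)      # admissible sample points phi
phis = list(np.linspace(-2.0, 2.0, 4))  # seed measurements
ys = [measure(p) for p in phis]

for step in range(10):
    preds, log_likelihood = {}, {}
    for name, f in candidates.items():
        popt, _ = curve_fit(f, np.array(phis), np.array(ys), maxfev=10_000)
        preds[name] = f(grid, *popt)
        resid = np.array(ys) - f(np.array(phis), *popt)
        log_likelihood[name] = -0.5 * np.sum((resid / 0.05) ** 2)  # Gaussian noise model
    best = max(log_likelihood, key=log_likelihood.get)             # current best hypothesis
    disagreement = np.std(np.stack(list(preds.values())), axis=0)
    phi_next = grid[int(np.argmax(disagreement))]                  # greedy choice of phi^(t+1)
    phis.append(phi_next)
    ys.append(measure(phi_next))

print("Selected model after", len(phis), "measurements:", best)
```

In a real workflow, the Gaussian likelihood and disagreement heuristic would be replaced by the full Bayesian posterior and acquisition function of the chosen active learning framework.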

6.3 Summary and outlook

Treating characterization as an inference task, where the objective is to construct a symbolic model or to approximate the FOM, allows us to employ new techniques like symbolic regression and automated physical discovery. To make ML models viable in low-shot environments like characterization, where often only a small number of unique, weakly correlated samples is available, data augmentation techniques such as generative modeling and random transformations can be used to pre-train approximate FOM models. Symbolic regression and automated physical discovery, in turn, enable more efficient modeling of the underlying physical phenomena; a minimal example is sketched below. The biggest future opportunity lies in community-wide nanophotonic databases that would provide more characterization data for training these models.
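As one illustration of the symbolic-regression route, the sketch below uses the PySR library [48] to recover an analytic expression from a small synthetic measurement set; the data-generating function, operator set, and search budget are illustrative assumptions rather than a recommended configuration.

```python
import numpy as np
from pysr import PySRRegressor  # requires a Julia backend

rng = np.random.default_rng(0)
X = rng.uniform(-2, 2, size=(200, 2))                 # two measured variables (assumed)
y = 2.0 * np.cos(X[:, 0]) + X[:, 1] ** 2 + rng.normal(0, 0.05, 200)  # hidden "physics"

model = PySRRegressor(
    niterations=40,                    # search budget
    binary_operators=["+", "-", "*"],  # allowed operations
    unary_operators=["cos"],
)
model.fit(X, y)
print(model.sympy())  # best symbolic expression found, usable as an interpretable FOM surrogate
```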

7 Conclusion and future outlook

7.1 Conclusions

This work comprehensively reviews how machine learning-assisted photonic device development (ML-PDD) transforms the traditional, iterative photonic device development (PDD) approach. We break down the PDD process into five steps (theory, simulation, design, fabrication, and characterization) and showcase how machine learning (ML) algorithms improve each step of the process. This framework is presented in the context of discriminative models, which excel at representing sophisticated functions, and generative models, which excel at exploring large design spaces. Combining these techniques creates a data-driven approach that automates device exploration, speeds up the PDD process, and provides increasingly reliable real-world solutions.

The theory portion (Section 2) discusses how machine learning synergizes with the fundamental physics of electromagnetic theory to enrich the fidelity of photonic research. ML techniques like symbolic regression and discriminative models uncover and refine the underlying physical laws by proposing new, high-fidelity constitutive relationships in exotic media. Generative models provide insights into dominant mechanisms that aid performance through latent space engineering. Altogether, these developments pave the way for more comprehensive, accurate, and rapid electromagnetic modeling, linking photonic design directly to fundamental principles in ways that conventional analysis alone cannot achieve.

The simulation section (Section 3) highlights how ML approaches can assist or replace traditional simulation techniques like FDTD or RCWA. Discriminative models, which map design parameters to optical responses, act as fast, approximate simulators for specific problem areas. These techniques require extensive training data collected from numerical solvers or lab measurements. However, once trained, they can predict device behavior in a fraction of the time required by traditional techniques. Fortunately, research has shown that generative models can assist in data collection, offering data augmentation through synthetic examples that mimic actual or simulated measurements. This synergy helps to address the data scarcity problem that plagues many photonics workflows. Ultimately, the combination of these techniques is invaluable for the iterative optimization problems in photonic design, enabling the designer to search much larger design spaces than was previously possible while maintaining fidelity to established electromagnetic theory.

The design section (Section 4) showcases the latest techniques for the inverse design of photonic structures using machine learning. We highlight the latest techniques based on VAEs, GANs, diffusion models, reinforcement learning, and quantum-hybrid solvers. Generative models encode geometric features and material information into latent variables, allowing researchers to efficiently explore expansive design spaces, refine design features, and inversely optimize solutions. Reinforcement learning techniques provide precise solutions while minimizing the training data required to navigate the design space. Meanwhile, quantum-inspired and hybrid quantum-classical frameworks open new frontiers in photonic optimization by leveraging quantum annealers or factorization machines for combinatorial challenges. All these ML-driven advances in design reflect a growing shift in the research community: instead of manually guessing or tuning geometries, scientists are using neural networks to discover innovative device configurations that often outperform traditional techniques.

Next, we discuss how common imperfections in the fabrication process (Section 5), such as defects, misalignments, or tolerances, can be mitigated with modern ML techniques. We introduce straightforward solutions like smoothing techniques and “vaccination” against alignment errors. We then highlight discriminative models capable of predicting lithographic distortions or etching errors, thereby proactively correcting design layouts for real-world production steps. Lastly, we introduce reinforcement learning techniques that incorporate the fabrication process directly into the algorithm’s training environment, training the RL agent to propose designs that are robust to fabrication imperfections. We showcase algorithms at several levels of complexity, highlighting how the ML toolkit can be adapted to the fidelity required by the experimental demonstration.

The characterization section (Section 6) emphasizes how ML can streamline data-heavy or noise-limited tasks. Discriminative models trained on partial information can quickly infer relevant properties when measuring the performance of complex photonic chips or metasurfaces. These models drastically reduce the need for exhaustive measurement protocols and are critical for characterization tasks such as identifying single-photon emitters, detecting subtle surface defects, or reconstructing near-field images. We then showcase how realistic noise models can enable even more robust data augmentation techniques. Ultimately, ML-driven characterization yields more rapid insights into a device’s optical behavior, surpassing purely manual or physics-based interpolation.

Altogether, these new ML techniques demonstrate performance improvements across every step of the PDD. Discriminative models provide rapid forward mappings critical for speedy simulations, online process monitoring, and efficient characterization. Generative models offer unconstrained exploration of device topologies and parameter sets, resulting in exotic yet robust design solutions. Combining both approaches, together with physical knowledge and real fabrication constraints, automates and accelerates the entire PDD process.

7.2 Future outlook

In the short term, we expect ML techniques to continue improving the efficiency of photonic design in terms of both accuracy and cost. By incorporating physics-based models and fabrication constraints into the training process, these techniques will help ensure that designs are both physically sound and practically realizable. Hybrid modeling techniques that combine quantum annealers, latent space engineering, and high-fidelity surrogate models promise greater efficiency than today’s solutions in exploring complex design landscapes.

With regard to hybrid techniques that incorporate quantum annealers, however, these tools are still at an early stage, and significant progress is required in both quantum hardware development and algorithmic design before their capabilities can be reliably leveraged in device design. For the foreseeable future, we anticipate the most viable role for quantum annealing to be a heuristic subroutine embedded within classical machine learning frameworks, offering possible advantages in constrained combinatorial tasks such as QUBO problems, graph partitioning, or feature selection. Nevertheless, as Aaronson has emphasized, substantial hurdles such as noise, limited qubit connectivity, and embedding inefficiencies must be overcome before the true utility of quantum-optimization-assisted machine learning can be demonstrated [215]. Even so, incremental advances in quantum materials, co-design, and surrogate modeling may gradually make quantum optimization a more practical tool within machine learning.

As these approaches mature, we anticipate that the leading models will become more robust and will expand to larger design spaces. Along the way, they will become increasingly accessible, making photonics designers less reliant on extensive machine learning backgrounds for selecting, training, and deploying models. Eventually, we expect these models to integrate directly with commercially available simulation software, similar to how topology optimization is already available in tools like Lumerical or COMSOL. We anticipate that generative inverse design frameworks will follow a similar trajectory toward maturity and eventually become a familiar, user-friendly tool employed across the majority of photonic designs.

Finally, we anticipate that the creation and sharing of large datasets of photonic structures will standardize and expedite progress. We imagine a comprehensive, community-driven database similar to the Materials Project [216] and Atlas [217], which are already useful reference resources for optical data of homogeneous materials. For the development of metamaterials [218] and similarly complex photonic structures, however, there is no comparable effort to construct a unified dataset of characterization measurements and computational models for training machine learning models. With coordinated leadership, the photonics community could compile the datasets created throughout their training processes. Then, as the field evolves and machine-learning-assisted design becomes more prevalent, a central design database could vastly improve the PDD design cycle. Over time, ML will automate more and more of the PDD, improving technologies from integrated photonic circuits and topological photonics to advanced metasurfaces and quantum information systems.


Corresponding author: Alexandra Boltasseva, Elmore Family School of Electrical and Computer Engineering, Birck Nanotechnology Center, and Purdue Quantum Science and Engineering Institute, Purdue University, West Lafayette, IN 47907, USA; and Quantum Science Center, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA, E-mail: 

Yuheng Chen, Alexander Montes McNeil, Taehyuk Park, and Blake A. Wilson contributed equally to this work.


Award Identifier / Grant number: 80NSSC23K0195

Funding source: Purdue’s Elmore ECE Emerging Frontiers Center ‘The Crossroads of Quantum and AI’

Funding source: Department of Energy

Award Identifier / Grant number: FA9550-20-1-0124

Award Identifier / Grant number: DMR-2323910, DMR-2202268, DMR-2323908, DMR-2323909, ECCS-2430412

  1. Research funding: The Purdue team acknowledges the U.S. Department of Energy (DOE), Office of Science through the Quantum Science Center (QSC), a National Quantum Information Science Research Center, the Air Force Office of Scientific Research (AFOSR) award No. FA9550-20-1-0124, Purdue’s Elmore ECE Emerging Frontiers Center ‘The Crossroads of Quantum and AI’, and National Science Foundation (NSF) award DMR-2323910. The team at Northeastern University acknowledges the support from the NSF under Grant Nos. DMR-2202268, DMR-2323908, and ECCS-2430412. The Northeastern team also acknowledges the support of AMM as a Draper Scholar from The Charles Stark Draper Laboratory, Inc. The Georgia Tech team was supported in part by the NSF under Grant No. DMR-2323909, and in part by the Early-Stage Innovations (ESI) program of the National Aeronautics and Space Administration (NASA) under Grant No. 80NSSC23K0195 (subcontract from Baylor University; PI: Dr. Alan X. Wang).

  2. Author contributions: YC, AMM, TP, and BAW contributed equally to the conception, literature review, and drafting of the manuscript. VI, MB, J-IC, RO, PM, DKS, GC, PC, and TD assisted with research, figure preparation, and editing. AVK, VMS, MM, WC, YL, and AB supervised the project, provided critical revisions, and guided the overall structure and direction of the review. All authors have accepted responsibility for the entire content of this manuscript and consented to its submission to the journal, reviewed all the results and approved the final version of the manuscript.

  3. Conflict of interest: Authors state no conflict of interest.

  4. Data availability: Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.

References

[1] L. Eldada, “Advances in telecom and datacom optical components,” Opt. Eng., vol. 40, no. 7, p. 1165, 2001. https://doi.org/10.1117/1.1372703.Search in Google Scholar

[2] K. Yamada, et al.., “High-performance silicon photonics technology for telecommunications applications,” Sci. Technol. Adv. Mater., vol. 15, no. 2, p. 024603, 2014. https://doi.org/10.1088/1468-6996/15/2/024603.Search in Google Scholar PubMed PubMed Central

[3] C. Rogers, et al.., “A universal 3D imaging sensor on a silicon photonics platform,” Nature, vol. 590, no. 7845, pp. 256–261, 2021. https://doi.org/10.1038/s41586-021-03259-y.Search in Google Scholar PubMed

[4] V. Passaro, C. Tullio, B. Troia, M. Notte, G. Giannoccaro, and F. Leonardis, “Recent advances in integrated photonic sensors,” Sensors, vol. 12, no. 11, pp. 15558–15598, 2012. https://doi.org/10.3390/s121115558.Search in Google Scholar PubMed PubMed Central

[5] M. Shahbaz, M. A. Butt, and R. Piramidowicz, “A concise review of the progress in photonic sensing devices,” Photonics, vol. 10, no. 6, p. 698, 2023. https://doi.org/10.3390/photonics10060698.Search in Google Scholar

[6] F. Flamini, N. Spagnolo, and F. Sciarrino, “Photonic quantum information processing: a review,” Rep. Prog. Phys., vol. 82, no. 1, p. 016001, 2019. https://doi.org/10.1088/1361-6633/aad5b2.Search in Google Scholar PubMed

[7] A. González-Tudela, A. Reiserer, J. J. García-Ripoll, and F. J. García-Vidal, “Light–matter interactions in quantum nanophotonic devices,” Nat. Rev. Phys., vol. 6, no. 3, pp. 166–179, 2024. https://doi.org/10.1038/s42254-023-00681-1.Search in Google Scholar

[8] M. T. Raimondi, S. M. Eaton, M. M. Nava, M. Laganà, G. Cerullo, and R. Osellame, “Two-photon laser polymerization: from fundamentals to biomedical application in tissue engineering and regenerative medicine,” J. Appl. Biomater. Funct. Mater., vol. 10, no. 1, pp. 56–66, 2012. https://doi.org/10.5301/jabfm.2012.9278.Search in Google Scholar PubMed

[9] K. Yee, “Numerical solution of initial boundary value problems involving Maxwell’s equations in isotropic media,” IEEE Trans. Antenn. Propag., vol. 14, no. 3, pp. 302–307, 1966. https://doi.org/10.1109/tap.1966.1138693.Search in Google Scholar

[10] R. W. Clough, “The finite element in plane stress analysis,” in Proc. 2nd ASCE Confer. On Electric Computation, 1960, p. 1960.Search in Google Scholar

[11] M. Moharam and T. K. Gaylord, “Rigorous coupled-wave analysis of planar-grating diffraction,” JOSA, vol. 71, no. 7, pp. 811–818, 1981. https://doi.org/10.1364/josa.71.000811.Search in Google Scholar

[12] Z. A. Kudyshev, A. V. Kildishev, V. M. Shalaev, and A. Boltasseva, “Machine learning–assisted global optimization of photonic devices,” Nanophotonics, vol. 10, no. 1, pp. 371–383, 2020. https://doi.org/10.1515/nanoph-2020-0376.Search in Google Scholar

[13] G. Genty, et al.., “Machine learning and applications in ultrafast photonics,” Nat. Photonics, vol. 15, no. 2, pp. 91–101, 2021. https://doi.org/10.1038/s41566-020-00716-4.Search in Google Scholar

[14] Z. A. Kudyshev, V. M. Shalaev, and A. Boltasseva, “Machine learning for integrated quantum photonics,” ACS Photonics, vol. 8, no. 1, pp. 34–46, 2021. https://doi.org/10.1021/acsphotonics.0c00960.Search in Google Scholar

[15] Z. Liu, D. Zhu, L. Raju, and W. Cai, “Tackling photonic inverse design with machine learning,” Advanced Science, vol. 8, no. 5, p. 2002923, 2021. https://doi.org/10.1002/advs.202002923.Search in Google Scholar PubMed PubMed Central

[16] J. Jiang, M. Chen, and J. A. Fan, “Deep neural networks for the evaluation and design of photonic devices,” Nat. Rev. Mater., vol. 6, no. 8, pp. 679–700, 2021. https://doi.org/10.1038/s41578-020-00260-1.Search in Google Scholar

[17] W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photonics, vol. 15, no. 2, pp. 77–90, 2021. https://doi.org/10.1038/s41566-020-0685-y.Search in Google Scholar

[18] Y. Xu, B. Xiong, W. Ma, and Y. Liu, “Software-defined nanophotonic devices and systems empowered by machine learning,” Prog. Quantum Electron., vol. 89, p. 100469, 2023. https://doi.org/10.1016/j.pquantelec.2023.100469.Search in Google Scholar

[19] C. M. Bishop and N. M. Nasrabadi, Pattern Recognition and Machine Learning, vol. 4, New York, Springer, 2006.Search in Google Scholar

[20] B. E. Saleh and M. C. Teich, Fundamentals of Photonics, Hoboken, NJ, John Wiley & Sons, 2019.Search in Google Scholar

[21] L. Novotny and B. Hecht, Principles of Nano-Optics, Cambridge, Cambridge University Press, 2012.10.1017/CBO9780511794193Search in Google Scholar

[22] A. Ng and M. Jordan, “On discriminative vs. generative classifiers: a comparison of logistic regression and naive bayes,” Adv. Neural Inf. Process. Syst., vol. 14, pp. 841–848, 2001.Search in Google Scholar

[23] D. P. Kingma, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013.Search in Google Scholar

[24] I. Goodfellow, et al.., “Generative adversarial nets,” Adv. Neural Inf. Process. Syst., vol. 27, 2014. https://doi.org/10.48550/arXiv.1406.2661.Search in Google Scholar

[25] J. Ho, A. Jain, and P. Abbeel, “Denoising diffusion probabilistic models,” Adv. Neural Inf. Process. Syst., vol. 33, pp. 6840–6851, 2020.Search in Google Scholar

[26] A. Burnap, Y. Liu, Y. Pan, H. Lee, R. Gonzalez, and P. Y. Papalambros, “Estimating and exploring the product form design space using deep generative models,” in Volume 2A: 42nd Design Automation Conference, Charlotte, North Carolina, USA, American Society of Mechanical Engineers, 2016, p. V02AT03A013.10.1115/DETC2016-60091Search in Google Scholar

[27] M. Dhar, A. Grover, and S. Ermon, “Modeling sparse deviations for compressed sensing using generative models,” in International Conference on Machine Learning, PMLR, 2018, pp. 1214–1223.Search in Google Scholar

[28] F. Faruqi, et al.., “Style2Fab: functionality-aware segmentation for fabricating personalized 3D models with generative AI,” in Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology, San Francisco CA USA, ACM, 2023, pp. 1–13.10.1145/3586183.3606723Search in Google Scholar

[29] O. Yesilyurt, et al.., “Fabrication-conscious neural network based inverse design of single-material variable-index multilayer films,” Nanophotonics, vol. 12, no. 5, pp. 993–1006, 2023. https://doi.org/10.1515/nanoph-2022-0537.Search in Google Scholar PubMed PubMed Central

[30] S. Zhou, B. Yang, S. Xiao, G. Yang, and T. Zhu, “Interpretable machine learning method for modelling fatigue short crack growth behaviour,” Met. Mater. Int., vol. 30, no. 7, pp. 1944–1964, 2024. https://doi.org/10.1007/s12540-024-01628-6.Search in Google Scholar

[31] J. Peurifoy, et al.., “Nanophotonic particle simulation and inverse design using artificial neural networks,” Sci. Adv., vol. 4, no. 6, p. eaar4206, 2018. https://doi.org/10.1126/sciadv.aar4206.Search in Google Scholar PubMed PubMed Central

[32] J. Achiam, et al.., “Gpt-4 technical report,” arXiv preprint arXiv:2303.08774, 2023.Search in Google Scholar

[33] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-resolution image synthesis with latent diffusion models,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10684–10695.10.1109/CVPR52688.2022.01042Search in Google Scholar

[34] D. E. Rumelhart, G. E. Hinton, and R. J. Williams, “Learning representations by back-propagating errors,” Nature, vol. 323, no. 6088, pp. 533–536, 1986. https://doi.org/10.1038/323533a0.Search in Google Scholar

[35] A. Vaswani, “Attention is all you need,” Adv. Neural Inf. Process. Syst., vol. 30, 2017. https://doi.org/10.48550/arXiv.1706.03762.Search in Google Scholar

[36] A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” Adv. Neural Inf. Process. Syst., vol. 25, 2012.Search in Google Scholar

[37] A. Paszke, et al.., “Pytorch: an imperative style, high-performance deep learning library,” arXiv preprint arXiv:1912.01703, 2019.Search in Google Scholar

[38] D. P. Kingma and J. Ba, “Adam: a method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.Search in Google Scholar

[39] B. A. Wilson, Z. A. Kudyshev, A. V. Kildishev, S. Kais, V. M. Shalaev, and A. Boltasseva, “Machine learning framework for quantum sampling of highly-constrained, continuous optimization problems,” Appl. Phys. Rev., vol. 8, no. 4, p. 041418, 2021. https://doi.org/10.1063/5.0060481.Search in Google Scholar

[40] B. Wilson, et al.., “Authentication through residual attention-based processing of tampered optical responses,” Adv. Photonics, vol. 6, no. 5, p. 056002, 2024. https://doi.org/10.1117/1.ap.6.5.056002.Search in Google Scholar

[41] A. Bandi, P. V. S. R. Adapa, and Y. E. V. P. K. Kuchi, “The power of generative AI: a review of requirements, models, input–output formats, evaluation metrics, and challenges,” Future Internet, vol. 15, no. 8, p. 260, 2023. https://doi.org/10.3390/fi15080260.Search in Google Scholar

[42] A. Bondeson, T. Rylander, and P. Ingelström, Computational Electromagnetics, New York, Springer, 2012.10.1007/978-1-4614-5351-2Search in Google Scholar

[43] H. Yu, Y. Peng, Y. Yang, and Z.-Y. Li, “Plasmon-enhanced light–matter interactions and applications,” npj Comput. Mater., vol. 5, no. 1, p. 45, 2019. https://doi.org/10.1038/s41524-019-0184-1.Search in Google Scholar

[44] B. J. Shastri, et al.., “Photonics for artificial intelligence and neuromorphic computing,” Nat. Photonics, vol. 15, no. 2, pp. 102–114, 2021. https://doi.org/10.1038/s41566-020-00754-y.Search in Google Scholar

[45] X. Shu and Y. Ye, “Knowledge discovery: methods from data mining and machine learning,” Soc. Sci. Res., vol. 110, p. 102817, 2023, https://doi.org/10.1016/j.ssresearch.2022.102817.Search in Google Scholar PubMed

[46] Y. Liu, T. Zhao, W. Ju, and S. Shi, “Materials discovery and design using machine learning,” J. Materiomics, vol. 3, no. 3, pp. 159–177, 2017. https://doi.org/10.1016/j.jmat.2017.08.002.Search in Google Scholar

[47] Z. Li, R. Pestourie, Z. Lin, S. G. Johnson, and F. Capasso, “Empowering metasurfaces with inverse design: principles and applications,” ACS Photonics, vol. 9, no. 7, pp. 2178–2192, 2022. https://doi.org/10.1021/acsphotonics.1c01850.Search in Google Scholar

[48] M. Cranmer, “Interpretable machine learning for science with pysr and symbolicregression. jl,” arXiv preprint arXiv:2305.01582, 2023.Search in Google Scholar

[49] S. Kim, et al.., “Integration of neural network-based symbolic regression in deep learning for scientific discovery,” IEEE Transact. Neural Networks Learn. Syst., vol. 32, no. 9, pp. 4166–4177, 2020. https://doi.org/10.1109/tnnls.2020.3017010.Search in Google Scholar PubMed

[50] W. Li, et al.., “Deep learning modeling strategy for material science: from natural materials to metamaterials,” J. Phys. Mater., vol. 5, no. 1, p. 014003, 2022. https://doi.org/10.1088/2515-7639/ac5914.Search in Google Scholar

[51] A. M. Tartakovsky, C. O. Marrero, P. Perdikaris, G. D. Tartakovsky, and D. Barajas-Solano, “Learning parameters and constitutive relationships with physics informed deep neural networks,” arXiv preprint arXiv:1808.03398, 2018.Search in Google Scholar

[52] C. Yeung, et al.., “Elucidating the behavior of nanophotonic structures through explainable machine learning algorithms,” ACS Photonics, vol. 7, no. 8, pp. 2309–2318, 2020. https://doi.org/10.1021/acsphotonics.0c01067.Search in Google Scholar

[53] M. Elzouka, C. Yang, A. Albert, R. S. Prasher, and S. D. Lubner, “Interpretable forward and inverse design of particle spectral emissivity using common machine-learning models,” Cell Rep. Phys. Sci., vol. 1, no. 12, 2020, https://doi.org/10.1016/j.xcrp.2020.100259.Search in Google Scholar

[54] C. Shao, et al.., “Machine learning in short-reach optical systems: a comprehensive survey,” Photonics, vol. 11, no. 7, p. 613, 2024. https://doi.org/10.3390/photonics11070613.Search in Google Scholar

[55] A. Armghan, M. Alsharari, K. Aliqab, O. Alsalman, J. Parmar, and S. K. Patel, “Graphene twistronics: tuning the absorption spectrum and achieving metamaterial properties,” Mathematics, vol. 11, no. 7, p. 1579, 2023. https://doi.org/10.3390/math11071579.Search in Google Scholar

[56] J. Yun, S. Kim, S. So, M. Kim, and J. Rho, “Deep learning for topological photonics,” Adv. Phys. X, vol. 7, no. 1, p. 2046156, 2022. https://doi.org/10.1080/23746149.2022.2046156.Search in Google Scholar

[57] A. Venketeswaran, et al.., “Recent advances in machine learning for fiber optic sensor applications,” Adv. Intell. Syst., vol. 4, no. 1, p. 2100067, 2022. https://doi.org/10.1002/aisy.202100067.Search in Google Scholar

[58] R. Martinez-Manuel, L. M. Valentin-Coronado, J. Esquivel-Hernandez, K. J.-J. Monga, and S. LaRochelle, “Machine learning implementation for unambiguous refractive index measurement using a self-referenced fiber refractometer,” IEEE Sens. J., vol. 22, no. 14, pp. 14134–14141, 2022. https://doi.org/10.1109/jsen.2022.3183475.Search in Google Scholar

[59] X. Li, S. Ning, Z. Liu, Z. Yan, C. Luo, and Z. Zhuang, “Designing phononic crystal with anticipated band gap through a deep learning based data-driven method,” Comput. Methods Appl. Mech. Eng., vol. 361, p. 112737, 2020, https://doi.org/10.1016/j.cma.2019.112737.Search in Google Scholar

[60] S. Singh, R. Kumar, S. S. Panda, and R. S. Hegde, “Deep-learning enabled photonic nanostructure discovery in arbitrarily large shape sets via linked latent space representation learning,” Digit. Discov., vol. 3, no. 8, pp. 1612–1623, 2024. https://doi.org/10.1039/d4dd00107a.Search in Google Scholar

[61] W. Ma, F. Cheng, and Y. Liu, “Deep-learning-enabled on-demand design of chiral metamaterials,” ACS Nano, vol. 12, no. 6, pp. 6326–6334, 2018. https://doi.org/10.1021/acsnano.8b03569.Search in Google Scholar PubMed

[62] S. An, et al.., “Deep learning modeling approach for metasurfaces with high degrees of freedom,” Opt. Express, vol. 28, no. 21, pp. 31932–31942, 2020. https://doi.org/10.1364/oe.401960.Search in Google Scholar PubMed

[63] Y. Zhu, et al.., “Data augmentation using continuous conditional generative adversarial networks for regression and its application to improved spectral sensing,” Opt. Express, vol. 31, no. 23, pp. 37722–37739, 2023. https://doi.org/10.1364/oe.502709.Search in Google Scholar

[64] W. Ma, F. Cheng, Y. Xu, Q. Wen, and Y. Liu, “Probabilistic representation and inverse design of metamaterials based on a deep generative model with semi-supervised learning strategy,” Adv. Mater., vol. 31, no. 35, p. 1901111, 2019. https://doi.org/10.1002/adma.201901111.Search in Google Scholar PubMed

[65] D. Liu, Y. Tan, E. Khoram, and Z. Yu, “Training deep neural networks for the inverse design of nanophotonic structures,” ACS Photonics, vol. 5, no. 4, pp. 1365–1369, 2018. https://doi.org/10.1021/acsphotonics.7b01377.Search in Google Scholar

[66] S. So, J. Mun, and J. Rho, “Simultaneous inverse design of materials and structures via deep learning: demonstration of dipole resonance engineering using core–shell nanoparticles,” ACS Appl. Mater. Interfaces, vol. 11, no. 27, pp. 24264–24268, 2019. https://doi.org/10.1021/acsami.9b05857.Search in Google Scholar PubMed

[67] I. Malkiel, M. Mrejen, A. Nagler, U. Arieli, L. Wolf, and H. Suchowski, “Plasmonic nanostructure design and characterization via deep learning,” Light: Sci. Appl., vol. 7, no. 1, p. 60, 2018. https://doi.org/10.1038/s41377-018-0060-7.Search in Google Scholar PubMed PubMed Central

[68] P. R. Wiecha and O. L. Muskens, “Deep learning meets nanophotonics: a generalized accurate predictor for near fields and far fields of arbitrary 3d nanostructures,” Nano Lett., vol. 20, no. 1, pp. 329–338, 2019. https://doi.org/10.1021/acs.nanolett.9b03971.Search in Google Scholar PubMed

[69] I. Sajedian, T. Badloe, and J. Rho, “Optimisation of colour generation from dielectric nanostructures using reinforcement learning,” Opt. Express, vol. 27, no. 4, pp. 5874–5883, 2019. https://doi.org/10.1364/oe.27.005874.Search in Google Scholar PubMed

[70] N. Kovachki, et al.., “Neural operator: learning maps between function spaces with applications to pdes,” J. Mach. Learn. Res., vol. 24, no. 89, pp. 1–97, 2023.Search in Google Scholar

[71] Z. Li, et al.., “Fourier neural operator for parametric partial differential equations,” arXiv preprint arXiv:2010.08895, 2020.Search in Google Scholar

[72] K. Azizzadenesheli, N. Kovachki, Z. Li, M. Liu-Schiaffini, J. Kossaifi, and A. Anandkumar, “Neural operators for accelerating scientific simulations and design,” Nat. Rev. Phys., vol. 6, no. 5, pp. 320–328, 2024. https://doi.org/10.1038/s42254-024-00712-5.Search in Google Scholar

[73] J. Gu, et al.., “Neurolight: a physics-agnostic neural operator enabling parametric photonic device simulation,” Adv. Neural Inf. Process. Syst., vol. 35, pp. 14623–14636, 2022.Search in Google Scholar

[74] Y. Tang, et al.., “Physics-informed recurrent neural network for time dynamics in optical resonances,” Nat. Comput. Sci., vol. 2, no. 3, pp. 169–178, 2022. https://doi.org/10.1038/s43588-022-00215-2.Search in Google Scholar PubMed

[75] X. Ma, et al.., “Strategical deep learning for photonic bound states in the continuum,” Laser Photonics Rev., vol. 16, no. 10, p. 2100658, 2022. https://doi.org/10.1002/lpor.202100658.Search in Google Scholar

[76] M. Chen, et al.., “High speed simulation and freeform optimization of nanophotonic devices with physics-augmented deep learning,” ACS Photonics, vol. 9, no. 9, pp. 3110–3123, 2022. https://doi.org/10.1021/acsphotonics.2c00876.Search in Google Scholar

[77] Y. Qu, L. Jing, Y. Shen, M. Qiu, and M. Soljacic, “Migrating knowledge between physical scenarios based on artificial neural networks,” ACS Photonics, vol. 6, no. 5, pp. 1168–1174, 2019. https://doi.org/10.1021/acsphotonics.8b01526.Search in Google Scholar

[78] D. Xu, et al.., “Efficient design of a dielectric metasurface with transfer learning and genetic algorithm,” Opt. Mater. Express, vol. 11, no. 7, pp. 1852–1862, 2021. https://doi.org/10.1364/ome.427426.Search in Google Scholar

[79] J. H. Han, et al.., “Neural-network-enabled design of a chiral plasmonic nanodimer for target-specific chirality sensing,” ACS Nano, vol. 17, no. 3, pp. 2306–2317, 2023. https://doi.org/10.1021/acsnano.2c08867.Search in Google Scholar PubMed

[80] J. Kim, et al.., “Semi-supervised learning leveraging denoising diffusion probabilistic models for the characterization of nanophotonic devices,” Laser Photonics Rev., vol. 18, no. 10, p. 2300998, 2024. https://doi.org/10.1002/lpor.202300998.Search in Google Scholar

[81] W. Ma and Y. Liu, “A data-efficient self-supervised deep learning model for design and characterization of nanophotonic structures,” Sci. China Phys. Mech. Astron., vol. 63, no. 8, p. 284212, 2020. https://doi.org/10.1007/s11433-020-1575-2.Search in Google Scholar

[82] C. Leon and A. Scheinker, “Physics-constrained machine learning for electrodynamics without gauge ambiguity based on Fourier transformed Maxwell’s equations,” Sci. Rep., vol. 14, no. 1, p. 14809, 2024. https://doi.org/10.1038/s41598-024-65650-9.Search in Google Scholar PubMed PubMed Central

[83] Z. A. Kudyshev, A. V. Kildishev, V. M. Shalaev, and A. Boltasseva, “Machine-learning-assisted metasurface design for high-efficiency thermal emitter optimization,” Appl. Phys. Rev., vol. 7, no. 2, 2020, https://doi.org/10.1063/1.5134792.Search in Google Scholar

[84] Z. Liu, D. Zhu, S. P. Rodrigues, K.-T. Lee, and W. Cai, “Generative model for the inverse design of metasurfaces,” Nano Lett., vol. 18, no. 10, pp. 6570–6576, 2018. https://doi.org/10.1021/acs.nanolett.8b03171.Search in Google Scholar PubMed

[85] Z. Liu, L. Raju, D. Zhu, and W. Cai, “A hybrid strategy for the discovery and design of photonic structures,” IEEE J. Emerg. Sel. Topics Circuits Syst., vol. 10, no. 1, pp. 126–135, 2020. https://doi.org/10.1109/jetcas.2020.2970080.Search in Google Scholar

[86] A. Ghosh, M. Elhamod, J. Bu, W.-C. Lee, A. Karpatne, and V. A. Podolskiy, “Physics-informed machine learning for optical modes in composites,” Adv. Photonics Res., vol. 3, no. 11, p. 2200073, 2022. https://doi.org/10.1002/adpr.202200073.Search in Google Scholar

[87] Y. Chen, L. Lu, G. E. Karniadakis, and L. Dal Negro, “Physics-informed neural networks for inverse problems in nano-optics and metamaterials,” Opt. Express, vol. 28, no. 8, pp. 11618–11633, 2020. https://doi.org/10.1364/oe.384875.Search in Google Scholar

[88] Y. Chen and L. Dal Negro, “Physics-informed neural networks for imaging and parameter retrieval of photonic nanostructures from near-field data,” APL Photonics, vol. 7, no. 1, 2022, https://doi.org/10.1063/5.0072969.Search in Google Scholar

[89] M. Zhelyeznyakov, et al.., “Large area optimization of meta-lens via data-free machine learning,” Commun. Eng., vol. 2, no. 1, p. 60, 2023. https://doi.org/10.1038/s44172-023-00107-x.Search in Google Scholar

[90] J. Lim and D. Psaltis, “Maxwellnet: physics-driven deep neural network training based on Maxwell’s equations,” APL Photonics, vol. 7, no. 1, 2022, https://doi.org/10.1063/5.0071616.Search in Google Scholar

[91] J. Jiang and J. A. Fan, “Global optimization of dielectric metasurfaces using a physics-driven neural network,” Nano Lett., vol. 19, no. 8, pp. 5366–5372, 2019. https://doi.org/10.1021/acs.nanolett.9b01857.Search in Google Scholar PubMed

[92] J. Jiang and J. A. Fan, “Multiobjective and categorical global optimization of photonic structures based on resnet generative neural networks,” Nanophotonics, vol. 10, no. 1, pp. 361–369, 2020. https://doi.org/10.1515/nanoph-2020-0407.Search in Google Scholar

[93] C. Yeung, et al.., “Enhancing adjoint optimization-based photonic inverse design with explainable machine learning,” ACS Photonics, vol. 9, no. 5, pp. 1577–1585, 2022. https://doi.org/10.1021/acsphotonics.1c01636.Search in Google Scholar

[94] T. W. Hughes, M. Minkov, I. A. D. Williamson, and S. Fan, “Adjoint method and inverse design for nonlinear nanophotonic devices,” ACS Photonics, vol. 5, no. 12, pp. 4781–4787, 2018. https://doi.org/10.1021/acsphotonics.8b01522.Search in Google Scholar

[95] Y. Deng, S. Ren, J. Malof, and W. J. Padilla, “Deep inverse photonic design: a tutorial,” Photon. Nanostruct: Fundam. Appl., vol. 52, p. 101070, 2022, https://doi.org/10.1016/j.photonics.2022.101070.Search in Google Scholar

[96] Z. Liu, Z. Zhu, and W. Cai, “Topological encoding method for data-driven photonics inverse design,” Opt. Express, vol. 28, no. 4, pp. 4825–4835, 2020. https://doi.org/10.1364/oe.387504.Search in Google Scholar

[97] B. Wilson, Y. Chen, S. Kais, A. Kildishev, V. Shalaev, and A. Boltasseva, “Empowering quantum 2.0 devices and approaches with machine learning,” in Quantum 2.0 Conference and Exhibition, Boston, MA, Optica Publishing Group, 2022, p. QTu2A.13.10.1364/QUANTUM.2022.QTu2A.13Search in Google Scholar

[98] W. Ding, J. Chen, and R.-x. Wu, “A generative meta-atom model for metasurface-based absorber designs,” Adv. Opt. Mater., vol. 11, no. 2, p. 2201959, 2023. https://doi.org/10.1002/adom.202201959.Search in Google Scholar

[99] Y. Chen, et al.., “Advancing photonic design with topological latent diffusion generative model,” in Frontiers in Optics + Laser Science 2024 (FiO, LS), Denver, Colorado, Optica Publishing Group, 2024, p. JW5A.58.10.1364/FIO.2024.JW5A.58Search in Google Scholar

[100] L. Mascaretti, et al.., “Designing metasurfaces for efficient solar energy conversion,” ACS Photonics, vol. 10, no. 12, pp. 4079–4103, 2023. https://doi.org/10.1021/acsphotonics.3c01013.Search in Google Scholar PubMed PubMed Central

[101] P. Kumar, et al.., “Multi-solution inverse design in photonics using generative modeling,” JOSA B, vol. 41, no. 2, pp. A152–A160, 2024. https://doi.org/10.1364/josab.502923.Search in Google Scholar

[102] R. Lin, Z. Alnakhli, and X. Li, “Engineering of multiple bound states in the continuum by latent representation of freeform structures,” Photonics Res., vol. 9, no. 4, pp. B96–B103, 2021. https://doi.org/10.1364/prj.415655.Search in Google Scholar

[103] L. Wang, Y.-C. Chan, F. Ahmed, Z. Liu, P. Zhu, and W. Chen, “Deep generative modeling for mechanistic-based learning and design of metamaterial systems,” Comput. Methods Appl. Mech. Eng., vol. 372, p. 113377, 2020, https://doi.org/10.1016/j.cma.2020.113377.Search in Google Scholar

[104] M. Bezick, et al.., “Pearsan: a machine learning method for inverse design using pearson correlated surrogate annealing,” arXiv preprint arXiv:2412.19284, 2024.Search in Google Scholar

[105] H. Cai, S. Gehly, Y. Yang, R. Hoseinnezhad, R. Norman, and K. Zhang, “Multisensor tasking using analytical rényi divergence in labeled multi-Bernoulli filtering,” J. Guid. Control Dynam., vol. 42, no. 9, pp. 2078–2085, 2019. https://doi.org/10.2514/1.g004232.Search in Google Scholar

[106] F. Rodríguez-Santos, A. L. Quintanar-Reséndiz, G. Delgado-Gutiérrez, L. Palacios-Luengas, O. Jiménez-Ramírez, and R. Vázquez-Medina, “Identifying the digital camera from natural images using residual noise and the Jensen–Shannon divergence,” J. Electr. Comput. Eng., vol. 2022, no. 1, p. 1574024, 2022. https://doi.org/10.1155/2022/1574024.Search in Google Scholar

[107] B. A. Wilson, et al.., “Non-native quantum generative optimization with adversarial autoencoders,” arXiv preprint arXiv:2407.13830, 2024.Search in Google Scholar

[108] X. Wang, H. Chen, S. Tang, Z. Wu, and W. Zhu, “Disentangled representation learning,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 46, no. 12, pp. 9677–9696, 2024. https://doi.org/10.1109/tpami.2024.3420937.Search in Google Scholar PubMed

[109] B. Dai and D. Wipf, “Diagnosing and enhancing vae models,” arXiv preprint arXiv:1903.05789, 2019.Search in Google Scholar

[110] L. Raju, et al.., “Maximized frequency doubling through the inverse design of nonlinear metamaterials,” ACS Nano, vol. 16, no. 3, pp. 3926–3933, 2022. https://doi.org/10.1021/acsnano.1c09298.Search in Google Scholar PubMed

[111] J.-F. Masson, J. S. Biggins, and E. Ringe, “Machine learning for nanoplasmonics,” Nat. Nanotechnol., vol. 18, no. 2, pp. 111–123, 2023. https://doi.org/10.1038/s41565-022-01284-0.Search in Google Scholar PubMed

[112] Z. A. Kudyshev, A. V. Kildishev, V. M. Shalaev, and A. Boltasseva, “Optimizing startshot lightsail design: a generative network-based approach,” ACS Photonics, vol. 9, no. 1, pp. 190–196, 2021. https://doi.org/10.1021/acsphotonics.1c01352.Search in Google Scholar

[113] M. Kiani, J. Kiani, and M. Zolfaghari, “Conditional generative adversarial networks for inverse design of multifunctional metasurfaces,” Adv. Photonics Res., vol. 3, no. 11, p. 2200110, 2022. https://doi.org/10.1002/adpr.202200110.Search in Google Scholar

[114] N. Rane, “Transformers in material science: roles, challenges, and future scope,” SSRN Electron. J., 2023, https://doi.org/10.2139/ssrn.4609920.Search in Google Scholar

[115] T. Zhou, Q. Li, H. Lu, Q. Cheng, and X. Zhang, “GAN review: models and medical image fusion applications,” Inf. Fusion, vol. 91, pp. 134–148, 2023, https://doi.org/10.1016/j.inffus.2022.10.017.Search in Google Scholar

[116] C. Qian, R. K. Tan, and W. Ye, “An adaptive artificial neural network-based generative design method for layout designs,” Int. J. Heat Mass Transfer, vol. 184, p. 122313, 2022, https://doi.org/10.1016/j.ijheatmasstransfer.2021.122313.Search in Google Scholar

[117] C. You, et al.., “CT super-resolution GAN constrained by the identical, residual, and cycle learning ensemble (GAN-CIRCLE),” IEEE Trans. Med. Imag., vol. 39, no. 1, pp. 188–203, 2020. https://doi.org/10.1109/tmi.2019.2922960.Search in Google Scholar

[118] T. Christensen, et al.., “Predictive and generative machine learning models for photonic crystals,” Nanophotonics, vol. 9, no. 13, pp. 4183–4192, 2020. https://doi.org/10.1515/nanoph-2020-0197.Search in Google Scholar

[119] S. So and J. Rho, “Designing nanophotonic structures using conditional deep convolutional generative adversarial networks,” Nanophotonics, vol. 8, no. 7, pp. 1255–1261, 2019. https://doi.org/10.1515/nanoph-2019-0117.Search in Google Scholar

[120] C. Liu, W. M. Yu, Q. Ma, L. Li, and T. J. Cui, “Intelligent coding metasurface holograms by physics-assisted unsupervised generative adversarial network,” Photonics Res., vol. 9, no. 4, pp. B159–B167, 2021. https://doi.org/10.1364/prj.416287.Search in Google Scholar

[121] J. Jiang and J. A. Fan, “Simulator-based training of generative neural networks for the inverse design of metasurfaces,” Nanophotonics, vol. 9, no. 5, pp. 1059–1069, 2020. https://doi.org/10.1515/nanoph-2019-0330.Search in Google Scholar

[122] H. N. Bui, J.-S. Kim, and J.-W. Lee, “Design of tunable metasurface using deep neural networks for field localized wireless power transfer,” IEEE Access, vol. 8, pp. 194868–194878, 2020, https://doi.org/10.1109/access.2020.3033527.Search in Google Scholar

[123] D. Saxena and J. Cao, “Generative adversarial networks (GANs): challenges, solutions, and future directions,” ACM Comput. Surv., vol. 54, no. 3, pp. 1–42, 2022. https://doi.org/10.1145/3446374.Search in Google Scholar

[124] M. Wiatrak, S. V. Albrecht, and A. Nystrom, “Stabilizing generative adversarial networks: a survey,” arXiv preprint arXiv:1910.00927, 2019.Search in Google Scholar

[125] I. Deshpande, et al.., “Max-sliced wasserstein distance and its use for GANs,” in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, IEEE, 2019, pp. 10640–10648.10.1109/CVPR.2019.01090Search in Google Scholar

[126] I. Gulrajani, F. Ahmed, M. Arjovsky, V. Dumoulin, and A. C. Courville, “Improved training of wasserstein gans,” Adv. Neural Inf. Process. Syst., vol. 30, 2017.Search in Google Scholar

[127] T.-C. Wang, M.-Y. Liu, J.-Y. Zhu, A. Tao, J. Kautz, and B. Catanzaro, “High-resolution image synthesis and semantic manipulation with conditional gans,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018, pp. 8798–8807.10.1109/CVPR.2018.00917Search in Google Scholar

[128] C. Wang, E. Sharifnia, Z. Gao, S. H. Tindemans, and P. Palensky, “Generating multivariate load states using a conditional variational autoencoder,” Elec. Power Syst. Res., vol. 213, p. 108603, 2022, https://doi.org/10.1016/j.epsr.2022.108603.Search in Google Scholar

[129] Z. Wang, H. Zheng, P. He, W. Chen, and M. Zhou, “Diffusion-gan: training gans with diffusion,” arXiv preprint arXiv:2206.02262, 2022.Search in Google Scholar

[130] P. Dhariwal and A. Nichol, “Diffusion models beat gans on image synthesis,” Adv. Neural Inf. Process. Syst., vol. 34, pp. 8780–8794, 2021.Search in Google Scholar

[131] L. Yang, et al.., “Diffusion models: a comprehensive survey of methods and applications,” ACM Comput. Surv., vol. 56, no. 4, pp. 105:1–105:39, 2023. https://doi.org/10.1145/3626235.Search in Google Scholar

[132] H. Cao, et al.., “A survey on generative diffusion models,” IEEE Trans. Knowl. Data Eng., vol. 36, no. 7, pp. 2814–2830, 2024. https://doi.org/10.1109/tkde.2024.3361474.Search in Google Scholar

[133] J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli, “Deep unsupervised learning using nonequilibrium thermodynamics,” in International Conference on Machine Learning, PMLR, 2015, pp. 2256–2265.Search in Google Scholar

[134] S. Chan, et al.., “Tutorial on diffusion models for imaging and vision,” Found. Trends® Comput. Graph. Vis., vol. 16, no. 4, pp. 322–471, 2024. https://doi.org/10.1561/0600000112.Search in Google Scholar

[135] C. Luo, “Understanding diffusion models: a unified perspective,” arXiv preprint arXiv:2208.11970, 2022.Search in Google Scholar

[136] M. Chen, S. Mei, J. Fan, and M. Wang, “An overview of diffusion models: applications, guided generation, statistical rates and optimization,” arXiv preprint arXiv:2404.07771, 2024.Search in Google Scholar

[137] O. Ronneberger, P. Fischer, and T. Brox, “U-net: convolutional networks for biomedical image segmentation,” in Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, N. Navab, J. Hornegger, W. M. Wells, and A. F. Frangi, Eds., Cham, Springer International Publishing, 2015, pp. 234–241.10.1007/978-3-319-24574-4_28Search in Google Scholar

[138] Z. Zhang, C. Yang, Y. Qin, H. Feng, J. Feng, and H. Li, “Diffusion probabilistic model based accurate and high-degree-of-freedom metasurface inverse design,” Nanophotonics, vol. 12, no. 20, pp. 3871–3881, 2023. https://doi.org/10.1515/nanoph-2023-0292.Search in Google Scholar PubMed PubMed Central

[139] P. Chakravarthula, et al.., “Thin on-sensor nanophotonic array cameras,” ACM Trans. Graphics, vol. 42, no. 6, pp. 249:1–249:18, 2023. https://doi.org/10.1145/3618398.Search in Google Scholar

[140] L. Zhu, W. Hua, C. Lv, and Y. Liu, “Rapid inverse design of high degree of freedom meta-atoms based on the image-parameter diffusion model,” J. Lightwave Technol., vol. 42, no. 15, pp. 5269–5278, 2024. https://doi.org/10.1109/jlt.2024.3391924.Search in Google Scholar

[141] J. Sun, X. Chen, X. Wang, D. Zhu, and X. Zhou, “Photonic modes prediction via multi-modal diffusion model,” arXiv preprint arXiv:2401.08199, 2024. https://doi.org/10.1088/2632-2153/ad743f.Search in Google Scholar

[142] D. Silver, et al.., “A general reinforcement learning algorithm that masters chess, shogi, and go through self-play,” Science, vol. 362, no. 6419, pp. 1140–1144, 2018. https://doi.org/10.1126/science.aar6404.Search in Google Scholar PubMed

[143] R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction, Cambridge, A Bradford Book, 2018.Search in Google Scholar

[144] I. Sajedian, H. Lee, and J. Rho, “Double-deep q-learning to increase the efficiency of metasurface holograms,” Sci. Rep., vol. 9, no. 1, p. 10899, 2019. https://doi.org/10.1038/s41598-019-47154-z.Search in Google Scholar PubMed PubMed Central

[145] S. So, T. Badloe, J. Noh, J. Bravo-Abad, and J. Rho, “Deep learning enabled inverse design in nanophotonics,” Nanophotonics, vol. 9, no. 5, pp. 1041–1057, 2020. https://doi.org/10.1515/nanoph-2019-0474.Search in Google Scholar

[146] H. Wang, Z. Zheng, C. Ji, and L. J. Guo, “Automated multi-layer optical design via deep reinforcement learning,” Mach. Learn. Sci. Technol., vol. 2, no. 2, p. 025013, 2021. https://doi.org/10.1088/2632-2153/abc327.Search in Google Scholar

[147] S. Hooten, R. G. Beausoleil, and T. Van Vaerenbergh, “Inverse design of grating couplers using the policy gradient method from reinforcement learning,” Nanophotonics, vol. 10, no. 15, pp. 3843–3856, 2021. https://doi.org/10.1515/nanoph-2021-0332.Search in Google Scholar

[148] V. Mnih, “Playing atari with deep reinforcement learning,” arXiv preprint arXiv:1312.5602, 2013.Search in Google Scholar

[149] T. Badloe, I. Kim, and J. Rho, “Biomimetic ultra-broadband perfect absorbers optimised with reinforcement learning,” Phys. Chem. Chem. Phys., vol. 22, no. 4, pp. 2337–2342, 2020. https://doi.org/10.1039/c9cp05621a.Search in Google Scholar PubMed

[150] R. Li, et al.., “Deep reinforcement learning empowers automated inverse design and optimization of photonic crystals for nanoscale laser cavities,” Nanophotonics, vol. 12, no. 2, pp. 319–334, 2023. https://doi.org/10.1515/nanoph-2022-0692.Search in Google Scholar PubMed PubMed Central

[151] D. Witt, J. Young, and L. Chrostowski, “Reinforcement learning for photonic component design,” APL Photonics, vol. 8, no. 10, 2023, https://doi.org/10.1063/5.0159928.Search in Google Scholar

[152] C. Park, et al.., “Sample-efficient inverse design of freeform nanophotonic devices with physics-informed reinforcement learning,” Nanophotonics, vol. 13, no. 8, pp. 1483–1492, 2024. https://doi.org/10.1515/nanoph-2023-0852.Search in Google Scholar PubMed PubMed Central

[153] R. Acharya, et al.., “Quantum error correction below the surface code threshold,” Nature, vol. 638, pp. 920–926, 2024. https://doi.org/10.1038/s41586-024-08449-y.Search in Google Scholar PubMed PubMed Central

[154] S. Bravyi, A. W. Cross, J. M. Gambetta, D. Maslov, P. Rall, and T. J. Yoder, “High-threshold and low-overhead fault-tolerant quantum memory,” Nature, vol. 627, no. 8005, pp. 778–782, 2024. https://doi.org/10.1038/s41586-024-07107-7.Search in Google Scholar PubMed PubMed Central

[155] E. Farhi, J. Goldstone, and S. Gutmann, “A quantum approximate optimization algorithm,” arXiv preprint arXiv:1411.4028, 2014.Search in Google Scholar

[156] A. D. King, et al.., “Computational supremacy in quantum simulation,” arXiv preprint arXiv:2403.00910, 2024.Search in Google Scholar

[157] IBM Quantum, “Qiskit,” 2025. Accessed: Jan. 27, 2025.Search in Google Scholar

[158] Quantinuum, “tket: the quantum software development kit,” 2025. Accessed: Jan. 27, 2025.Search in Google Scholar

[159] D. Layden, et al.., “Quantum-enhanced Markov chain Monte Carlo,” Nature, vol. 619, no. 7969, pp. 282–287, 2023. https://doi.org/10.1038/s41586-023-06095-4.Search in Google Scholar PubMed

[160] T. Inoue, Y. Seki, S. Tanaka, N. Togawa, K. Ishizaki, and S. Noda, “Towards optimization of photonic-crystal surface-emitting lasers via quantum annealing,” Opt. Express, vol. 30, no. 24, p. 43503, 2022. https://doi.org/10.1364/oe.476839.Search in Google Scholar PubMed

[161] R. Lima Thomes, J. A. Mosquera-Sánchez, and C. De Marqui, “An investigation of optimal non-uniform locally resonant piezoelectric metamaterials,” in Active and Passive Smart Structures and Integrated Systems IX, J.-H. Han, S. Shahab, and G. Wang, Eds., SPIE, 2020, p. 37.10.1117/12.2558552Search in Google Scholar

[162] K. Kitai, et al.., “Designing metamaterials with quantum annealing and factorization machines,” Phys. Rev. Res., vol. 2, no. 1, p. 013319, 2020. https://doi.org/10.1103/physrevresearch.2.013319.Search in Google Scholar

[163] J. Nocedal and S. J. Wright, Numerical Optimization. Springer Series in Operations Research and Financial Engineering, 2nd ed. New York, NY, Springer, 2006.Search in Google Scholar

[164] E. Muñoz and M. Stolpe, “‘Generalized benders’ decomposition for topology optimization problems,” J. Global Optim., vol. 51, no. 1, pp. 149–183, 2011. https://doi.org/10.1007/s10898-010-9627-4.Search in Google Scholar

[165] Y. Liang and G. Cheng, “Topology optimization via sequential integer programming and canonical relaxation algorithm,” Comput. Methods Appl. Mech. Eng., vol. 348, pp. 64–96, 2019, https://doi.org/10.1016/j.cma.2018.10.050.Search in Google Scholar

[166] Z. Ye, X. Qian, and W. Pan, “Quantum topology optimization via quantum annealing,” IEEE Trans. Quantum Eng., vol. 4, pp. 1–15, 2023, https://doi.org/10.1109/tqe.2023.3266410.Search in Google Scholar

[167] Keras Team, “Keras documentation: Adam,” 2020. Available at: https://keras.io/api/optimizers/adam/.Search in Google Scholar

[168] J. Wurtz, S. H. Sack, and S.-T. Wang, “Solving nonnative combinatorial optimization problems using hybrid quantum-classical algorithms,” IEEE Trans. Quantum Eng., vol. 5, p. 3103114, 2024. https://doi.org/10.1109/tqe.2024.3443660.Search in Google Scholar

[169] J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl, “Algorithms for hyper-parameter optimization,” Adv. Neural Inf. Process. Syst., vol. 24, 2011.Search in Google Scholar

[170] J. Lu and J. Vučković, “Nanophotonic computational design,” Opt. Express, vol. 21, no. 11, pp. 13351–13367, 2013. https://doi.org/10.1364/oe.21.013351.Search in Google Scholar

[171] S. Molesky, Z. Lin, A. Y. Piggott, W. Jin, J. Vucković, and A. W. Rodriguez, “Inverse design in nanophotonics,” Nat. Photonics, vol. 12, no. 11, pp. 659–670, 2018. https://doi.org/10.1038/s41566-018-0246-9.Search in Google Scholar

[172] P. R. Wiecha, A. Arbouet, C. Girard, and O. L. Muskens, “Deep learning in nano-photonics: inverse design and beyond,” Photonics Res., vol. 9, no. 5, pp. B182–B200, 2021. https://doi.org/10.1364/prj.415960.Search in Google Scholar

[173] M. P. Bendsoe and O. Sigmund, Topology Optimization: Theory, Methods, and Applications, New York, Springer Science & Business Media, 2013.Search in Google Scholar

[174] H. Men, K. Y. Lee, R. M. Freund, J. Peraire, and S. G. Johnson, “Robust topology optimization of three-dimensional photonic-crystal band-gap structures,” Opt. Express, vol. 22, no. 19, pp. 22632–22648, 2014. https://doi.org/10.1364/oe.22.022632.Search in Google Scholar

[175] F. Wang, J. S. Jensen, and O. Sigmund, “Robust topology optimization of photonic crystal waveguides with tailored dispersion properties,” JOSA B, vol. 28, no. 3, pp. 387–397, 2011. https://doi.org/10.1364/josab.28.000387.Search in Google Scholar

[176] S. Khan, M. Hammood, N. A. Jaeger, and L. Chrostowski, “Fabrication-aware inverse design for shape optimization,” arXiv preprint arXiv:2410.07353, 2024.Search in Google Scholar

[177] R. E. Christiansen and O. Sigmund, “Inverse design in photonics by topology optimization: tutorial,” J. Opt. Soc. Am. B, vol. 38, no. 2, pp. 496–509, 2021. https://doi.org/10.1364/josab.406048.Search in Google Scholar

[178] D. Mengu, Y. Zhao, N. T. Yardimci, Y. Rivenson, M. Jarrahi, and A. Ozcan, “Misalignment resilient diffractive optical networks,” Nanophotonics, vol. 9, no. 13, pp. 4207–4219, 2020. https://doi.org/10.1515/nanoph-2020-0291.Search in Google Scholar

[179] A. Montes McNeil, Y. Li, A. Zhang, M. Moebius, and Y. Liu, “Fundamentals and recent developments of free-space optical neural networks,” J. Appl. Phys., vol. 136, no. 3, 2024, https://doi.org/10.1063/5.0215752.Search in Google Scholar

[180] D. Gostimirovic, D.-X. Xu, O. Liboiron-Ladouceur, and Y. Grinberg, “Deep learning-based prediction of fabrication-process-induced structural variations in nanophotonic devices,” ACS Photonics, vol. 9, no. 8, pp. 2623–2633, 2022. https://doi.org/10.1021/acsphotonics.1c01973.

[181] D. Gostimirovic, Y. Grinberg, D.-X. Xu, and O. Liboiron-Ladouceur, “Improving fabrication fidelity of integrated nanophotonic devices using deep learning,” ACS Photonics, vol. 10, no. 6, pp. 1953–1961, 2023. https://doi.org/10.1021/acsphotonics.3c00389.

[182] J. P. Lightstone, L. Chen, C. Kim, R. Batra, and R. Ramprasad, “Refractive index prediction models for polymers using machine learning,” J. Appl. Phys., vol. 127, no. 21, p. 215105, 2020. https://doi.org/10.1063/5.0008026.

[183] Y. Li, et al., “Electron transfer rules of minerals under pressure informed by machine learning,” Nat. Commun., vol. 14, no. 1, p. 1815, 2023. https://doi.org/10.1038/s41467-023-37384-1.

[184] S. K. Patel, J. Parmar, and V. Katkar, “Metasurface-based solar absorber with absorption prediction using machine learning,” Opt. Mater., vol. 124, p. 112049, 2022. https://doi.org/10.1016/j.optmat.2022.112049.

[185] V. Kuznetsova, A. Coogan, D. Botov, Y. Gromova, E. V. Ushakova, and Y. K. Gun’ko, “Expanding the horizons of machine learning in nanomaterials to chiral nanostructures,” Adv. Mater., vol. 36, no. 18, p. 2308912, 2024. https://doi.org/10.1002/adma.202308912.

[186] K. V. M. Krishna, R. Madhavan, M. V. Pantawane, R. Banerjee, and N. B. Dahotre, “Machine learning based de-noising of electron back scatter patterns of various crystallographic metallic materials fabricated using laser directed energy deposition,” Ultramicroscopy, vol. 247, p. 113703, 2023. https://doi.org/10.1016/j.ultramic.2023.113703.

[187] S. Kandel, et al., “Demonstration of an AI-driven workflow for autonomous high-resolution scanning microscopy,” Nat. Commun., vol. 14, no. 1, p. 5501, 2023. https://doi.org/10.1038/s41467-023-40339-1.

[188] J. Ojih, A. Rodriguez, J. Hu, and M. Hu, “Screening outstanding mechanical properties and low lattice thermal conductivity using global attention graph neural network,” Energy AI, vol. 14, p. 100286, 2023. https://doi.org/10.1016/j.egyai.2023.100286.

[189] D.-H. Xia, et al., “Electrochemical measurements used for assessment of corrosion and protection of metallic materials in the field: a critical review,” J. Mater. Sci. Technol., vol. 112, pp. 151–183, 2022. https://doi.org/10.1016/j.jmst.2021.11.004.

[190] W. Nash, T. Drummond, and N. Birbilis, “A review of deep learning in the study of materials degradation,” npj Mater. Degrad., vol. 2, no. 1, p. 37, 2018. https://doi.org/10.1038/s41529-018-0058-x.

[191] L. B. Coelho, D. Zhang, Y. Van Ingelgem, D. Steckelmacher, A. Nowé, and H. Terryn, “Reviewing machine learning of corrosion prediction in a data-oriented perspective,” npj Mater. Degrad., vol. 6, no. 1, p. 8, 2022. https://doi.org/10.1038/s41529-022-00218-4.

[192] Z. A. Kudyshev, et al., “Machine learning assisted quantum super-resolution microscopy,” Nat. Commun., vol. 14, no. 1, p. 4828, 2023. https://doi.org/10.1038/s41467-023-40506-4.

[193] A. Anantatamukala, K. M. Krishna, and N. B. Dahotre, “Generative adversarial networks assisted machine learning based automated quantification of grain size from scanning electron microscope back scatter images,” Mater. Charact., vol. 206, p. 113396, 2023. https://doi.org/10.1016/j.matchar.2023.113396.

[194] B. Hazela, et al., “Machine learning: supervised algorithms to determine the defect in high-precision foundry operation,” J. Nanomater., vol. 2022, no. 1, p. 1732441, 2022. https://doi.org/10.1155/2022/1732441.

[195] S. Zhou, B. Yang, S. Xiao, G. Yang, and T. Zhu, “Crack growth rate model derived from domain knowledge-guided symbolic regression,” Chin. J. Mech. Eng., vol. 36, no. 1, p. 40, 2023. https://doi.org/10.1186/s10033-023-00876-8.

[196] Z. A. Kudyshev, S. I. Bogdanov, T. Isacsson, A. V. Kildishev, A. Boltasseva, and V. M. Shalaev, “Rapid classification of quantum sources enabled by machine learning,” Adv. Quantum Technol., vol. 3, no. 10, p. 2000067, 2020. https://doi.org/10.1002/qute.202000067.

[197] M. A. Ziatdinov, et al., “Hypothesis learning in automated experiment: application to combinatorial materials libraries,” Adv. Mater., vol. 34, no. 20, p. 2201345, 2022. https://doi.org/10.1002/adma.202201345.

[198] M. K. Horton, S. Dwaraknath, and K. A. Persson, “Promises and perils of computational materials databases,” Nat. Comput. Sci., vol. 1, no. 1, pp. 3–5, 2021. https://doi.org/10.1038/s43588-020-00016-5.

[199] Y. K. Wakabayashi, T. Otsuka, Y. Krockenberger, H. Sawada, Y. Taniyasu, and H. Yamamoto, “Bayesian optimization with experimental failure for high-throughput materials growth,” npj Comput. Mater., vol. 8, no. 1, p. 180, 2022. https://doi.org/10.1038/s41524-022-00859-8.

[200] Y. Liu, et al., “Autonomous scanning probe microscopy with hypothesis learning: exploring the physics of domain switching in ferroelectric materials,” Patterns, vol. 4, no. 3, p. 100704, 2023. https://doi.org/10.1016/j.patter.2023.100704.

[201] B. N. Slautin, et al., “Co-orchestration of multiple instruments to uncover structure–property relationships in combinatorial libraries,” Digit. Discov., vol. 3, no. 8, pp. 1602–1611, 2024. https://doi.org/10.1039/d4dd00109e.

[202] R. Potyrailo, K. Rajan, K. Stoewe, I. Takeuchi, B. Chisholm, and H. Lam, “Combinatorial and high-throughput screening of materials libraries: review of state of the art,” ACS Comb. Sci., vol. 13, no. 6, pp. 579–633, 2011. https://doi.org/10.1021/co200007w.

[203] Y. Xie, K. Sattari, C. Zhang, and J. Lin, “Toward autonomous laboratories: convergence of artificial intelligence and experimental automation,” Prog. Mater. Sci., vol. 132, p. 101043, 2023. https://doi.org/10.1016/j.pmatsci.2022.101043.

[204] A. Ludwig, “Discovery of new materials using combinatorial synthesis and high-throughput characterization of thin-film materials libraries combined with computational methods,” npj Comput. Mater., vol. 5, no. 1, p. 70, 2019. https://doi.org/10.1038/s41524-019-0205-0.

[205] B. N. Slautin, et al., “Measurements with noise: Bayesian optimization for co-optimizing noise and property discovery in automated experiments,” arXiv preprint arXiv:2410.02717, 2024. https://doi.org/10.1039/D4DD00391H.

[206] S. V. Kalinin, et al., “Designing workflows for materials characterization,” Appl. Phys. Rev., vol. 11, no. 1, p. 011314, 2024. https://doi.org/10.1063/5.0169961.

[207] A. Ghosh, P. Gayathri, M. Shaikh, and S. Ghosh, “Structural mode coupling in perovskite oxides using hypothesis-driven active learning,” J. Phys. Mater., vol. 7, no. 2, p. 025014, 2024. https://doi.org/10.1088/2515-7639/ad3fea.

[208] H. Kim, H. Choi, D. Kang, W. B. Lee, and J. Na, “Materials discovery with extreme properties via reinforcement learning-guided combinatorial chemistry,” Chem. Sci., vol. 15, no. 21, pp. 7908–7925, 2024. https://doi.org/10.1039/d3sc05281h.

[209] A. Mumuni and F. Mumuni, “Data augmentation: a comprehensive survey of modern approaches,” Array, vol. 16, p. 100258, 2022. https://doi.org/10.1016/j.array.2022.100258.

[210] M. Omidvar, et al., “Accelerated discovery of perovskite solid solutions through automated materials synthesis and characterization,” Nat. Commun., vol. 15, no. 1, p. 6554, 2024. https://doi.org/10.1038/s41467-024-50884-y.

[211] D. Angelis, F. Sofos, and T. E. Karakasidis, “Artificial intelligence in physical sciences: symbolic regression trends and perspectives,” Arch. Comput. Methods Eng., vol. 30, no. 6, pp. 3845–3865, 2023. https://doi.org/10.1007/s11831-023-09922-z.

[212] W. Cai, A. Pacheco-Vega, M. Sen, and K.-T. Yang, “Heat transfer correlations by symbolic regression,” Int. J. Heat Mass Transfer, vol. 49, nos. 23–24, pp. 4352–4359, 2006. https://doi.org/10.1016/j.ijheatmasstransfer.2006.04.029.

[213] S. Kim, Novel Approaches to Discovery and Optimization in Physics: Symbolic Regression, Bayesian Optimization, and Topological Photonics, Ph.D. thesis, Massachusetts Institute of Technology, 2023.

[214] Q. Li, D. Macias, and A. Vial, “Modeling the optical properties of transparent and absorbing dielectrics by means of symbolic regression,” Opt. Express, vol. 30, no. 23, pp. 41862–41873, 2022. https://doi.org/10.1364/oe.468110.

[215] S. Aaronson, “Read the fine print,” Nat. Phys., vol. 11, no. 4, pp. 291–293, 2015. https://doi.org/10.1038/nphys3272.

[216] A. Jain, et al., “Commentary: the materials project: a materials genome approach to accelerating materials innovation,” APL Mater., vol. 1, no. 1, p. 011002, 2013. https://doi.org/10.1063/1.4812323.

[217] J. Hu, et al., “Materialsatlas.org: a materials informatics web app platform for materials discovery and survey of state-of-the-art,” npj Comput. Mater., vol. 8, no. 1, p. 65, 2022. https://doi.org/10.1038/s41524-022-00750-6.

[218] Y.-C. Chan, F. Ahmed, L. Wang, and W. Chen, “Metaset: exploring shape and property spaces for data-driven metamaterials design,” J. Mech. Des., vol. 143, no. 3, p. 031707, 2020. https://doi.org/10.1115/1.4048629.

Received: 2025-02-01
Accepted: 2025-06-07
Published Online: 2025-07-03

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.
