Inverse design of nanophotonic devices enabled by optimization algorithms and deep learning: recent achievements and future prospects

Junhyeong Kim; Jae-Yong Kim; Jungmin Kim; Yun Hyeong; Berkay Neseli; Jong-Bum You; Joonsup Shim; Jonghwa Shin; Hyo-Hoon Park; Hamza Kurt

doi:10.1515/nanoph-2024-0536

Article Open Access


Published/Copyright: January 27, 2025


From the journal Nanophotonics Volume 14 Issue 2

Abstract

Nanophotonics, which explores light–matter interactions at the nanoscale, has facilitated significant advancements across numerous research fields. A key objective in this area is the design of ultra-compact, high-performance nanophotonic devices to pave the way for next-generation photonics. While conventional brute-force, intuition-based forward design methods have produced successful nanophotonic solutions over the past several decades, recent developments in optimization methods and artificial intelligence offer new potential to expand these capabilities. In this review, we delve into the latest progress in the inverse design of nanophotonic devices, where AI and optimization methods are leveraged to automate and enhance the design process. We discuss representative methods commonly employed in nanophotonic design, including various meta-heuristic algorithms such as trajectory-based, evolutionary, and swarm-based approaches, in addition to adjoint-based optimization. Furthermore, we explore state-of-the-art deep learning techniques, involving discriminative models, generative models, and reinforcement learning. We also introduce and categorize several notable inverse-designed nanophotonic devices and their respective design methodologies. Additionally, we summarize the open-source inverse design tools and commercial foundries. Finally, we provide our perspectives on the current challenges of inverse design, while offering insights into future directions that could further advance this rapidly evolving field.

Keywords: nanophotonics; silicon photonics; inverse design; optimization; artificial intelligence; deep learning

1 Introduction

Nanophotonics, which facilitates light–matter interactions at the nanoscale, has become a major area of interest in modern optics and photonics. It can pave the way for various applications including high-speed data communications [1], data centers [2], [3], computing [4], [5], [6], [7], energy-saving [8], [9], [10], healthcare [11], [12], [13], [14], and sensing [14], [15], [16], [17], among others. To achieve these applications, it is important to design nanophotonic devices, including power splitters, grating couplers, resonators, wavelength multiplexers/de-multiplexers, modulators, switches, nanoscale lasers, photodetectors, and metasurfaces. Traditionally, devices were designed using a forward design process, where the designer determines the structural parameters based on intuition and expertise, followed by numerical calculations (e.g. finite element method [FEM], finite-difference time-domain [FDTD] method, transfer-matrix method [TMM], or rigorous coupled-wave analysis [RCWA]) to assess optical performance. Although this approach is easy to implement, intuitive, and delivers reliable results, it is often limited by high computational costs and a limited degree of freedom, which hinders the exploration of large-scale designs. In contrast, the emerging inverse design method reverses this process, overcoming these limitations. Structural design can be automatically determined by optimization algorithms or artificial intelligence (AI) based on desired optical characteristics. Inverse design methods offer a greater degree of freedom and higher potential for global optimization through an automated design process. Leveraging these advantages, several efforts in the 2000s focused on using inverse design for nanophotonic devices through optimization algorithms, leading to the discovery of high-performance devices previously unattainable [18], [19].

Recently, AI has achieved remarkable advancements across various fields such as image processing, healthcare, and material design. In many other areas of science, AI has the potential to push the boundaries of what is possible, such as by accelerating drug discovery, enhancing climate modelling, and enabling the creation of autonomous systems that can interact with complex environments. With the rapid development of AI, AI-based inverse design techniques have been widely applied to nanophotonic device design over the past decade [20], [21], [22]. From conventional deep learning methods to state-of-the-art generative models, successful demonstrations have established the potential of these design approaches.

In this article, we introduce the current progress of inverse design methods and their corresponding designs in nanophotonics. First, the theoretical details of the inverse design methods are discussed. These methods can be classified into three categories: meta-heuristic algorithm-based inverse design, adjoint methods, and AI-based inverse design. Various optimization algorithms, including trajectory-based, evolutionary, and swarm-based algorithms within meta-heuristic algorithms, as well as adjoint methods, are explored. For the AI-based inverse design algorithms, discriminative methods, generative models, and reinforcement learning techniques are discussed. Following detailed descriptions of the inverse design methods, we present selected examples of various inverse-designed nanophotonic devices, including photonic power splitters, wavelength (de)multiplexers, grating couplers, waveguide devices, and metasurfaces. We also introduce open-source inverse design tools and available fabrication facilities, which can serve as useful resources for newcomers. Finally, we address some challenges in the inverse design of nanophotonic devices, aiming to overcome obstacles in the development of future technologies.

2 Inverse design algorithms

Because conventional forward design approaches offer a low degree of freedom, they have encountered limitations in optimizing device footprint and performance. To address this, numerous optimization algorithms have been implemented for the inverse design of nanophotonic devices over the past two decades. A wide range of approaches, including meta-heuristic algorithms, gradient-based optimization algorithms (e.g. the adjoint method), and AI-based techniques, can be applied to inverse design. In the following sections, we introduce these methods along with their theoretical backgrounds and representative examples. Each of these algorithms has its own advantages and disadvantages, so the inverse design algorithm must be chosen carefully, considering computational cost, the possibility of global optimization, and the difficulty of implementation.

2.1 Meta-heuristic algorithms

Meta-heuristic algorithms are problem-independent approaches that find near-optimal solutions to complex optimization problems at reasonable computational costs. These algorithms are widely applied to complex problems where an exact solution is difficult to obtain and trial-and-error approaches are impractical. Meta-heuristic algorithms can be classified as either trajectory-based or population-based, depending on the number of solutions considered during the optimization process. Furthermore, population-based algorithms can be classified as evolutionary or swarm-based, based on their exploration strategies within the search space. In this section, we give an overview of several meta-heuristic algorithms commonly used for the inverse design of nanophotonic devices. First, we introduce trajectory-based algorithms, including hill-climbing and direct binary search algorithms. Next, we discuss evolutionary algorithms, such as genetic algorithms and differential evolution. Finally, swarm-based algorithms, including particle swarm optimization and ant colony optimization, are explained.

2.1.1 Trajectory-based algorithms

Trajectory algorithms explore the search space by making small incremental changes to the current solution. These algorithms follow a specific trajectory in the solution space, selecting locally optimal choices at each step. However, they do not always achieve a global optimum and can be classified as greedy algorithms. In this section, we introduce several trajectory algorithms commonly implemented for the inverse design of nanophotonic devices, namely the hill-climbing algorithm and direct binary search algorithm.

The hill-climbing algorithm is a well-known and intuitive method that begins with an initial solution and iteratively makes small changes to improve it. For example, assume that there are several design parameters, such as period, width, and length. Starting from the initial guess, one of these parameters is perturbed, and the resulting optical performance is evaluated. If the performance improves upon the previous state, the algorithm accepts the new parameter value and repeats the process.

Conversely, if the updated performance does not exceed the previous state, the algorithm retains the current parameter and proceeds to the next one. Consequently, all parameters are updated until the algorithm reaches a local optimum. The hill-climbing algorithm is advantageous due to its simplicity, intuitive process, and ease of implementation. Moreover, it can be readily extended with additional strategies or constraints and can efficiently find local optima when a quick solution is required. However, this method has several drawbacks. First, it often fails to find the global optimum, leading to suboptimal performance compared to other algorithms. Second, the algorithm is highly sensitive to the initial guess, which complicates the search for optimal solutions.
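The accept-or-retain loop described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation used in any cited work: `toy_fom` is a hypothetical stand-in for a full-wave simulation, `TARGET`, the step size, and the starting point are invented for the example, and the figure of merit is maximized.

```python
def hill_climb(fom, params, step=0.05, max_sweeps=200):
    """Coordinate-wise hill climbing that maximizes fom(params)."""
    best = list(params)
    best_score = fom(best)
    for _ in range(max_sweeps):
        improved = False
        for i in range(len(best)):           # visit one parameter at a time
            for delta in (step, -step):      # try a small change in each direction
                trial = list(best)
                trial[i] += delta
                score = fom(trial)
                if score > best_score:       # accept only if performance improves
                    best, best_score = trial, score
                    improved = True
                    break
        if not improved:                     # no parameter can be improved:
            break                            # local optimum for this step size
    return best, best_score


# Hypothetical optimum for (period, width, length); a real FoM would call a solver.
TARGET = (0.50, 0.22, 3.00)


def toy_fom(p):
    return -sum((a - b) ** 2 for a, b in zip(p, TARGET))


start = [0.30, 0.40, 2.00]
best_params, best_score = hill_climb(toy_fom, start)
```

Because the toy figure of merit has a single peak, the search ends within one step of `TARGET`; on a multimodal landscape the same loop would stop at whichever local optimum is nearest the initial guess, which is the sensitivity noted above.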

The direct binary search (DBS) algorithm is another well-known and intuitive method for designing nanophotonic devices. Similar to the hill-climbing algorithm, it updates the design space iteratively, but the key difference is that the DBS algorithm makes binary changes to the design parameters. It operates in a discrete search space where the design is represented in a binary format. Each element or pixel of the design can be turned on or off; for example, the state can be selected between two different materials (e.g. silicon, silicon dioxide, or air). Starting from the initial guess, the algorithm iteratively flips the state of individual pixels and evaluates the optical performance. If a flip improves performance, the algorithm keeps the new state and moves on. If not, the algorithm restores the previous state and flips another pixel. Pixels are visited in random order until no further improvement is found, resulting in a locally optimized design. Because nanophotonic devices are composed of a small number of discrete materials, this binary nature makes DBS a straightforward choice for design spaces that are inherently discrete. However, the DBS algorithm can easily get stuck in local optima, missing the global optimum, and its performance is highly sensitive to the initial design. This method can also be computationally expensive for large design spaces due to the iterative evaluation process, and therefore may not scale well with increasing problem size or complexity.
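The flip-evaluate-keep-or-revert loop of DBS can be sketched as follows. This is a hedged illustration: `toy_fom`, the 12-pixel `TARGET` pattern, and the random visiting order are assumptions made for the example, with 1 and 0 standing for two materials such as silicon and air.

```python
import random


def direct_binary_search(fom, n_pixels, max_sweeps=50, seed=0):
    """Flip one pixel at a time; keep the flip only if the FoM improves."""
    rng = random.Random(seed)
    design = [rng.randint(0, 1) for _ in range(n_pixels)]  # random initial guess
    best = fom(design)
    for _ in range(max_sweeps):
        order = list(range(n_pixels))
        rng.shuffle(order)                 # visit pixels in random order
        changed = False
        for i in order:
            design[i] ^= 1                 # toggle the pixel state
            score = fom(design)
            if score > best:
                best = score               # keep the flipped state
                changed = True
            else:
                design[i] ^= 1             # revert the flip
        if not changed:                    # a full sweep without improvement
            break
    return design, best


# Hypothetical target pattern; a real FoM would come from an EM simulation.
TARGET = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1]


def toy_fom(design):
    return sum(int(a == b) for a, b in zip(design, TARGET))


design, score = direct_binary_search(toy_fom, len(TARGET))
```

Each sweep costs one figure-of-merit evaluation per pixel, which is why the cost of DBS grows quickly with the size of the design region when every evaluation is a full simulation.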

Several examples of inverse-designed nanophotonic devices leveraging trajectory-based algorithms are shown in Figure 1. These successful design results demonstrate that these methods hold significant potential for designing various kinds of nanophotonic devices, especially digitized structures.

Figure 1:

Representative examples of inverse-designed nanophotonic devices enabled by trajectory-based algorithms. (A) Schematic of the photonic spin selector inverse-designed by the direct binary search algorithm. (B) SEM image of the inverse-designed waveguide device on an SOI platform, its corresponding optical responses, and the flow chart of the trajectory-based algorithm. (C) SEM image of the tunable photonic crystal nanocavity inverse-designed by the hill-climbing algorithm (left) and corresponding optical responses (right). (A) is reprinted from Ref. [23], with permission (CC BY 4.0); (B) is adapted from Ref. [24]; (C) is reprinted from Ref. [25], with permission from Optica Publishing Group.

2.1.2 Evolutionary algorithms

Evolutionary algorithms, inspired by principles of natural selection and genetics, are widely applied optimization techniques. These algorithms are implemented to solve complex optimization problems iteratively, by improving a set of solutions in the current generation based on mechanisms such as reproduction, mutation, selection, and recombination. Evolutionary algorithms are widely applied to inverse-design problems across various research fields and are considered powerful tools for designing nanophotonic devices. In this section, we introduce several evolutionary algorithms commonly used for the inverse design of nanophotonic devices, namely the genetic algorithm and differential evolution.

The genetic algorithm (GA) is one of the most popular evolutionary algorithms [26]. This algorithm mimics the process of natural evolution to find near-optimal solutions. In GA, each solution in the generation corresponds to a chromosome, a structured array containing the variables to be optimized. Each variable, referred to as a gene, represents a specific design parameter. In the context of photonic device design, a chromosome comprises the variables to be optimized. These parameters can be the width, radius, position, or refractive index information of the unit cells inside the design area. Therefore, a gene can correspond to a single variable within the array of a chromosome, such as the radius of a photonic crystal at a specific location. In the initial stage, GA generates a population of individuals randomly. The algorithm then evaluates the fitness of each individual in the population and selects the best solutions using methods such as roulette wheel, tournament, or rank-based selection. After selecting the individuals, the algorithm combines two parent solutions to produce offspring through a process known as crossover (or recombination). After creating offspring (child solutions), the mutation process introduces small random changes to genes to maintain genetic diversity and avoid premature convergence. After a sufficient number of generations or once satisfactory convergence of population has been achieved, the algorithm is terminated.
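The GA loop described above (random initialization, fitness evaluation, selection, crossover, mutation) can be sketched as below. Several choices are assumptions made for illustration rather than anything prescribed by the text: genes are normalized to [0, 1], tournament selection is used, crossover is single-point, the best individual is copied unchanged into the next generation (elitism), and `toy_fitness` with its `TARGET` vector replaces an electromagnetic simulation.

```python
import random


def genetic_algorithm(fitness, n_genes, pop_size=30, generations=40,
                      mutation_rate=0.1, seed=0):
    rng = random.Random(seed)
    # Each chromosome is a list of genes (e.g. normalized hole radii).
    population = [[rng.random() for _ in range(n_genes)] for _ in range(pop_size)]
    history = []                                   # best fitness per generation

    def tournament():
        a, b = rng.sample(population, 2)           # pick two, keep the fitter one
        return a if fitness(a) >= fitness(b) else b

    for _ in range(generations):
        elite = max(population, key=fitness)
        history.append(fitness(elite))
        offspring = [list(elite)]                  # elitism (an added assumption)
        while len(offspring) < pop_size:
            p1, p2 = tournament(), tournament()
            cut = rng.randint(1, n_genes - 1)      # single-point crossover
            child = p1[:cut] + p2[cut:]
            for g in range(n_genes):               # mutation: small random change
                if rng.random() < mutation_rate:
                    child[g] = min(1.0, max(0.0, child[g] + rng.gauss(0.0, 0.05)))
            offspring.append(child)
        population = offspring
    return max(population, key=fitness), history


# Hypothetical optimum; a real fitness would be computed from simulated spectra.
TARGET = [0.2, 0.8, 0.5, 0.3, 0.9, 0.1]


def toy_fitness(chromosome):
    return -sum((g - t) ** 2 for g, t in zip(chromosome, TARGET))


best, history = genetic_algorithm(toy_fitness, n_genes=len(TARGET))
```

With elitism the best fitness never decreases from one generation to the next, which makes `history` a convenient convergence check for the termination criterion mentioned above.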

Differential evolution (DE) is another evolutionary algorithm similar in concept to GA [27], but particularly effective for continuous and high-dimensional optimization problems. Unlike GA, DE employs unique mechanisms for mutation and crossover, which makes it particularly well-suited for optimizing complex and nonlinear functions. In DE, the mutation process is carried out between randomly chosen vectors. The difference between two randomly chosen vectors is scaled by a mutation factor, F, and then added to a third randomly chosen vector, resulting in a mutant vector. Crossover in DE results in a trial vector, which is obtained by the combination between the mutant vector and a vector in the population. Each component of the trial vector is chosen from either the mutant vector or the target vector based on a predefined crossover probability, CR.
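The mutation and crossover steps of DE can be sketched as follows. This minimal example assumes the common rand/1/bin variant; the greedy replacement of the target vector, the forced index `j_rand`, the clipping to `bounds`, and the `sphere` test function are standard choices or illustrative assumptions that the text does not specify.

```python
import random


def differential_evolution(objective, bounds, pop_size=20, F=0.8, CR=0.9,
                           generations=150, seed=0):
    """Minimize objective(x) with DE/rand/1/bin."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    cost = [objective(x) for x in pop]
    for _ in range(generations):
        for i in range(pop_size):
            # Three distinct random vectors, all different from the target i.
            a, b, c = rng.sample([k for k in range(pop_size) if k != i], 3)
            # Mutation: scaled difference of two vectors added to a third.
            mutant = [pop[a][d] + F * (pop[b][d] - pop[c][d]) for d in range(dim)]
            # Crossover: take each component from the mutant with probability CR;
            # j_rand guarantees at least one mutant component.
            j_rand = rng.randrange(dim)
            trial = [mutant[d] if (rng.random() < CR or d == j_rand) else pop[i][d]
                     for d in range(dim)]
            trial = [min(hi, max(lo, v)) for v, (lo, hi) in zip(trial, bounds)]
            trial_cost = objective(trial)
            if trial_cost <= cost[i]:              # greedy selection (assumption)
                pop[i], cost[i] = trial, trial_cost
    best = min(range(pop_size), key=cost.__getitem__)
    return pop[best], cost[best]


def sphere(x):
    # Hypothetical smooth objective standing in for a simulated figure of merit.
    return sum(v * v for v in x)


x_best, cost_best = differential_evolution(sphere, [(-1.0, 1.0)] * 3)
```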

Several inverse-designed nanophotonic devices leveraging evolutionary algorithms are shown in Figure 2. These results highlight the potential of these methods for creating various nanophotonic devices for different applications.

Figure 2:

Representative examples of inverse-designed nanophotonic devices enabled by evolutionary algorithms. (A) Schematic of the inverse-designed metalens for near-infrared applications (left) and flowchart of the genetic algorithm and structural examples of the algorithm (right). (B) Flowchart of the genetic algorithm and the schematic of inverse-designed metasurface absorber. (C) Schematic of the dielectric nano-antenna inverse-designed by the differential evolution algorithm. (D) Flowchart of the segmented hierarchical evolutionary algorithm and full-color meta-hologram of optimized metasurfaces. (A) is reprinted from Ref. [28], under the terms of the Open Access Publishing Agreement; (B) is reprinted from Ref. [29], with permission. Copyright 2024 Elsevier; (C) is reprinted from Ref. [30], with permission from Optica Publishing Group; (D) is reprinted from Ref. [31], with permission. Copyright 2024 American Chemical Society.

2.1.3 Swarm-based algorithms

Swarm-based algorithms are a class of optimization techniques inspired by the collective intelligence observed in natural systems, such as flocks of birds, schools of fish, and colonies of ants. Unlike single-agent optimization methods, swarm-based optimization methods utilize multiple agents that share information to find optimal solutions. During this process, even if some of the agents fall into local optima, the solution can reach the global optimum due to other agents. However, computational costs increase proportionally with the number of agents, which must be managed carefully based on the task. In this section, we introduce several swarm-based algorithms that are commonly implemented for the inverse design of nanophotonic devices, namely particle swarm optimization and ant colony optimization.

Particle swarm optimization (PSO) is a stochastic optimization algorithm proposed by Kennedy and Eberhart in 1995 [32]. It was inspired by the behaviour of a flock of birds: when a bird searches randomly for food, the entire flock benefits from its discoveries, thereby enhancing the group’s overall success in finding food. This concept has been applied to various optimization problems, especially for the inverse design of nanophotonic devices. During the optimization process, the structure is updated iteratively while minimizing the user-defined figure of merit (FoM), and the positions and velocities of particles in the algorithm are defined as follows:

(1) $x_t = x_{t-1} + v_t,$

(2) $v_t = \omega v_{t-1} + c_1 \eta_1 \left(p_{\mathrm{best},t-1} - x_{t-1}\right) + c_2 \eta_2 \left(g_{\mathrm{best},t-1} - x_{t-1}\right),$

where $x_t$ and $v_t$ are the position and velocity of the particle at time $t$, $c_1$ and $c_2$ are cognitive and social constants, $\eta_1$ and $\eta_2$ are random coefficients, $p_{\mathrm{best}}$ and $g_{\mathrm{best}}$ represent the personal and global best, and $\omega$ is the inertia weight. PSO is easy to implement and capable of finding optimal solutions in high-dimensional spaces without requiring additional optimization methods (e.g. local search algorithms). Moreover, it provides a higher possibility of global optimization compared to other algorithms as the problem dimensionality increases.
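The position and velocity updates of Eqs. (1) and (2) can be sketched directly in code. The values of the inertia weight and of the cognitive and social constants, the `sphere` objective, and the search bounds are assumptions chosen for illustration. As a small simplification, the global best is refreshed as soon as any particle improves rather than once per iteration, and positions are not clipped to the bounds.

```python
import random


def particle_swarm(fom, bounds, n_particles=20, iterations=100,
                   omega=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize fom(x) with the updates of Eqs. (1) and (2)."""
    rng = random.Random(seed)
    dim = len(bounds)
    x = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    v = [[0.0] * dim for _ in range(n_particles)]
    p_best = [list(p) for p in x]                  # personal best positions
    p_cost = [fom(p) for p in x]
    g = min(range(n_particles), key=p_cost.__getitem__)
    g_best, g_cost = list(p_best[g]), p_cost[g]    # global best position
    for _ in range(iterations):
        for i in range(n_particles):
            for d in range(dim):
                eta1, eta2 = rng.random(), rng.random()
                # Eq. (2): inertia + cognitive pull + social pull
                v[i][d] = (omega * v[i][d]
                           + c1 * eta1 * (p_best[i][d] - x[i][d])
                           + c2 * eta2 * (g_best[d] - x[i][d]))
                # Eq. (1): move the particle
                x[i][d] += v[i][d]
            cost = fom(x[i])
            if cost < p_cost[i]:
                p_best[i], p_cost[i] = list(x[i]), cost
                if cost < g_cost:
                    g_best, g_cost = list(x[i]), cost
    return g_best, g_cost


def sphere(x):
    # Hypothetical FoM to be minimized; a real one would wrap a simulation.
    return sum(val * val for val in x)


g_best, g_cost = particle_swarm(sphere, [(-1.0, 1.0)] * 2)
```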

Another swarm-based algorithm, ant colony optimization (ACO), is inspired by the foraging behaviour of ants in ant colonies [33]. Ants find the shortest paths to food sources by depositing pheromone trails behind them, which guides other ants to follow these paths, thereby forming efficient routes over time. By leveraging this collective behaviour, ACO effectively solves complex optimization problems. In nanophotonics, ACO can also be applied to optimize and inverse design devices. During the optimization process, an artificial ant moves from node $i$ to node $j$ with the probability $P_{ij}$ defined as follows:

(3) $P_{ij} = \dfrac{\tau_{ij}^{\alpha}\,\eta_{ij}^{\beta}}{\sum \tau_{ij}^{\alpha}\,\eta_{ij}^{\beta}},$

where $\tau_{ij}$ is the total amount of pheromone deposited on the edge $i$–$j$, $\eta_{ij}$ is the visibility, and $\alpha$ and $\beta$ are the relative coefficients of the pheromone trail versus visibility. The total amount of pheromone $\tau_{ij}$ is updated with the following equation:

(4) $\tau_{ij} = (1 - \rho)\,\tau_{ij} + \Delta\tau_{ij},$

where $\rho$ is the pheromone evaporation coefficient within the range [0, 1], and $\Delta\tau_{ij}$ is the amount of pheromone deposited by an ant.
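As a worked example of Eqs. (3) and (4), the sketch below computes the transition probabilities from one node to three candidate nodes and applies one pheromone update. The pheromone and visibility values, the exponents, and the evaporation coefficient are arbitrary numbers chosen for illustration, and the sum in Eq. (3) is taken over the candidate nodes reachable from the current node.

```python
def transition_probabilities(tau, eta, alpha=1.0, beta=2.0):
    """Eq. (3): probability of moving to each candidate node j from node i."""
    weights = [(t ** alpha) * (h ** beta) for t, h in zip(tau, eta)]
    total = sum(weights)                   # normalization over candidate nodes
    return [w / total for w in weights]


def update_pheromone(tau_ij, delta_tau_ij, rho=0.5):
    """Eq. (4): evaporate a fraction rho, then add the newly deposited amount."""
    return (1.0 - rho) * tau_ij + delta_tau_ij


# Pheromone and visibility on the three edges leaving node i (illustrative values).
tau_i = [1.0, 1.0, 2.0]
eta_i = [1.0, 2.0, 1.0]

probs = transition_probabilities(tau_i, eta_i)       # weights 1, 4, 2 -> 1/7, 4/7, 2/7
new_tau = update_pheromone(tau_ij=2.0, delta_tau_ij=0.3)  # 0.5 * 2.0 + 0.3 = 1.3
```

With $\beta = 2$ the second edge wins despite carrying less pheromone than the third, which shows how $\alpha$ and $\beta$ trade accumulated experience against local visibility.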

Several inverse-designed nanophotonic devices leveraging swarm-based algorithms are shown in Figure 3. These recent results emphasize the potential of these methods for designing various nanophotonic devices, particularly for tasks involving relatively continuous design parameters.

Figure 3:

Representative examples of inverse-designed nanophotonic devices enabled by swarm-based algorithms. (A) Schematic of searching mechanism in particle swarm optimization. (B) Experimental characterization of inverse-designed freeform waveguides enabled by particle swarm optimization. (C) Inverse-designed silicon polarizer beam splitter using particle swarm optimization. (D) Inverse-designed omnidirectional antireflection coatings using ant colony optimization method. (B) is reprinted from Ref. [34], with permission (CC BY 4.0); (C) is reprinted from Ref. [35], under the terms of the Open Access Publishing Agreement; (D) is reprinted from Ref. [36], under the terms of the Open Access Publishing Agreement.

2.2 Adjoint methods

In sharp contrast to the aforementioned heuristic optimization techniques commonly used in photonics, a more direct and intuitive approach to finding the optimal point is to define an objective quantity as a function of optical structures (i.e. geometry and material properties) that is to be minimized. Then, by deriving the gradient of the objective function with respect to the structure, we can move toward the optimal point with the desired optical response along the steepest descent direction.

The challenge, however, lies in the vast number of parameters that constitute the design space, making it computationally expensive to calculate each element of the gradient one by one. A brute-force finite-difference estimate requires at least N + 1 full-wave simulations for an N-dimensional parameter vector to approximate the first-order derivatives of the objective function: solving the N linear equations $y_n - y_0 = \nabla L(x_0) \cdot (x_n - x_0)$ for n = 1, …, N yields the gradient $\nabla L(x_0)$ at a point $x_0$, where $y_n = L(x_n)$ is the objective function evaluated at N additional points in the vicinity of $x_0$. Adjoint-sensitivity optimization, on the other hand, offers an elegant solution to this complexity by leveraging the chain rule in the mathematical expression of the gradient, reducing the required computations to just two simulations: one forward and one adjoint (backward).

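Following the description in Ref. [37], a simplified optimization problem is considered: $\phi_{\mathrm{opt}} = \arg\min L(e, e^*; \phi)$ such that $f(e, e^*, \phi) = 0$, where $\phi$ is a parameter vector, $L$ is the objective function to be minimized, and $e$ and $e^*$ are the complex state vector and its corresponding complex conjugate, both implicitly functions of $\phi$. The function $f$ represents the multiple constraints (i.e. the vectorial governing equation) which typically have the same dimension as $e$. The expression for the total derivative of $L$ with respect to $\phi$ is given by

(5) $\dfrac{dL}{d\phi} = \dfrac{\partial L}{\partial \phi} + \dfrac{\partial L}{\partial e}\dfrac{\partial e}{\partial \phi} + \dfrac{\partial L}{\partial e^*}\dfrac{\partial e^*}{\partial \phi} = \partial_\phi L + \begin{bmatrix} \partial_e L & \partial_{e^*} L \end{bmatrix} \begin{bmatrix} \partial_\phi e \\ \partial_\phi e^* \end{bmatrix},$

where $\partial_\phi L$, $\partial_e L$ and $\partial_{e^*} L$ are straightforward to derive from the explicit dependence of $L$, while $\partial_\phi e$ and $\partial_\phi e^*$ are only accessible through solving the governing equation. Here, the chain rule is applied similarly to $f$ and $f^*$, which can be written in matrix form as:

(6) $\begin{bmatrix} \partial_\phi f \\ \partial_\phi f^* \end{bmatrix} = -\begin{bmatrix} \partial_e f & \partial_{e^*} f \\ \partial_e f^* & \partial_{e^*} f^* \end{bmatrix} \begin{bmatrix} \partial_\phi e \\ \partial_\phi e^* \end{bmatrix}.$

Substituting Eq. (6) into Eq. (5), the total derivative becomes:

(7) $\dfrac{dL}{d\phi} = \partial_\phi L - \begin{bmatrix} \partial_e L & \partial_{e^*} L \end{bmatrix} \begin{bmatrix} \partial_e f & \partial_{e^*} f \\ \partial_e f^* & \partial_{e^*} f^* \end{bmatrix}^{-1} \begin{bmatrix} \partial_\phi f \\ \partial_\phi f^* \end{bmatrix} = \partial_\phi L + 2\,\mathrm{Re}\left(e_{\mathrm{adj}}^{T} \cdot \partial_\phi f\right),$

where $e_{\mathrm{adj}}$ and $e_{\mathrm{adj}}^*$ define the adjoint fields, satisfying the adjoint simulation:

(8) $\begin{bmatrix} (\partial_e f)^{T} & (\partial_e f^*)^{T} \\ (\partial_{e^*} f)^{T} & (\partial_{e^*} f^*)^{T} \end{bmatrix} \begin{bmatrix} e_{\mathrm{adj}} \\ e_{\mathrm{adj}}^* \end{bmatrix} = -\begin{bmatrix} (\partial_e L)^{T} \\ (\partial_{e^*} L)^{T} \end{bmatrix},$

with adjoint sources $\partial_e L$ and $\partial_{e^*} L$ derivable from the forward simulation.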

In general photonics problems with isotropic, nonmagnetic and monochromatic assumptions, the wave equation is expressed as $[\nabla^2 + k_0^2 \varepsilon(r)]E(r) - j(r) = 0$, where $k_0 = \omega/c$ is the free-space wave number, $\varepsilon$ is the relative permittivity distribution, $E$ is the electric field, and $j$ represents the current density distribution acting as a forward source (as shown in Figure 4A). By setting $E(r)$ and $\varepsilon(r_0)$ as $e$ and the scalar $\phi$ in Eqs. (7) and (8), $\partial_e f$ corresponds to the operator inside the bracket applied to $E(r)$, which happens to be a symmetric operator. Consequently, $(\partial_e f)^T = \partial_e f$, meaning that solving Eq. (8) is equivalent to solving the standard wave equation with an adjoint source $j_{\mathrm{adj}} = -(\partial_e L)^T$ (Figure 4B). The resulting electric field $E_{\mathrm{adj}}(r)$ gives the derivative with respect to this single parameter when multiplied by $\partial_\phi f := \delta(r - r_0)E_{\mathrm{fw}}(r)$. This approach can be easily extended to a multi-dimensional vector $\phi$, still requiring only two (forward and adjoint) simulations. It is also worth noting that this formulation can be readily modified to handle the generalized wave equation, including time-dependence [38], [39], nonlinearity [37], [40], and/or nonreciprocity [41].
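The two-simulation recipe can be checked numerically on a small real-valued toy problem. The sketch below is an assumption-laden illustration, not a photonic solver: the governing equation is f = A(ϕ)e − b = 0 with A a symmetric tridiagonal matrix (a 1D discrete Laplacian plus diag(ϕ), loosely mimicking the bracketed wave operator), the objective is L = Σ(e_i − t_i)², and `SOURCE`, `TARGET`, and `phi0` are made-up values. Because everything is real, the conjugate terms drop out and the gradient reduces to the element-wise product of adjoint and forward solutions, without the factor 2 Re of Eq. (7).

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [A[i][:] + [b[i]] for i in range(n)]       # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):                 # back substitution
        tail = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def operator(phi):
    """Toy symmetric operator: discrete Laplacian plus diag(phi)."""
    n = len(phi)
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        A[i][i] = -2.0 + phi[i]
        if i > 0:
            A[i][i - 1] = 1.0
        if i < n - 1:
            A[i][i + 1] = 1.0
    return A


SOURCE = [1.0, 0.0, 0.0, 0.0, 0.0]                 # forward source b (hypothetical)
TARGET = [0.0, 0.0, 0.0, 0.0, 0.2]                 # desired response t (hypothetical)


def objective(phi):
    e = solve(operator(phi), SOURCE)
    return sum((ei - ti) ** 2 for ei, ti in zip(e, TARGET))


def adjoint_gradient(phi):
    A = operator(phi)
    e = solve(A, SOURCE)                           # forward simulation
    adj_source = [-2.0 * (ei - ti) for ei, ti in zip(e, TARGET)]  # -(dL/de)^T
    e_adj = solve(A, adj_source)                   # adjoint simulation (A^T = A)
    # d f_i / d phi_i = e_i, so dL/dphi_i = e_adj_i * e_i
    return [ea * ef for ea, ef in zip(e_adj, e)]


def finite_difference_gradient(phi, h=1e-6):
    base = objective(phi)                          # 1 + N solves in total
    grad = []
    for i in range(len(phi)):
        shifted = list(phi)
        shifted[i] += h
        grad.append((objective(shifted) - base) / h)
    return grad


phi0 = [5.0, 5.5, 6.0, 5.2, 4.8]
g_adj = adjoint_gradient(phi0)                     # two solves
g_fd = finite_difference_gradient(phi0)            # six solves for N = 5
```

The adjoint gradient uses two linear solves regardless of the number of parameters, whereas the finite-difference estimate needs N + 1; the two results agree to within the finite-difference truncation error.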

Figure 4:

Concept of adjoint-sensitivity optimization. (A) Forward simulation involves calculating the objective function ($L$) by comparing the simulated field ($E_{\mathrm{fw}}$) with the target response. (B) The adjoint simulation uses a backward source derived from $L$ to generate the adjoint field ($E_{\mathrm{adj}}$). The gradient of $L$ with respect to the permittivity distribution $\varepsilon(r)$ within the design space is then derived as the product of forward and adjoint fields. (C–F) Optical gradient backpropagation through the adjoint method in analogy with that of a deep neural network. Green arrows, forward input; blue area and rectangles, design region; orange arrows, adjoint sources; purple arrows, forward and time-reversed adjoint sources combined so that the two fields interfere. Panels (C–F) are adapted with permission from Ref. [42].

Interestingly, the process of gradient calculation via adjoint simulation directly parallels the concept of “backpropagation” in deep neural networks, both in their chain-rule-based formulation and in the forward-then-backward flow of information. Based on this tight analogy between two concepts from different domains, Hughes et al. [42] demonstrated the in-situ implementation of backpropagation in an optical neural network as shown in Figure 4C–F. Figure 4C displays a waveguide mesh structure based on Mach–Zehnder interferometers (MZIs) with phase shifters (blue area) [43], capable of performing an arbitrary 3-by-3 unitary operation between input (|3〉) and output (|2〉) vectors (green arrows). To obtain gradient information within the blue areas for the target operation |2〉〈3|, one needs to (1) inject the forward field as shown in Figure 4D and store the intensity $I_{\mathrm{fw}} = |E_{\mathrm{fw}}|^2$ within the blue areas, (2) inject the adjoint field as shown in Figure 4E and store the intensity $I_{\mathrm{adj}} = |E_{\mathrm{adj}}|^2$, and (3) inject the forward and time-reversed adjoint fields together at the front end as shown in Figure 4F and record the interference pattern, $I_{\mathrm{interf}} = |E_{\mathrm{fw}} + E_{\mathrm{adj}}^*|^2 = I_{\mathrm{fw}} + I_{\mathrm{adj}} + 2\,\mathrm{Re}(E_{\mathrm{adj}} E_{\mathrm{fw}})$. Through this procedure, the gradient term $\mathrm{Re}(E_{\mathrm{adj}} E_{\mathrm{fw}})$ can be obtained in-situ as $(I_{\mathrm{interf}} - I_{\mathrm{fw}} - I_{\mathrm{adj}})/2$. This example bridges the adjoint method and deep neural networks, enabling not only inference but also in-situ training of optical neural networks in exactly the same manner as traditional deep neural networks.
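The three-measurement procedure can be verified with a short numerical check at a single point of the design region. Expanding the square of the interference intensity gives a cross term of 2 Re(E_adj E_fw), so the sketch halves the intensity difference to recover the gradient term. The complex field amplitudes are arbitrary illustrative values, not data from Ref. [42].

```python
def gradient_term_from_intensities(E_fw, E_adj):
    """Recover Re(E_adj * E_fw) from three intensity-only measurements."""
    I_fw = abs(E_fw) ** 2                          # step 1: forward field alone
    I_adj = abs(E_adj) ** 2                        # step 2: adjoint field alone
    I_interf = abs(E_fw + E_adj.conjugate()) ** 2  # step 3: forward + time-reversed adjoint
    return 0.5 * (I_interf - I_fw - I_adj)


# Arbitrary complex amplitudes at one phase shifter (illustrative only).
E_fw = complex(0.8, -0.3)
E_adj = complex(-0.2, 0.5)

measured = gradient_term_from_intensities(E_fw, E_adj)
direct = (E_adj * E_fw).real                       # what a field-resolving probe would give
```

Only intensities are needed, which is what makes the scheme practical on chip, where photodetectors measure power rather than complex field amplitudes.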

Among its various applications, this shift towards intelligence-driven optimization techniques has significantly advanced photonic devices as a computing platform, particularly in large-scale, reconfigurable, and neuromorphic applications. Here, we present a few examples where the adjoint method was utilized in the design of optical computing platforms. First, Figure 5A showcases the optical matrix-vector multiplication [44], as described in Ref. [42], but using a free-form Si core with binary thicknesses (220 and 150 nm) immersed in an SiO₂ cladding, operating at 1,525 nm wavelength. Due to the large footprint of the device, approximately 30 μm, the 2D effective index approximation technique was employed in the design of the 3D planar structure to reduce the computational cost. This combined approach, utilizing the adjoint-sensitivity method and theoretical effective medium approximation, significantly enhances scalability, which is crucial for practical use as a linear layer in deep neural networks.

Figure 5:

Optical computers designed with the adjoint method. (A) Silicon photonics platform for matrix-vector multiplication with a very high dimensionality (10 × 10). (B) Lithography-free trainable processor for vowel recognition. (C) Neuromorphic continuous medium including nonlinearity for image recognition. (A) is reproduced from Ref. [44], with permission from SNCSC; (B) is reproduced from Ref. [45], with permission from SNCSC; (C) is reprinted from Ref. [40], with permission from Photonics Research.

Next, Figure 5B demonstrates the in-situ training of an optical gain medium for vowel recognition [45]. Unlike the phase shifters used in the MZI structure [42], this approach utilizes a spatially modulated pump beam to control the gain parameters across the design area, enabling targeted neuromorphic functionality. Figure 5C further demonstrates a nanophotonic medium that incorporates a nonlinear component, functioning as a nonlinear activation function in a deep neural network. This configuration is used for recognizing handwritten images, routing the flow of light into different paths based on the visual information contained in the input light.

2.3 AI-enabled design approaches

As discussed in the previous sections, optimization algorithms have been well established over the past few decades, resulting in superior designs and performance. However, these algorithms are often constrained by high computational costs, which can render the design process inefficient. Artificial intelligence (AI) can map structural information directly to its corresponding optical responses, potentially replacing time-consuming electromagnetic simulations. The simulation cost is paid only once, at training time: electromagnetic simulations are run extensively to cover a sufficiently large portion of the parameter space for a specific type of device with a standardized parameter space. Once trained, the model can predict optical responses on the order of microseconds and can be applied to a range of related design problems. Over the last decade, numerous efforts have been made to implement this novel technique to design nanophotonic devices [46], [47]. There are several ways to inverse-design nanophotonic devices with AI; in this section, these methods are classified into three categories: discriminative models, generative models, and reinforcement learning.

2.3.1 Inverse design with discriminative models

Discriminative models have been extensively applied across various fields, including computer vision, image processing, and natural language processing, where the primary goal is to learn a direct relationship between input data and output labels. In nanophotonic design, discriminative models, such as fully connected neural networks (FCNs) and convolutional neural networks (CNNs), play a critical role in both forward and inverse design tasks. As described in Figure 6, these models allow for the mapping of structure parameters, such as the geometric details of nanostructures, to their corresponding optical responses, such as reflection or transmission spectra, or inversely, to predict the structural parameters required to achieve a desired optical outcome.

Figure 6:

Representative examples of inverse-designed nanophotonic devices enabled by discriminative models. (A) Schematic of the inverse-designed 1 × 2 wavelength demultiplexer (left) and experimental verification (right). (B) Schematic of the all-optical nonlinear plasmonic ring resonator switch and the structure of the inverse design network (left), and training results (right). (C) Inverse design configuration of nanophotonic nanohole arrays (left), and experimental verification (right). (A) is reprinted from Ref. [48], under the terms of the Open Access Publishing Agreement; (B) is reprinted from Ref. [49], with permission (CC BY 4.0); (C) is reprinted from Ref. [50], with permission. Copyright 2024 Royal Society of Chemistry.

FCNs, one of the simplest forms of discriminative models, consist of layers of interconnected neurons where each neuron is connected to every neuron in adjacent layers. FCNs are particularly useful for capturing relationships between device structural parameters, such as material properties, thickness, or periodic patterns, and their corresponding optical responses. For example, an FCN can predict the spectral response of a nanophotonic device based on a given set of design inputs, as demonstrated in Ref. [24]. While FCNs are effective for simpler design tasks involving basic geometries, they often encounter difficulties when handling high-dimensional data and more complex design problems, because the number of parameters grows rapidly with the dimensionality of the input data. This results in the need for substantial training data and prolonged training periods, thereby restricting their effectiveness for more sophisticated designs.

To address these limitations, CNNs are often employed for more complex inverse design problems in nanophotonics. Unlike FCNs, CNNs are designed to handle high-dimensional data such as images or complex patterns, making them well-suited for processing intricate geometric features in nanophotonic design [21]. Through convolutional layers, CNNs can extract local features from the input data, enabling the model to capture spatial hierarchies and relationships within the structure. This is particularly powerful in nanophotonic design, where a device’s optical properties are highly sensitive to its detailed geometric configuration.
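The difference in scaling between the two architectures can be made concrete by counting trainable parameters. The following is a minimal sketch, not taken from any of the cited works; the 64 × 64 pixel input, the hidden width of 512, and the 32 filters of size 3 × 3 are hypothetical values chosen only for illustration.

```python
def fcn_param_count(layer_sizes):
    """Weights plus biases of a fully connected network with the given layer widths."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))


def conv2d_param_count(in_channels, out_channels, kernel_size):
    """Weights plus biases of one 2D convolutional layer (independent of image size)."""
    return in_channels * out_channels * kernel_size * kernel_size + out_channels


# Hypothetical example: a 64 x 64 pixel design pattern as the network input.
pixels = 64 * 64
fcn_first_layer = fcn_param_count([pixels, 512])   # dense layer on the flattened image
conv_first_layer = conv2d_param_count(1, 32, 3)    # 32 filters of size 3 x 3

print(fcn_first_layer)    # 2097664
print(conv_first_layer)   # 320
```

Because a convolutional filter is shared across all positions of the pattern, its parameter count does not depend on the pattern resolution, whereas the first dense layer grows in proportion to the number of pixels.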

Despite these advantages, discriminative models in inverse design face challenges, most notably the “one-to-many mapping” problem. This issue arises when different structural configurations produce the same optical response, complicating the task of predicting a unique set of design parameters. To address this, advanced techniques such as tandem networks and mixture density networks (MDNs) have been proposed. Tandem networks involve training a forward model to map design parameters to optical responses and then using this pre-trained model to guide an inverse network in predicting the specific structural parameters that match a given optical outcome. This approach enables one-to-one mapping, thereby enhancing the accuracy and reliability of the inverse design process. MDNs offer another solution by employing a probabilistic approach to capture the non-uniqueness problem in inverse design. Instead of providing a single predicted outcome, MDNs generate a distribution of possible design solutions, allowing the exploration of multiple configurations that achieve the desired optical outcomes. This is particularly advantageous for exploring complex design spaces where numerous valid solutions exist.
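The non-uniqueness issue and the idea behind the tandem objective can be illustrated with a one-parameter toy problem. In the sketch below, the quadratic `forward` function is a hypothetical stand-in for a trained forward network, and plain gradient descent on the design variable stands in for training the inverse network through that fixed forward model; neither is taken from the cited works.

```python
import math


def forward(x):
    """Toy 'simulator': two different designs x give the same response y."""
    return (x - 0.5) ** 2


target = 0.04                          # desired response
designs = [0.5 - math.sqrt(target),    # first valid design, x = 0.3
           0.5 + math.sqrt(target)]    # second valid design, x = 0.7

# A direct inverse model trained with a mean-squared error on x is pulled
# toward the average of all valid designs, which is not itself a valid design.
direct_guess = sum(designs) / len(designs)
direct_error = abs(forward(direct_guess) - target)

# Tandem-style objective: penalize the mismatch in the response, not in x.
x = 0.6                                # arbitrary starting design
for _ in range(2000):
    grad = 2.0 * (forward(x) - target) * 2.0 * (x - 0.5)
    x -= 0.5 * grad                    # gradient step on (forward(x) - target)^2
tandem_error = abs(forward(x) - target)

print(direct_guess, direct_error)
print(x, tandem_error)
```

In this toy case the response-based loss settles on one of the two valid designs, while averaging the valid designs yields a structure whose response misses the target. An MDN would instead represent both branches as separate modes of a predicted distribution.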

Combined with advanced techniques such as tandem networks or MDNs, discriminative models offer rapid and precise predictions, significantly reducing computational time and resources compared with optimization algorithm-based inverse design. However, they are constrained by their reliance on existing data, which limits their ability to explore new design spaces. This limitation has led to growing demand for solutions that can uncover new design possibilities and explore more sophisticated and unexplored areas, a challenge that generative models are particularly well-equipped to address.

2.3.2 Inverse design with generative models

Generative models have recently achieved significant breakthroughs across various domains such as image processing, large language models (LLMs), natural language processing (NLP), computer vision (CV), and the automotive industry. Notable examples that have implemented generative models include DALL-E 2 and ChatGPT, both of which have demonstrated superior performance. This growing interest has accelerated further advancements in generative models, encompassing techniques such as variational autoencoders, generative adversarial networks, diffusion models, and transformers. Following their successful applications in image, video, and language processing, researchers are now expanding the use of generative models to additional fields, including photonics, material science, healthcare, protein folding, environmental science, and beyond.

Variational autoencoders (VAEs) are well-established generative models that employ an encoder structure to extract a latent space capturing meaningful information [51]. Outputs are then generated using the decoder structure that reconstructs data from this latent space. During training, a VAE updates the parameters of both the encoder and decoder, enabling it to generate new data. To enhance performance, variations of the VAE, such as the conditional VAE (CVAE), are widely applied.

Generative adversarial networks (GANs) are also well-known generative models [52]. Unlike VAEs, GANs consist of two distinct networks: the generator network and the discriminator network. The purpose of the generator network is to create a synthetic distribution that closely resembles the real distribution. Conversely, the purpose of the discriminator network is to differentiate between the generated and real distributions. During training, these two networks compete with each other until, ideally, the probability that the discriminator assigns to a sample being real approaches 0.5. This outcome indicates that the discriminator can no longer effectively differentiate between generated and real data. Several variants of the GAN have also been developed, such as the deep convolutional GAN (DCGAN) and the conditional DCGAN (cDCGAN).
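The value of 0.5 follows from the standard minimax formulation of GAN training. The expressions below use notation that does not appear in the text above and is introduced here only for illustration: D is the discriminator, G the generator, p_data the real data distribution, p_z the latent noise distribution, and p_g the distribution of generated samples.

```latex
\min_{G}\,\max_{D}\; V(D,G)
  = \mathbb{E}_{x\sim p_{\mathrm{data}}}\!\left[\log D(x)\right]
  + \mathbb{E}_{z\sim p_{z}}\!\left[\log\bigl(1 - D(G(z))\bigr)\right],
\qquad
D^{*}(x) = \frac{p_{\mathrm{data}}(x)}{p_{\mathrm{data}}(x) + p_{g}(x)} .
```

For a fixed generator, D* is the optimal discriminator; when the generated distribution matches the real one (p_g = p_data), it evaluates to 1/2 for every sample.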

The diffusion model is a recently emerged generative network, which can outperform VAEs and GANs [53]. It first gradually adds noise to the original image, transforming its complex data distribution into a simple noise distribution, a process called diffusion. The network is then trained to reverse this process by denoising the sample step-by-step, moving backward through the time steps. During the training process, the network learns to predict the noise added at each step by minimizing the difference between the true noise and the predicted noise, which enables it to generate new samples starting from pure noise.
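The forward (noising) half of this procedure can be sketched in a few lines. The example below assumes a linear variance schedule with 1,000 steps between 1e-4 and 0.02, values commonly used in the denoising diffusion literature but not specified in this review, and it applies the noise to a hypothetical four-pixel design pattern. The denoising network itself is omitted.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Noise variances beta_t, increasing linearly over the time steps (assumed schedule)."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t = product of (1 - beta_s) for s <= t: signal variance left at step t."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def noisy_sample(x0, alpha_bar, rng):
    """Closed-form forward step: x_t = sqrt(alpha_bar) * x0 + sqrt(1 - alpha_bar) * eps."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    x_t = [math.sqrt(alpha_bar) * v + math.sqrt(1.0 - alpha_bar) * e
           for v, e in zip(x0, eps)]
    return x_t, eps  # eps is the regression target for the denoising network


rng = random.Random(0)
alpha_bars = cumulative_alphas(linear_beta_schedule(1000))
x0 = [1.0, -1.0, 1.0, 1.0]              # hypothetical design pattern
x_mid, eps_mid = noisy_sample(x0, alpha_bars[499], rng)
print(alpha_bars[0], alpha_bars[-1])    # close to 1 at the start, close to 0 at the end
```

The cumulative factor decreases monotonically, so the pattern is progressively replaced by Gaussian noise. Training then amounts to regressing the returned noise from the noisy sample and the time step.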

In the field of nanophotonics, generative models are utilized to generate new structures that are not present in the original dataset but successfully reproduce the optical properties of interest (see Figure 7). These models can replace the highly time-consuming design process once a sufficient number of designs are available.

Figure 7:

Representative examples of inverse-designed nanophotonic devices enabled by generative models. (A) Twelve examples of inverse-designed unit cells in metasurfaces and their optical characteristics suggested by cDCGAN. (B) Schematic of the inverse-designed 1 × 2 photonic power splitter by leveraging CVAE (upper) and simulation results with arbitrary splitting ratios (lower). (C) Schematic of the diffusion model-enabled inverse design process. High-degree-of-freedom metasurfaces can be inverse designed, successfully mimicking the original shape. (D) Schematic of the inverse-designed nanophotonic device with deep generative models (left) and SEM image of the fabricated device with experimental result (right). (A) is reprinted from Ref. [54], with permission (CC BY 4.0); (B) is reprinted from Ref. [55], with permission. Copyright 2024 John Wiley and Sons; (C) is reprinted from Ref. [56], with permission (CC BY 4.0); (D) is reprinted from Ref. [57], with permission. Copyright 2024 John Wiley and Sons.

2.3.3 Inverse design with reinforcement learning

Reinforcement learning (RL) is a different type of AI technique, in which an agent learns to make decisions by receiving feedback from its environment [58]. Following the success of RL in several tasks including Go, chess, and video games, it has received significant attention across various fields [59], [60], [61], [62]. By leveraging RL to explore complex design spaces, a wide variety of nanophotonic devices can be optimized, including waveguides, metasurfaces, gratings, and beyond. During the training process, the agent optimizes the structure by evaluating optical properties such as efficiency, bandwidth, and wavelength selectivity.

Implementing RL in nanophotonics involves several key steps. First, the environment in which the RL agent will be trained is created. Its state can include design parameters of the device itself (e.g. period, radius, width, height) or optical properties (e.g. refractive index distribution, polarization, phase). After defining the environment, the reward is defined. Depending on the target task, the designer specifies how much reward the agent receives for each action, which in turn guides its choice of the next action. The reward may be based on performance metrics related to optical characteristics, such as maximizing efficiency, enlarging bandwidth, or maximizing transmission. Note that these optical characteristics are usually obtained through numerical calculations such as the finite-difference time-domain (FDTD) method, the finite element method (FEM), and others. Next, the agent is defined, and it begins taking actions to update the environment. Finally, the agent is trained. Starting from the initial state of the nanophotonic structure, the reward is numerically calculated and the agent takes actions to update the structural information. RL repeats this process to inversely design nanophotonic devices, as shown in Figure 8. Several methods can be utilized for RL, including deep Q-networks (DQN), double DQN (DDQN), and others.
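The steps above can be sketched with tabular Q-learning, a simpler relative of the DQN-type methods named in the text. Everything in this example is a hypothetical illustration: the single discretized design parameter, the quadratic `figure_of_merit` that stands in for an FDTD or FEM simulation, and the hyperparameter values.

```python
import random

N_STATES = 11            # hypothetical discretized values of one design parameter
BEST_STATE = 7           # location of the toy optimum, unknown to the agent
ACTIONS = (-1, +1)       # decrease or increase the parameter by one grid step


def figure_of_merit(state):
    """Toy stand-in for an electromagnetic simulation (e.g. FDTD or FEM)."""
    return -float((state - BEST_STATE) ** 2)


def step(state, action):
    """Environment: apply the action, return the new state and the reward."""
    new_state = min(max(state + action, 0), N_STATES - 1)
    reward = figure_of_merit(new_state) - figure_of_merit(state)
    return new_state, reward


def train(episodes=2000, steps=20, lr=0.5, gamma=0.9, epsilon=0.3, seed=0):
    """Epsilon-greedy tabular Q-learning over random starting structures."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state = rng.randrange(N_STATES)
        for _ in range(steps):
            if rng.random() < epsilon:
                a = rng.randrange(2)                        # explore
            else:
                a = 0 if q[state][0] > q[state][1] else 1   # exploit
            new_state, reward = step(state, ACTIONS[a])
            target = reward + gamma * max(q[new_state])
            q[state][a] += lr * (target - q[state][a])
            state = new_state
    return q


def greedy_design(q, start=0, steps=20):
    """Follow the learned policy and return the best state visited."""
    state, best = start, start
    for _ in range(steps):
        a = 0 if q[state][0] > q[state][1] else 1
        state, _reward = step(state, ACTIONS[a])
        if figure_of_merit(state) > figure_of_merit(best):
            best = state
    return best


q_table = train()
print(greedy_design(q_table))
```

Here the reward is the change in the figure of merit produced by each action, so the agent is rewarded for moving the structure toward better performance. In a realistic setting each call to `figure_of_merit` would be a full simulation, which is why the number of training episodes dominates the cost of RL-based design.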

Figure 8:

Representative examples of inverse-designed nanophotonic devices enabled by reinforcement learning. (A) An overview of physics-informed RL, consisting of the pre-training stage and the RL optimization stage. (B) The schematic of DDQN-based design of metasurfaces and simulation results of high-quality metasurface holograms. (C) The schematic of the parameterized grating coupler inverse-designed with RL, microscopic image of fabricated devices, and experimental verifications. (D) Inverse design of metagrating structure leveraging RL combined with the supervised learning method. (A) is reprinted from Ref. [63], with permission (CC BY 4.0); (B) is reprinted from Ref. [64], with permission (CC BY 4.0); (C) is reprinted from Ref. [65], with permission (CC BY 4.0); (D) is reprinted from Ref. [66], under the terms of the Open Access Publishing Agreement.

3 Inverse-designed nanophotonic devices: selected examples

Inverse design has revolutionized the field of nanophotonics, facilitating the optimization of photonic devices through the metaheuristic algorithms, adjoint methods, and AI-driven designs discussed in the previous chapter. This approach has led to the creation of next-generation devices, including power splitters, wavelength (de)multiplexers, grating couplers, and waveguide devices. Metasurfaces have also seen significant improvements, achieving new performance levels in light control. Beyond nanophotonics, inverse design is also expanding into material science and mechanical engineering, proving its potential across diverse domains. This chapter explores these advancements and highlights how inverse design is driving the evolution of photonics and other areas of research.

3.1 Nanophotonic power splitters

A variety of inverse-designed nanophotonic power splitters have been reported, employing methods ranging from conventional heuristic algorithms such as PSO and DBS to advanced topology optimization (TO), as shown in Figure 9. Many of these have been experimentally verified, and some studies have explored the potential for designing splitters using deep learning techniques. By applying inverse design methods, nanophotonic power splitters can be engineered with significantly smaller footprints than conventional structures, such as those based on directional couplers (DCs) [67] or multimode interference (MMI) [68]. These inverse-designed splitters can distribute light into multiple output ports, such as 1 × 3, 1 × 4, or 1 × 8, rather than being limited to 1 × 2, while providing the desired distribution ratios and minimal optical loss over a broad wavelength range.

Figure 9:

Representative examples of inverse-designed nanophotonic power splitters. (A) Inverse-designed 1 × 4 nanophotonic power splitter using PSO. The fabricated device’s scanning electron microscopy (SEM) image (upper left) and measured transmission spectrum of the 1 × 4 power splitter (upper right). Comparison between the simulated and measured spectra for the insertion loss (lower left) and comparison between the simulated spectra and measured spectra for the uniformity loss (lower right). (B) Inverse-designed 1 × 3 nanophotonic power splitter with TO. SEM image of the fabricated device (upper left) and simulated field propagation showing light distribution into three ports (upper right). Simulated spectra of the transmission for each port (lower left) and measured spectra of the transmission for each port (lower right). (C) Inverse-designed 1 × 2 nanophotonic power splitter with TO. SEM image of the fabricated device (left) and comparison of the simulated and measured transmission spectra (right). (A) is reprinted from Ref. [69], with permission (CC BY 4.0); (B) is reprinted from Ref. [70], with permission (CC BY 4.0); (C) is reprinted from Ref. [71], with permission (CC BY 4.0).

For instance, Kim et al. detailed the optimization of a 1 × 4 splitter using the PSO algorithm. Two different devices were fabricated, each composed of 40 rectangular shapes with varying widths, designed to optimize the arrangement of 40 particles [69]. The devices were compact, measuring 6.0 × 7.2 μm² and 8.4 × 12 μm², respectively. The measurements showed a maximum insertion loss of 0.76 dB and a uniformity of less than 0.84 dB for one device, and 1.08 dB and 0.81 dB for the other, closely matching the simulation results. Similarly, Xie et al. described the use of the DBS algorithm to simulate and experimentally validate a 1 × 4 power splitter capable of splitting in ratios such as 1:1:1:1, 2:2:1:1, 2:2:2:1, and 4:3:2:1 in the 2 μm wavelength band, achieving a compact size of 3.6 × 3.6 μm² [72].
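As a rough illustration of how PSO searches over a set of segment widths, the following sketch minimizes a toy loss over four widths. It is not the implementation used in Ref. [69]: the `toy_loss` function replaces the electromagnetic simulation, and the target widths, bounds, swarm size, and coefficients are hypothetical values.

```python
import random


def toy_loss(widths):
    """Stand-in for a simulated splitter loss; a real flow would call a solver here."""
    target = (0.8, 1.2, 1.6, 1.1)   # hypothetical optimum, in micrometers
    return sum((w - t) ** 2 for w, t in zip(widths, target))


def pso(loss, dim, bounds, n_particles=20, iters=200,
        inertia=0.7, c_personal=1.5, c_global=1.5, seed=0):
    """Basic global-best particle swarm optimization with box constraints."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    p_best = [p[:] for p in pos]
    p_best_loss = [loss(p) for p in pos]
    g_idx = min(range(n_particles), key=p_best_loss.__getitem__)
    g_best, g_best_loss = p_best[g_idx][:], p_best_loss[g_idx]

    for _ in range(iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity: inertia + pull toward personal best + pull toward swarm best.
                vel[i][d] = (inertia * vel[i][d]
                             + c_personal * r1 * (p_best[i][d] - pos[i][d])
                             + c_global * r2 * (g_best[d] - pos[i][d]))
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)
            current = loss(pos[i])
            if current < p_best_loss[i]:
                p_best[i], p_best_loss[i] = pos[i][:], current
                if current < g_best_loss:
                    g_best, g_best_loss = pos[i][:], current
    return g_best, g_best_loss


best_widths, best_loss = pso(toy_loss, dim=4, bounds=(0.5, 2.0))
print(best_widths, best_loss)
```

Each loss evaluation corresponds to one simulation of a candidate device, so the total cost scales with the number of particles multiplied by the number of iterations.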

Following these heuristic approaches, advanced TO methods were also applied to enhance the design and performance of power splitters. Piggott et al. demonstrated a 1 × 3 splitter using TO with curvature constraints to prevent small feature formations, setting a minimum curvature radius of 100 nm. The fabricated device showed an insertion loss of 0.642 ± 0.057 dB and power uniformity of 0.641 ± 0.054 dB across a 1,400–1,700 nm wavelength range [70]. Xu et al. applied digital TO, enhancing traditional methods by incorporating process-oriented constraints such as minimum feature size and edge smoothing [73]. This led to the successful simulation and optimization of a broadband 1 × 2 splitter that supports TE0/TE1 mode beam splitting over a wavelength bandwidth of 445 nm and a compact size of 5.4 × 2.88 μm². Hansen et al. further demonstrated the experimental validation of a TO-optimized 1 × 2 power splitter, achieving an impressively low excess loss of under 0.5 dB across a 245 nm wavelength range [71].

Table 1 presents additional results on power splitters using inverse design methods, as reported in various papers.

Table 1:

Summary of inverse-designed power splitters.

Ref	Design method	Structure	IL (dB) (sim./exp.)	UL (dB) (sim./exp.)	BW (nm) (sim./exp.)	Footprint (μm²)
[67]	Forward (DC)	1 × 2	–/1	–/0.7	100/88	31.4 × 1.3
[68]	Forward (MMI)	1 × 2	0.3/0.3	–/0.6	200/171	43 × 3.1
[71]	TO	1 × 2	0.1/0.1	–	325/245	2 × 3
[73]	TO	1 × 2	0.8/–	–	447/–	5.4 × 2.9
[70]	TO	1 × 3	0.8/0.6	0.5/0.6	300/300	3.8 × 2.5
[74]	DBS	1 × 3	1.9/–	N/A	100/–	77.2
[75]	DL	1 × 3	0.45/–	N/A	200/–	2.6 × 2.6
[72]	DBS	1 × 4	1/1.5	N/A	40/30	3.6 × 3.6
[76]	PSO	1 × 4	–/0.6	–/1.0	–/44	12.3 × 5
[77]	PSO	1 × 4	0.6/0.6	0.3/0.9	150/104	36 × 6
[77]	PSO	1 × 8	0.6/0.6	0.8/0.8	150/104	47.8 × 11.3

3.2 Wavelength (de)multiplexers

In conventional designs, wavelength (de)multiplexers have employed structures such as micro-rings [78], [79], subwavelength-grating (SWG)-based contra-directional coupler (contra-DC) filters [80], [81], cascaded Mach–Zehnder interferometer (MZI) structures [82], [83], or arrayed waveguide gratings (AWGs) [84], [85]. Recently, inverse design methods have been increasingly adopted for designing wavelength demultiplexers, as shown in Figure 10, primarily employing TO or DBS algorithms for pixelized design. These sophisticated approaches allow for enhanced performance and precise wavelength filtering within a compact device.

Figure 10:

Representative inverse-designed nanophotonic wavelength (de)multiplexers. (A) Inverse-designed 1 × 3 wavelength demultiplexer with TO. SEM image of the fabricated device (left), simulated field propagation results for the optimized structure, varying the wavelength from 1,500 nm to 1,580 nm in 40 nm increments (middle) and measured transmission spectra of the device for each port (right). (B) Inverse-designed 1 × 2 wavelength demultiplexer with TO. Whole scheme and SEM image of the fabricated device (left), simulated field propagation results for the optimized structure at wavelengths of 1,507 nm and 1,565 nm (middle) and measured transmission spectra for each port (right). (C) Inverse-designed 1 × 6 wavelength demultiplexer optimized with DBS algorithm and additional photonic crystal structures. SEM image of the fabricated device with a magnified view of the nanoholes and PhC structure (left), simulated field propagation results for the optimized structure, varying the wavelength from 1,500 nm to 1,600 nm in 20 nm increments (middle) and measured transmission spectra for each port (right). (A) is reprinted from Ref. [86], with permission. Copyright 2024 American Chemical Society; (B) is reprinted from Ref. [87], with permission (CC BY 4.0); (C) is reprinted from Ref. [88]. Copyright 2024 IEEE.

In 2015, Piggott et al. reported the design and fabrication of a 1 × 2 wavelength demultiplexer using TO, capable of splitting light from an input waveguide into two output waveguides at wavelengths of 1,300 nm and 1,550 nm [89]. The fabricated device achieved an insertion loss of less than 2.4 dB, crosstalk below −11 dB, and a 3-dB bandwidth exceeding 100 nm with notably compact dimensions of just 2.8 × 2.8 µm². Su et al. incorporated self-biasing techniques into the TO process to enhance the robustness of the device against fabrication errors [86]. A 1 × 3 wavelength demultiplexer was designed and fabricated, capable of demultiplexing wavelengths of 1,500 nm, 1,540 nm, and 1,580 nm. The simulated performance showed an insertion loss of less than 1.55 dB and crosstalk levels below −15 dB. The fabricated device, with a size of 5.5 × 4.5 µm², demonstrated an insertion loss of less than 2.29 dB and crosstalk levels of −10.7 dB, slightly inferior to the simulation results. Huang et al. employed a refined TO approach to address the minimum feature size and performance degradation challenges [87]. The resulting device, with dimensions of 2.4 × 10 µm², effectively demultiplexed wavelengths of 1,520 nm and 1,580 nm through a 10 µm-wide waveguide into two distinct ports. Experimental results confirmed insertion losses of 1.77 dB and 2.1 dB at the respective ports, with crosstalk levels of −25.17 dB and −12.14 dB in the 1 × 2 wavelength demultiplexer configuration.

Recent advancements have led to the development of structures using DBS algorithms with rectangular or nanohole-pixelized designs, which have increasingly become a leading choice for addressing the performance limitations of TO alone, particularly regarding insertion loss and crosstalk degradation post-fabrication. Recently, Deng et al. presented a hybrid analog-digital algorithm that begins with TO to determine the optimal structure, followed by an analog-to-digital conversion to create a pixelized structure, which is further refined using the DBS algorithm [90]. This method produced a device with a compact size of 3 × 3 µm² that efficiently demultiplexed wavelengths of 1,550 nm and 2,000 nm, achieving insertion losses under 1.2 dB and 0.9 dB, with crosstalk levels below −17.7 dB and −16.4 dB. The device also demonstrated successful PAM-8 signal transmission at 138 and 84 Gbps for the respective wavelengths. Additionally, Wu et al. described an advanced device featuring an inverse-designed metastructure for wavelength demultiplexing, combined with cascaded photonic crystal filters to reduce crosstalk [88]. The design, optimized as a 1 × 6 demultiplexer using DBS algorithms, incorporated rectangular units with nanoholes. To further minimize crosstalk between ports, nanohole-based photonic crystal filters were added. The overall device dimensions were 27 × 12 µm², with an insertion loss of 6 dB and crosstalk levels maintained below −20 dB. The results of these inverse-designed wavelength demultiplexer devices are summarized in Table 2.
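The pixel-flipping refinement at the core of DBS can be sketched as follows. This is a generic illustration rather than the algorithm of Ref. [88] or Ref. [90]: the 12-pixel pattern and the `toy_fom` function, which counts matches to a hypothetical target pattern, stand in for a pixelized device and its simulated figure of merit.

```python
def direct_binary_search(fom, pixels, max_sweeps=50):
    """Flip one pixel at a time; keep the flip only if the figure of merit improves."""
    pixels = list(pixels)                  # work on a copy of the starting pattern
    best = fom(pixels)
    for _ in range(max_sweeps):
        improved = False
        for i in range(len(pixels)):
            pixels[i] ^= 1                 # toggle pixel i (0 <-> 1)
            trial = fom(pixels)
            if trial > best:
                best, improved = trial, True
            else:
                pixels[i] ^= 1             # revert the flip
        if not improved:                   # a full sweep without gain: converged
            break
    return pixels, best


# Toy figure of merit: number of pixels matching a hypothetical target pattern.
TARGET = [1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 0, 1]


def toy_fom(pixels):
    return sum(int(p == t) for p, t in zip(pixels, TARGET))


start = [0] * len(TARGET)
pattern, score = direct_binary_search(toy_fom, start)
print(pattern, score)
```

For a realistic device the figure of merit is not separable per pixel, so DBS converges to a local optimum that depends on the starting pattern. This is one motivation for seeding it with a TO result, as in the hybrid approach described above.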

Table 2:

Summary of inverse-designed (de)multiplexers.

Ref	Design method	Structure	IL (dB) (sim./exp.)	XT (dB) (sim./exp.)	Footprint (μm²)
[79]	Forward (ring)	1 × 4	–/0.6	–/16	26 × 40 (single ring)
[80]	Forward (contra-DC)	1 × 4	–/1.8	–/−21.6	250 × 25 (single DC)
[82]	Forward (MZI)	1 × 4	–/3.7	–/−16	1,680 × 870
[89]	TO	1 × 2	2/2.4	−12.6/−11	2.8 × 2.8
[86]	TO	1 × 3	1.55/2.3	−15/−10.7	5.5 × 4.5
[87]	TO	1 × 2	1.45/2.1	−23.5/−12.1	2.4 × 10
[90]	TO + DBS	1 × 2	0.67/1.2	−19/−16.4	3.0 × 3.0
[88]	DBS	1 × 6	2.5/6	−38/−20	27 × 12

3.3 Grating couplers

In recent years, significant efforts have been dedicated to designing grating couplers (GCs), essential components in optical I/O systems. Conventional approaches, such as chirped-grating [91], dual-etched grating [92], metallic mirror [93], dual-layer grating [94], and apodized grating designs [95], [96], [97], have been proposed to enhance GC directivity and reduce back reflection. However, these methods often involve complicated fabrication processes that are incompatible with standard silicon photonics processes and have demonstrated limited performance. The inverse design approach has emerged as a promising alternative in designing GCs. While inverse-designed GCs do not achieve the dramatic size reductions seen in power splitters or wavelength multiplexers/demultiplexers, they offer greater design freedom. This enables GCs that better match the Gaussian beam profile at the intended coupling angle, thereby achieving coupling efficiencies beyond what is achievable with conventional apodization techniques. Representative examples of inverse-designed GCs are depicted in Figure 11.

Figure 11:

Representative inverse-designed grating couplers (GCs). (A) Gradient-based inverse-designed focused GC with a 70 nm etched structure. SEM image of the fabricated GC (left), gradient-optimized structure, and simulated field propagation (middle), and comparison between simulated coupling efficiency and measured coupling efficiency (right). (B) PSO-optimized meta GC with a triple-etched structure. SEM image of the fabricated device, including a magnified top and cross-sectional view (left), an illustration of the optimized structure (middle) and simulated radiation angle results showing vertical emission (right). (C) Multi-layer vertical emitting GC optimized by density-based TO. Optical micrographs of the fabricated single-polarization GC (left). The density-based TO optimization process is depicted, showing the jointly optimized poly-Si and SOI layers, which achieved high coupling efficiency after 600 iterations (middle). A comparison between the simulated coupling efficiency and measured coupling efficiency (right). (A) is reprinted from Ref. [98], with permission (CC BY 4.0); (B) is reprinted from Ref. [99], with permission from Photonics Research; (C) is reprinted from Ref. [100], under the terms of the Open Access Publishing Agreement.

Yang et al. introduced an inverse-designed GC capable of fully vertical coupling [98]. This study utilized a gradient-based inverse design method implemented in a Python suite (Lumopt) to optimize a focused grating coupler with a 70 nm etch depth. The resulting device, consisting of 30 periods with rib and groove configurations, was fabricated with a feature size of 100 nm. The measured coupling efficiency of the fabricated GC was −5.86 dB, representing a 3.04 dB improvement over a conventional uniform GC. Nonetheless, there was a performance degradation of 2.86 dB compared to the simulation results, which was attributed to fabrication errors and difficulties in precise fiber alignment. Similarly, Sapra et al. applied a gradient-based inverse design method to design and fabricate out-of-plane GCs [101]. The GCs were optimized for target bandwidths of 40 nm, 100 nm, and 120 nm, using the same set of etch depths of 40 %, 60 %, 80 %, and 100 %. Simulations were conducted to assess the coupling efficiency spectra, revealing a trade-off between bandwidth and coupling efficiency. As representatives, three fully-etched grating couplers optimized for target bandwidths of 40 nm, 100 nm, and 120 nm were fabricated and tested. Compared to the simulation results, the measured coupling efficiencies showed a discrepancy of less than 0.5 dB, and the 3-dB bandwidths were also highly consistent, differing by only 7–19 nm.

Yoon et al. utilized a PSO-based inverse design approach to design a vertical emitting GC [99]. Unlike previous studies [98], [101], which focused on optimizing the grating period to achieve apodization, this study concentrated on optimizing the depth of each grating. The optimized GC was fabricated using a triple-etched meta-grating structure with three distinct height levels: 0 nm, 150 nm, and 220 nm. Simulation results indicated a coupling efficiency of −2.2 dB with a 3-dB bandwidth of 88 nm. However, experimental results revealed a performance degradation, with a coupling efficiency of −4.2 dB and a 3-dB bandwidth of 48 nm. This performance degradation was attributed to fabrication-related issues, including undesired etching, over-etching due to mask misalignment, and variations in the sidewall angles. Hammond et al. applied a density-based TO method to design both single- and dual-polarization vertical emitting GCs on the double-layer platform [100]. This method optimized the full 3D structure across both SOI and Poly-Si layers to maximize coupling efficiency. The single-polarization GC showed a coupling efficiency of −3 dB and a 3-dB bandwidth of 73 nm in simulation, but experimental results showed −4.7 dB with a 75 nm bandwidth, likely due to fabrication challenges such as conformal layer misalignment. The dual-polarization GC similarly showed a slight coupling efficiency drop from −5.6 dB (simulation) to −7 dB (experiment), though it maintained high polarization extinction. Despite the fabrication-related variations, both GCs showed consistent performance with a variation of only 2.4 dB across multiple wafers.

Some studies have used inverse design to demonstrate GCs with record-breaking performance in simulation, although fabrication and experimental validation were not provided. Michaels et al. demonstrated, through simulation, a grating coupler achieving a remarkable chip-to-fiber coupling efficiency of 99.2 % at a wavelength of 1,550 nm, with a 1-dB bandwidth of 24 nm [102]. This study presents an idealized design, highlighting the potential of inverse-designed grating couplers while setting a benchmark for ultra-high-efficiency GCs. In addition, some studies have explored deep learning-based inverse design methods to enhance computational efficiency, particularly for GC designs that typically require a large simulation cost. Tu et al. proposed a DNN-based GC optimization method with a data-driven approach to predict GC performance and also conduct inverse design [103]. In this work, two deep learning models were designed: a forward design model and an inverse design model. The forward design model predicts a GC’s coupling efficiency and center wavelength based on input parameters such as the grating’s etching depth, pitch, and duty cycle. Conversely, the inverse design model predicts GC design parameters from given values of coupling efficiency and center wavelength. A total of 937 data samples were gathered and utilized for training. The forward design model achieved a prediction accuracy of 91.7 % with a low MSE loss of 0.0034, while the inverse design model also successfully generated viable designs with a reasonable MSE loss of 0.0403, closely matching the target spectrum. Witt et al. also explored the design and optimization of GCs using deep learning [65]. The fab-in-the-loop reinforcement learning approach, which incorporates feedback from real-world fabrication, was utilized along with the Deep Deterministic Policy Gradient (DDPG) algorithm. This method involved running 10,000 episodes to optimize 12 adjustable parameters with sub-wavelength holes, resulting in an optimized GC with an insertion loss of 3.24 dB, a marked improvement over the 8.8 dB loss seen with conventional methods. A summary of the results of these inverse-designed GCs is provided in Table 3.
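Because the text above quotes coupling efficiencies in dB while Table 3 lists them in percent, a small conversion helper can be useful when comparing the two. The sketch below uses the standard power-ratio relation between dB and linear units; the function names are chosen here for illustration.

```python
import math


def db_to_percent(value_db):
    """Convert a power ratio in dB (e.g. -3 dB) to a percentage."""
    return 100.0 * 10.0 ** (value_db / 10.0)


def percent_to_db(value_percent):
    """Convert a power ratio in percent (e.g. 50 %) to dB."""
    return 10.0 * math.log10(value_percent / 100.0)


# Values quoted in the text above, converted for comparison with Table 3.
for label, ce_db in [("Ref. [99], simulated", -2.2),
                     ("Ref. [99], measured", -4.2),
                     ("Ref. [65], measured", -3.24)]:
    print(f"{label}: {ce_db} dB = {db_to_percent(ce_db):.1f} %")
```

An insertion loss of 3.24 dB corresponds to a coupling efficiency of −3.24 dB, which is why the value for Ref. [65] is entered with a negative sign.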

Table 3:

Summary of inverse-designed grating couplers.

Ref	Design method	Structure	CE (%) (sim./exp.)	3-dB BW (nm)
[91]	Forward	Chirped	42/34	48
[92]	Forward	Dual-etched	45.3/27.6	68
[95]	Recursive	Apodized grating	61.4/–	–/–
[96]	GA	Apodized grating	61/–	35/– (1-dB)
[97]	GA	Apodized grating	64.7/–	33/– (1-dB)
[98]	Gradient-based	Partial-etched binary grating	50.1/25.7	29/37
[99]	PSO	Triple-etched meta-grating	60.2/38	88/74
[100]	DTO	Double-layer (single pol.)	50.1/33.9	73/75
[101]	Gradient-based	Fully-etched binary grating	38/34.6	40/33
[102]	Gradient-based	Double-layer fully etched grating	99.2/–	24/– (1-dB)
[103]	DNN	Partial-etched binary grating	34.6/–	107/–
[65]	DDPG	Ph.C-based binary grating	47.4/47.4	29/

3.4 Waveguide devices: bends, crossings, and cavities

Waveguide devices, fundamental components in nanophotonic circuits, are structures that confine and guide light along defined paths, enabling a wide range of optical functionalities on a compact scale. Inverse design techniques have revolutionized the development of these devices, allowing for the precise control of light propagation, coupling, and interaction with more compact and efficient devices, with representative examples shown in Figure 12.

Figure 12:

Representative examples of the waveguide devices. (A) Experimental setup and the SEM image of the fabricated U-bend device (upper). Comparison graph between optimized and unoptimized U-bends in terms of experimental and simulation results (lower). (B) SEM image of the TO-based waveguide crossing device on LNOI platform (upper) and the obtained experimental results (lower). (C) Optimization schematic of the reflector (left) and the targeted comb response of the cavity (right). (A) is reprinted from Ref. [104], with permission. Copyright 2024 American Chemical Society; (B) is reprinted from Ref. [105], with permission. Copyright 2024 Elsevier; (C) is reprinted from Ref. [106], with permission (CC BY 4.0).

Waveguide bends, crossings, and cavities are simple yet crucial elements of photonic circuits. Conventional bends require adiabatic mode converters and a large radius of curvature to avoid bending loss and modal mismatch [107], [108]. To reduce the bending radius and avoid the need for mode converters, researchers have applied inverse design methods to design compact and efficient waveguide bends. Zhang et al. used PSO to design compact, low-loss waveguide bends and verified them experimentally [109]. Their design employed cubic spline interpolation optimized with PSO to find the bend shape that best supports a targeted number of modes. The reported waveguide bends show insertion losses of 0.009 dB, 0.028 dB, and 0.048 dB for waveguides supporting one, two, and three modes, respectively. In a recent study, Irfan et al. demonstrated that TO can produce compact and low-loss L-bend and U-bend structures on the SOI platform [104]. They reported three different bends for each shape, and TO reduced the insertion loss by nearly half. Waveguide bends, crossings, and other types of devices have also been proposed on the lithium niobate-on-insulator (LNOI) platform, owing to the promising material characteristics of lithium niobate. In this regard, Shang et al. proposed inverse-designed mode multiplexers, waveguide crossings, and waveguide bend structures on the LNOI platform [105]. Their experimental results demonstrate that the inverse-designed bent waveguide reduces the insertion loss by up to 8 times compared to a waveguide bend with the same radius of curvature. All three devices operate between 1,500 nm and 1,600 nm with improved performance.

Beyond these fundamental elements of photonic circuits, advanced waveguide devices such as resonators, sensors, and gratings offer enhanced functionality and precision in light manipulation, paving the way for innovative applications in sensing, filtering, and signal processing. In this regard, Yang et al. proposed a silicon carbide (SiC)-based optical Fabry–Perot cavity design with the help of inverse-designed reflectors [106]. The proposed reflector had dimensions of 6.75 μm by 1 μm and represents one of the first demonstrations of inverse design applied to quantum and nonlinear light generation. Chung et al. proposed an inverse-designed waveguide biosensor operating at 1,550 nm on the SOI platform [110]. They implemented the high-contrast probe cleavage detection (HCCD) method to design and optimize the device, demonstrating a biosensor suitable for rapid sensing with a high transmission rate for the target molecule, thus enhancing sensing effectiveness and precision. The results of these inverse-designed waveguide devices are summarized in Table 4.

Table 4:

Summary of inverse-designed waveguide devices.

Ref	Design method	Waveguide device	Material platform
[107]	Forward	90° bend	SOI
[108]	Forward	90° bend	SOI
[109]	PSO	90° bend	SOI
[104]	TO	U-bend	SOI
[105]	TO	Waveguide crossing	LNOI
[106]	TO	Reflector	SiC

3.5 Metasurfaces

Metasurfaces, owing to their capability of manipulating wavefronts with sub-micron-thick optical elements, have attracted significant attention over the past decade. However, the extremely large number of degrees of freedom in their designs also makes it challenging to find the optimized structure for a particular application. To address this problem, various inverse design methods, such as evolutionary algorithms, gradient-based methods, and deep learning methods, have been utilized for the rapid design of high-performance metasurfaces. These inverse design methods can be categorized according to their main objective: reducing optimization iterations, accelerating forward calculations, and directly deriving structure from target properties without iteration. In this subsection, we introduce selected examples of inverse-designed metasurfaces classified by their design methodology.

Population-based heuristics and gradient-based methods have been utilized to minimize the design iterations needed to find the optimal structure. Sun et al. used a genetic algorithm to determine the optimal arrangement of “0” and “1” unit cells on the metasurface, achieving uniform backscattering throughout a broad frequency range with a reduced radar cross-section [111]. Work by Fan et al. utilized a genetic algorithm for the optimization of a 1D Pancharatnam–Berry phase-controlled metasurface [112]. The authors implemented a light-sheet mode by selecting appropriate phase profiles for unit cells. Haji-Ahmadi et al. designed a pixelated checkerboard metasurface for broadband radar cross-section reduction using a binary PSO algorithm [113]. A sigmoid limiting transformation was applied to the particle for binarization. The optimized unit cells are shown in Figure 13(A). The out-of-phase reflection from these unit cells enables broadband radar cross-section reduction.
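
As an illustration of the binarization step mentioned above, the following is a minimal sketch of a generic binary PSO, in which the sigmoid of each velocity component is interpreted as the probability that the corresponding bit is set to 1. It is not the implementation of Ref. [113]: the fitness function (agreement with a hypothetical target pattern), the swarm parameters, and all function names are assumptions chosen for illustration. In a real design loop, the fitness would come from an electromagnetic simulation of the pixelated unit cell.

```python
import math
import random


def sigmoid(v):
    """Sigmoid limiting transformation: maps a velocity to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))


def binary_pso(fitness, n_bits, n_particles=20, n_iters=60,
               w=0.7, c1=1.5, c2=1.5, v_max=4.0, seed=0):
    """Maximize `fitness` over bit strings of length n_bits with a generic binary PSO."""
    rng = random.Random(seed)
    pos = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(n_particles)]
    vel = [[rng.uniform(-1.0, 1.0) for _ in range(n_bits)] for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_fit = [fitness(p) for p in pos]
    g = max(range(n_particles), key=lambda i: pbest_fit[i])
    gbest, gbest_fit = pbest[g][:], pbest_fit[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(n_bits):
                r1, r2 = rng.random(), rng.random()
                v = (w * vel[i][d]
                     + c1 * r1 * (pbest[i][d] - pos[i][d])
                     + c2 * r2 * (gbest[d] - pos[i][d]))
                vel[i][d] = max(-v_max, min(v_max, v))  # clamp the velocity
                # Binarization: the bit is 1 with probability sigmoid(velocity).
                pos[i][d] = 1 if rng.random() < sigmoid(vel[i][d]) else 0
            f = fitness(pos[i])
            if f > pbest_fit[i]:
                pbest[i], pbest_fit[i] = pos[i][:], f
                if f > gbest_fit:
                    gbest, gbest_fit = pos[i][:], f
    return gbest, gbest_fit


# Hypothetical stand-in for a simulated figure of merit: number of pixels
# that agree with an arbitrary 16-pixel target layout.
TARGET = [1, 0, 1, 0, 0, 1, 0, 1, 1, 0, 1, 0, 0, 1, 0, 1]


def toy_fitness(bits):
    return sum(1 for b, t in zip(bits, TARGET) if b == t)


if __name__ == "__main__":
    best, best_fit = binary_pso(toy_fitness, n_bits=len(TARGET))
    print(best, best_fit)
```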

Figure 13:

Representative examples of the inverse-designed metasurfaces. (A) Unit cell layouts optimized by binary PSO. (B) Flowchart of the inverse design method of multicolor hologram metasurfaces. (C) Network architecture of the physics-based neural network surrogate model. (D) Tandem DNN architecture used for silicon color prediction. (A) is reprinted from Ref. [113], with permission (CC BY 4.0); (B) is reprinted from Ref. [114], with permission. Copyright 2024 John Wiley and Sons; (C) is reprinted from Ref. [115], with permission from Photonics Research; (D) is reprinted from Ref. [116], with permission. Copyright 2024 John Wiley and Sons.

Gradient-based optimization methods, including the adjoint method, have been widely implemented for the inverse design of metasurfaces [114], [117], [118], [119], [120]. In the work of Chung et al., a metalens with extremely high NA was theoretically demonstrated using an adjoint method and minimax optimization [118]. The designed metalens has a 2D freeform topology consisting of binary pixels. In another work by Chung et al. [119], adjoint-based local optimization was merged within a global optimization process for the design of a tunable beam-deflecting metasurface. The optimized triple grating structure achieved 80 % switching efficiency and a deflection angle of up to 144°. Mansouree et al. demonstrated a multifunctional 2.5D metasurface that focuses two different wavelengths at different focal points using adjoint optimization [120]. The authors illustrated that the inverse-designed structure outperforms conventional unit-cell-based metasurfaces. Moreover, they showed that increasing the degrees of freedom can further enhance the performance of multifunctional metasurfaces. So et al. designed multicolor and multiplane holograms enabled by single-cell metasurfaces [114]. Gradient descent was used to optimize the phase profile for multiple wavelengths, as shown in Figure 13(B). Automatic differentiation was used for the efficient calculation of gradients.
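
To make the idea of gradient-based phase optimization concrete, the sketch below optimizes the phases of a few pixels so that their contributions add constructively at a single target point, with the gradient obtained by a tiny forward-mode automatic differentiation class (dual numbers). This is a toy stand-in and not the method of Ref. [114]: the single-point objective, the fixed propagation phases, the step size, and the `Dual` class are all assumptions, and practical frameworks typically use reverse-mode differentiation through a full propagation model.

```python
import math


class Dual:
    """Number val + der*eps with eps**2 = 0; `der` carries the derivative."""

    def __init__(self, val, der=0.0):
        self.val, self.der = val, der

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.val + other.val, self.der + other.der)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        return Dual(self.val * other.val,
                    self.val * other.der + self.der * other.val)

    __rmul__ = __mul__


def _lift(x):
    return x if isinstance(x, Dual) else Dual(float(x))


def dcos(x):
    return Dual(math.cos(x.val), -math.sin(x.val) * x.der)


def dsin(x):
    return Dual(math.sin(x.val), math.cos(x.val) * x.der)


def focal_intensity(phases, path_phases):
    """Normalized intensity |sum_k exp(i(phi_k + psi_k))|^2 / N^2 at the target."""
    re, im = Dual(0.0), Dual(0.0)
    for phi, psi in zip(phases, path_phases):
        total = phi + psi
        re = re + dcos(total)
        im = im + dsin(total)
    return (re * re + im * im) * (1.0 / len(phases) ** 2)


def gradient(phases, path_phases):
    """One forward-mode pass per pixel: seed der=1 for the pixel being differentiated."""
    grads = []
    for k in range(len(phases)):
        duals = [Dual(p, 1.0 if j == k else 0.0) for j, p in enumerate(phases)]
        grads.append(focal_intensity(duals, path_phases).der)
    return grads


def optimize_phases(path_phases, steps=300, lr=2.0):
    """Plain gradient ascent on the pixel phases, starting from zero phase."""
    phases = [0.0] * len(path_phases)
    for _ in range(steps):
        g = gradient(phases, path_phases)
        phases = [p + lr * gk for p, gk in zip(phases, g)]
    value = focal_intensity([Dual(p) for p in phases], path_phases).val
    return phases, value


# Hypothetical propagation phases (radians) from each pixel to the target point.
PATH_PHASES = [0.0, 0.9, 1.7, 2.8, 4.0, 5.1, 0.3, 2.2]

if __name__ == "__main__":
    _, final_value = optimize_phases(PATH_PHASES)
    print(f"normalized intensity after optimization: {final_value:.4f}")
```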

On the other hand, attempts have been made to accelerate forward calculations by replacing the conventional electromagnetic simulations with faster alternatives. Machine learning-based surrogate models and coupled mode theory (CMT) are examples. Jing et al. designed an orbital angular momentum multiplexing metasurface using an iterative hybrid optimization algorithm, which includes a neural-net surrogate model and genetic algorithm [115]. In the optimization process, the fitness of the structure was predicted using a physics-based neural net surrogate model. At the end of each iteration, genetic operators were applied for the reproduction of the next population. Wiecha et al. trained a CNN capable of predicting the near fields and far fields of a nanostructure [121]. In the work of Tanriover et al., a forward-predicting network consisting of an autoencoder and a fully connected network was used for the prediction of the optical response of metasurface unit cells [122]. Zhou et al. used coupled mode theory and adjoint optimization for the inverse design of large-area high NA metalenses [123]. The far-fields calculated by CMT were optimized with the adjoint method and then converted to precalculated geometric parameters. Wu et al. employed spatial coupled-mode theory for the inverse design of a large NA metalens [124]. By modelling the meta-atoms as truncated waveguides, they proposed an inverse design framework that is faster than conventional full-wave simulation.

Potentially, the fastest inverse design methods are those that can generate optimal structures directly from target properties. Liu et al. trained a generative adversarial network that generates the unit cell structure for a given transmittance spectrum [125]. Gao et al. addressed the non-uniqueness problem in inverse design by employing a bidirectional DNN with a tandem network architecture for the structural color design of silicon [116]. An inverse network was connected to a pre-trained forward network trained to predict the color of a given geometry. The tandem network was then trained to minimize the error between the input and output color. Lin et al. designed a nanodisc structure-based plasmonic metasurface using a CNN [126]. The CNN takes the absorption spectra as inputs and outputs the corresponding geometric parameters. The geometric parameters were restricted to avoid non-uniqueness problems arising from symmetry. Jiang et al. presented a conditional generative neural network capable of global optimization for the inverse design of metagratings [127]. Forward and adjoint simulations were performed, providing physics-based gradients to the network. Chang et al. reported an inverse design method using factorization of Jones matrices [128]. The authors proved that an arbitrary Jones matrix could be implemented with a bilayer elliptical silicon nanopost array. Inverse-designed metasurfaces with two independent vectorial holographic images and an optical CNOT gate were demonstrated. The results of the inverse-designed metasurfaces are summarized in Table 5.

Table 5:

Summary of inverse-designed metasurfaces.

Ref	Design method	Functionality	Note
[129]	Forward	Holograms
[130]	Forward	Eyepiece for augmented reality
[111]	GA	RCS reduction	Broadband, broad angle
[112]	GA	Light sheet mode
[113]	BPSO	RCS reduction	Broadband
[118]	Adjoint	High-NA metalens (0.99)	Freeform
[119]	Adjoint + PSO	Tunable beam deflector (144°)
[120]	Adjoint	Metalens	Multifunctional
[114]	Gradient descent	Multicolor/multiplane hologram
[115]	NNSM + GA	OAM (de)multiplexer
[122]	Autoencoder + FCN	Metalens	Fabrication feasible
[123]	CMT + Adjoint	Metalens, hologram	Large area
[124]	SCMT	Metalens, aberration-reduced lens
[125]	GAN	Spectrum prediction
[116]	Tandem network	Si structural color
[126]	CNN	Plasmonic metasurface
[127]	GLOnet	Metagrating	Global optimization
[128]	Matrix factorization	Vectorial hologram, CNOT gate

4 Commercial and open-source inverse design tools

The optimization and inverse design techniques discussed so far rely on full-wave simulations, such as the finite-difference time-domain (FDTD) method and the finite element method (FEM). FDTD solves the time-domain Maxwell equations on a structured Yee grid with a finite time step, while FEM solves the time-harmonic Maxwell equations on a complex, typically unstructured mesh using a weak-form discretization that leads to a system of linear equations. Many electromagnetic solvers now offer optimization plug-ins, allowing seamless integration of photonic simulations with post-simulation optimization workflows.
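
For readers unfamiliar with the Yee scheme, the following is a minimal one-dimensional sketch of the leapfrog update in which the electric and magnetic fields live on staggered grid points and are advanced alternately in time. The normalized units, the Courant number of 0.5, the perfectly conducting end walls, and the Gaussian soft source are illustrative assumptions; the code is unrelated to the internals of any solver named in this section.

```python
import math


def fdtd_1d(n_cells=200, n_steps=150, courant=0.5, source_cell=50):
    """Leapfrog update of Ez and Hy on a staggered 1D Yee grid (normalized units).

    `courant` is c*dt/dx; the 1D scheme is stable for courant <= 1.
    """
    ez = [0.0] * n_cells          # E sampled at integer grid points
    hy = [0.0] * (n_cells - 1)    # H sampled at half-integer grid points

    for step in range(n_steps):
        # Advance H by half a time step using the spatial difference of E.
        for i in range(n_cells - 1):
            hy[i] += courant * (ez[i + 1] - ez[i])
        # Advance E using the spatial difference of H.
        # The two end cells are never updated, so they stay at zero (PEC walls).
        for i in range(1, n_cells - 1):
            ez[i] += courant * (hy[i] - hy[i - 1])
        # Additive ("soft") Gaussian pulse source.
        ez[source_cell] += math.exp(-((step - 30) / 10.0) ** 2)
    return ez


if __name__ == "__main__":
    field = fdtd_1d()
    peak = max(range(len(field)), key=lambda i: abs(field[i]))
    print(f"largest |Ez| at cell {peak}")
```

Because a single time-domain run of this kind contains a broad range of frequencies, a Fourier transform of the recorded fields yields a broadband response from one simulation.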

We note that, although there is no theoretical preference between these solvers once a certain level of accuracy is achieved, FDTD has recently been widely used for inverse design problems due to its ease of parallel computation and hardware acceleration. The Yee grid’s simple structure supports spatial parallelism across multiple processors, and the time-domain approach allows for broadband simulations in a single run, enabling frequency parallelism [131]. FDTD also benefits significantly from GPU acceleration, overcoming CPU memory bandwidth limitations [132].

In the following, we introduce several commercial and open-source tools commonly used in photonic inverse-design applications.

First, MEEP [131], [133] and Tidy3D [134] are open-source and commercial FDTD solvers, respectively, both available through Python interfaces. Users can define electromagnetic problems using Python scripts to set object geometries, material properties, wave sources, monitors, boundary conditions, and other parameters. Running these scripts generates EM field data and their derivatives such as the Poynting vector on the predefined monitors. Notably, MEEP, as a free and open-source tool, supports MPI-based parallel computing, while Tidy3D, built for GPU acceleration, is particularly efficient for large-scale problems with more than one billion grid points.

These tools support built-in adjoint-based optimization, which is integrated with Autograd [135] and JAX [136]. In this approach, a single optimization step involves running two electromagnetic simulations (forward and adjoint) to compute the gradient of the objective function with respect to the optical parameters (permittivity). These automatic differentiation tools streamline the optimization process, from pre-simulation parameter adjustments that enforce fabrication constraints to post-simulation customization of the objective function using basic field elements. In addition, heuristic optimization techniques such as genetic algorithms and particle swarm optimization can be implemented with open-source libraries, such as pyGAD [137] and PySwarms [138].
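
The reason two solves suffice for the full gradient can be seen on a small linear system. In the sketch below, a toy matrix A(p) (a 1D Laplacian stencil plus a diagonal term p that plays the role of the permittivity) stands in for the discretized Maxwell operator, the forward problem is A x = b, and the objective is f = c·x. Solving the adjoint system Aᵀλ = c gives df/dp_i = −λ_i x_i for every parameter at once. The operator, the objective, and all function names are assumptions for illustration and do not reflect the API of MEEP or Tidy3D.

```python
def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for k in range(col, n + 1):
                m[r][k] -= factor * m[col][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def system_matrix(p):
    """Toy operator: tridiagonal Laplacian stencil plus diagonal parameters p."""
    n = len(p)
    a = [[0.0] * n for _ in range(n)]
    for i in range(n):
        a[i][i] = 2.0 + p[i]
        if i > 0:
            a[i][i - 1] = -1.0
        if i < n - 1:
            a[i][i + 1] = -1.0
    return a


def objective(p, b, c):
    """f(p) = c . x(p), where x solves A(p) x = b."""
    x = solve(system_matrix(p), b)
    return sum(ci * xi for ci, xi in zip(c, x))


def objective_and_gradient(p, b, c):
    """Return f and df/dp using one forward solve and one adjoint solve."""
    a = system_matrix(p)
    x = solve(a, b)                          # forward solve: A x = b
    a_t = [list(col) for col in zip(*a)]     # transpose of A
    lam = solve(a_t, c)                      # adjoint solve: A^T lam = c
    f = sum(ci * xi for ci, xi in zip(c, x))
    # dA/dp_i has a single 1 at (i, i), so df/dp_i = -lam_i * x_i.
    grad = [-lam[i] * x[i] for i in range(len(p))]
    return f, grad


if __name__ == "__main__":
    p = [0.5, 1.0, 1.5, 0.2]
    b = [1.0, 0.0, 0.0, 0.0]   # "source" at the first node
    c = [0.0, 0.0, 0.0, 1.0]   # "monitor" at the last node
    f, grad = objective_and_gradient(p, b, c)
    print(f, grad)
```

The cost of the gradient is independent of the number of parameters, which is what makes the approach attractive for freeform designs with many pixels.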

In addition to the tools mentioned, Lumerical [139], a widely used photonics simulation software suite developed by Ansys, provides robust support for optimization workflows, particularly within its Lumerical FDTD Solutions. It includes built-in heuristic methods like PSO and supports adjoint method-based TO through Lumopt [140], a specialized Python API. Moreover, Lumerical is also compatible with external Python libraries, such as Splayout [141], allowing users to implement various heuristic algorithms like GA, PSO, DBS, as well as adjoint-based TO.

On the other hand, COMSOL Multiphysics [142], a commercial FEM solver, is also used for photonic inverse design. Unlike FDTD, FEM can address explicit material boundaries, which is particularly advantageous in cases involving sharp metallic object corners where electromagnetic fields are highly enhanced. Also, FEM can be computationally efficient for strongly resonant structures, which might take a long simulation time with FDTD. Consequently, FEM solvers, including COMSOL, are often a more suitable choice for inverse design problems, particularly those involving plasmonic objects [143]. Like FDTD solvers, COMSOL supports the MATLAB interface, allowing for optimizations such as particle swarm optimization [144] and adjoint sensitivity [44]. Moreover, TO codes in MATLAB, such as the one introduced by Ref. [145], can serve as a useful platform for implementing customized photonic design workflows within COMSOL.

Notably, simpler structures with specific symmetry may not require full-wave simulations. For example, RCWA [146], [147] is a semi-analytical method that efficiently solves Maxwell’s equations in Fourier space, especially for periodic structures. Similarly, infinite multilayer structures can be analyzed using the transfer matrix method (TMM) [148], [149], which relates the electromagnetic fields across the boundaries of different layers. Mie theory [150], [151] can solve problems involving cylindrical structures with radial symmetry. These methods are implemented in commercial software (e.g. Lumerical) and open-source platforms (e.g. RETICOLO), or can be developed in-house for heuristic or adjoint optimization, as well as for generating ground-truth data for the training of deep neural networks.
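
As a concrete instance of the transfer matrix method, the sketch below computes the reflectance of a planar multilayer using the standard characteristic-matrix formulation. It assumes normal incidence and non-magnetic, homogeneous layers; the function names and the example refractive indices are hypothetical choices for illustration.

```python
import cmath
import math


def layer_matrix(n, d, wavelength):
    """Characteristic matrix of one homogeneous layer at normal incidence."""
    delta = 2.0 * math.pi * n * d / wavelength   # phase thickness of the layer
    return ((cmath.cos(delta), 1j * cmath.sin(delta) / n),
            (1j * n * cmath.sin(delta), cmath.cos(delta)))


def matmul2(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return ((a[0][0] * b[0][0] + a[0][1] * b[1][0],
             a[0][0] * b[0][1] + a[0][1] * b[1][1]),
            (a[1][0] * b[0][0] + a[1][1] * b[1][0],
             a[1][0] * b[0][1] + a[1][1] * b[1][1]))


def reflectance(layers, wavelength, n_in=1.0, n_sub=1.5):
    """Power reflectance of a stack; `layers` is a list of (index, thickness)
    pairs ordered from the incidence side to the substrate."""
    m = ((1.0, 0.0), (0.0, 1.0))
    for n, d in layers:
        m = matmul2(m, layer_matrix(n, d, wavelength))
    b = m[0][0] + m[0][1] * n_sub
    c = m[1][0] + m[1][1] * n_sub
    r = (n_in * b - c) / (n_in * b + c)
    return abs(r) ** 2


if __name__ == "__main__":
    wavelength = 1550.0  # nm
    # Hypothetical quarter-wave antireflection layer with n = sqrt(n_in * n_sub).
    n_sub = 2.25
    n_ar = math.sqrt(1.0 * n_sub)
    stack = [(n_ar, wavelength / (4.0 * n_ar))]
    print("bare substrate:", reflectance([], wavelength, n_sub=n_sub))
    print("with AR layer :", reflectance(stack, wavelength, n_sub=n_sub))
```

Because each evaluation involves only a product of 2 × 2 matrices, such a solver is cheap enough to be called many thousands of times inside a heuristic optimizer or to generate training data.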

5 Commercial foundry implementations for silicon photonics and other material platforms

Most inverse designs have been fabricated primarily using electron-beam (e-beam) lithography, which enables reduced pixel sizes for optimal designs with an enhanced degree of freedom. While e-beam lithography is convenient for readily verifying the design feasibility through in-house fabrication, it remains inherently impractical for mass production. Therefore, the adoption of commercial foundry production utilizing deep ultraviolet photolithography (DUVL) is essential for the widespread dissemination of the inverse design in legacy PIC technologies. This section presents a range of examples where inverse-designed photonic devices have been successfully produced by DUVL (see Table 6), demonstrating how it bridges the gap between innovative designs and scalable manufacturing.

Table 6:

Summary of silicon photonic devices fabricated using DUVL.

Ref.	Exposure wavelength	Method	Device	Minimum feature size	Foundry
[70]	248 nm	PSO	MMI	100 nm	IME
[152]	248 nm	ADJ	Crossing	500 nm	Sandia
[99]	248 nm	PSO	GC	200 nm	–
[153]	193 nm	PSO-ADJ	MC	200 nm	–
[154]	193 nm	ADJ	GC	65 nm	CEA-Leti
[155]	193 nm	ADJ	GC	60 nm	CEA-Leti
[156]	193 nm	ADJ-LST	GC-WDM	160 nm	AMF
[157]	193 nm	DTO	MDM, WDM, DC, BS	40 nm	AIM photonics
[100]	193 nm	DTO	GC	100 nm	Global foundries
[44]	–	DTO	Vector matrix product	–	AMF

Several research efforts in inverse design have leveraged DUVL with KrF [70], [99], [152] and ArF [100], [153]–[157] laser exposure (as summarized in Table 6). Various devices, including an MMI power splitter (PS) [70], a vertical grating coupler (GC) [99], and a mode converter (MC) [153], have been inversely designed with the aid of PSO and fabricated by DUVL. The adjoint method and adjoint-inspired designs have demonstrated vertical GCs with an SOI single layer [154] and an SOI/SiN dual layer [155], fabricated by DUVL at CEA-Leti. More recently, a GC demultiplexer designed by the adjoint method with a fast integral technique has been fabricated by DUVL at AMF [156].

The device geometry can be optimized to be more fabricable by enforcing the level-set fabrication constraint, where a penalty term can be incorporated to facilitate the concurrent optimization of performance and fabricability [158]. Inverse-designed devices including MDM, WDM, DC, and PS have been demonstrated under the constraints of a minimum gap and a minimum radius of curvature by DUVL at AIM Photonics [157].

The density-based topology optimization (DTO) method has been successfully implemented for DUVL-based fabrication of inverse-designed devices. Fabrication constraints on minimum area and minimum enclosed area can be implemented in conjunction with the previously established constraints on minimum linewidth, line spacing, and curvature [159]. Based on this method, it has been experimentally shown that both single- and dual-polarization vertical GCs can be fabricated through DUVL at AMF [100]. More recently, the two-dimensional effective index approximation has been applied to design devices that compute vector-matrix products for N × N matrices; these devices were inverse-designed by DTO over a large computational domain and fabricated via DUVL at AMF. DRC compliance of the DTO technique has also been investigated theoretically using a conditional generator for feasible designs and a straight-through estimator [160].
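
To give a feel for what minimum linewidth and line spacing rules mean for a pixelated design, the sketch below scans a one-dimensional binary pattern (1 = material, 0 = etched gap) and reports runs that are narrower than the allowed width. This is a deliberately simplified, hypothetical check: foundry DRC decks and the constraint formulations of Refs. [158], [159] operate on two-dimensional layouts and include area and curvature rules that are not modeled here.

```python
def run_lengths(pattern):
    """Return (value, length) for each run of identical pixels in a 1D pattern."""
    runs = []
    for pixel in pattern:
        if runs and runs[-1][0] == pixel:
            runs[-1][1] += 1
        else:
            runs.append([pixel, 1])
    return [(value, length) for value, length in runs]


def feature_violations(pattern, pixel_nm, min_line_nm, min_space_nm):
    """List (start_index, value, width_nm) for every run below its size limit.

    Runs touching the pattern boundary are checked like any other run,
    which is a simplification.
    """
    violations = []
    start = 0
    for value, length in run_lengths(pattern):
        width = length * pixel_nm
        limit = min_line_nm if value == 1 else min_space_nm
        if width < limit:
            violations.append((start, value, width))
        start += length
    return violations


if __name__ == "__main__":
    # Hypothetical pattern with 50 nm pixels and 100 nm line/space rules.
    pattern = [1, 1, 1, 0, 1, 1, 1, 0, 0, 0]
    print(feature_violations(pattern, pixel_nm=50, min_line_nm=100, min_space_nm=100))
```

In an optimization loop, the number or total width of such violations could serve as a penalty added to the performance objective, in the spirit of the penalty-based constraints described above.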

While the Si-based photonic platforms, leveraging mature CMOS fabrication technologies, remain dominant in inverse design methodologies, they inherently possess limitations such as weak nonlinear optical properties and a narrow transparency window. These drawbacks have driven interest in alternative materials – such as III–V compounds [161], silicon carbide (SiC) [106], diamond [162], and lithium niobate (LN) [105], [163] – which offer unique advantages that can be further enhanced through advanced inverse design techniques. Notably, commercial fabrication processes for these materials are less available, prompting research teams to rely on in-house e-beam lithography for fabrication.

Nevertheless, significant efforts have been devoted to inverse-designed photonic devices on these alternative material platforms to overcome their inherent fabrication constraints that are more challenging than those of Si-based counterparts, especially for compact devices, while simultaneously achieving decent device performances (see Table 7). A gallium arsenide (GaAs)-based inverse-designed coupler [161] has been demonstrated with a novel sleeve and bulk fabrication methodology to overcome feature-size-dependent etch rates in the reactive-ion-etching (RIE) process. A fabrication-tolerant inverse-designed Fabry–Pérot (FP) cavity structure based on SiC [106] has been explored to realize second and third-order nonlinear light generation, optimized for low scattering loss and enhanced robustness against fabrication errors. Implementing inverse-designed components on diamond platforms presents significant challenges due to critical constraints in diamond nanofabrication technologies. An advanced optimization-based inverse design technique was proposed to provide the full parameter space of fabricable devices, as demonstrated with a vertical coupler [162]. LN-based devices face difficulties not only in achieving minimum feature sizes but also in substantial sidewall angles due to the physical etching process, which should be accounted for during inverse design optimization processes. A variety of inverse-designed LN devices [105], [163] have been demonstrated to address these practical fabrication constraints.

Table 7:

Summary of fabricated inverse-designed devices in other material platforms.

Ref.	Material	Method	Device	Lithography	Minimum feature size
[161]	GaAs	Gradient-based	GC	e-beam	150 nm
[106]	SiC	Gradient-based	FP cavity	e-beam	120 nm
[162]	Diamond	–	Vertical coupler	e-beam	100 nm
[105]	LN (z-cut)	Gradient TO	Mode multiplexer, crossing, bend	e-beam	200 nm
[163]	LN (x-cut)	Adjoint-based gradient	Mode converter	e-beam	424 nm

6 Discussions

In this review, the vast potential of inverse design methods is highlighted through representative examples. Although these methods have already achieved remarkable results, there is still room for further improvement. Currently, inverse design in nanophotonics is restricted by several challenges. First, the resolution of the fabrication process restricts the degrees of freedom in the inverse design process. Current lithography processes, including the DUV lithography used by commercial foundries and e-beam lithography, have a minimum feature size of 40 nm. Recently, novel lithography techniques such as extreme ultraviolet (EUV) lithography have been developed and hold promising potential for improving inverse design capabilities. Other challenges involve the inverse design techniques themselves. Inverse design methods are sometimes ineffective and are not the best option for every design task. In the following sections, we discuss the challenges of inverse design methods in two categories: optimization-based inverse design and deep learning-based inverse design. By highlighting these challenges, we can focus on design tasks where inverse design methods are most effective. Furthermore, inverse design has proven effective in fields beyond nanophotonics and optics. Insights gained from these diverse applications can inspire further advances in these methodologies.

6.1 Challenges in optimization methods

In this section, several challenges in optimization methods enabled by meta-heuristic algorithms are discussed. Primarily, meta-heuristic algorithms are sensitive to initial conditions and optimization parameters, and they are time-consuming. We next examine how these limitations affect the optimization process with respect to two aspects: multi-objective optimization and global optimization.

Multi-objective optimization problems deal with the optimization of multiple conflicting objectives simultaneously. Such problems can have many solutions, and multi-objective algorithms are essential for such cases. As the objectives conflict with one another, focusing on one objective often leaves others unmet. Thus, multi-objective algorithms aim to identify a set of solutions that balance these conflicting objectives so that all objectives are at least partially fulfilled. Gradient-based algorithms, genetic algorithms, and particle swarm optimization are among the most popular methods for multi-objective optimization. Even though these methods have resulted in compact and highly efficient devices, multi-objective optimization also has some limitations and difficulties. One of the main challenges of these algorithms is the need to set appropriate values for various parameters. These parameters significantly influence the performance, convergence, and quality of the solutions generated by the algorithm. Examples that greatly affect the optimization performance include the population size, crossover rate, and mutation rate for genetic algorithms, and the swarm size, inertia weight, and other constants for PSO. Another challenge is the computational cost. The optimization process in nanophotonics involves a vast parameter space, including shape, size, and material properties. Evaluating the performance of potential solutions over this wide range is computationally intensive, especially when multiple objectives must be considered simultaneously. Scalability is another important issue in multi-objective optimization. As the number of decision variables increases, along with the number of objectives, the performance of evolutionary algorithms may decline. Researchers have proposed new ideas and algorithms to overcome these limitations [164], [165], [166].
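
The notion of a solution set for conflicting objectives is usually formalized as the Pareto front: the designs that cannot be improved in one objective without worsening another. The following minimal sketch filters a list of candidates down to its non-dominated subset, assuming all objectives are to be minimized. The candidate values (insertion loss in dB, footprint in μm²) are hypothetical and serve only as an example.

```python
def dominates(a, b):
    """True if `a` is no worse than `b` in every objective and strictly
    better in at least one (all objectives are minimized)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))


def pareto_front(points):
    """Return the non-dominated points, preserving their input order."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


if __name__ == "__main__":
    # Hypothetical candidates: (insertion loss in dB, footprint in um^2).
    candidates = [(0.5, 20.0), (0.3, 35.0), (0.8, 12.0), (0.6, 25.0), (0.3, 40.0)]
    print(pareto_front(candidates))
```

A designer then picks one point from this front according to the relative importance of each objective, rather than receiving a single "optimal" device.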

Global optimization also remains a challenge. The optimization results of algorithms do not always lead to the global optimum and can easily become trapped in local optima. As discussed in previous sections, the performance of meta-heuristic algorithms is often sensitive to the parameters and initial conditions. The performance is strongly dependent on each selection and parameter. Consequently, designers often resort to iterative optimization processes, adjusting parameters in a trial-and-error manner to obtain near-optimal solutions. This repetitive adjustment can result in an inefficient and time-consuming design process, especially in the case of more complex, multidimensional problems. Furthermore, such sensitivity increases the likelihood of suboptimal solutions and can lead to higher computational costs. Thus, the need for robust strategies that enhance global search capabilities and reduce the reliance on parameter tuning remains critical to overcome these inefficiencies.
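
The dependence on initial conditions can be reproduced on a one-dimensional toy problem. In the sketch below, plain gradient descent on a double-well function (an arbitrary polynomial chosen for illustration, not a photonic figure of merit) converges to different minima depending on the starting point, and a simple multi-start strategy recovers the deeper minimum at the price of repeated runs.

```python
def f(x):
    """Toy objective with one local and one global minimum."""
    return x ** 4 - 3.0 * x ** 2 + x


def df(x):
    """Analytic derivative of f."""
    return 4.0 * x ** 3 - 6.0 * x + 1.0


def descend(x0, lr=0.01, steps=2000):
    """Plain gradient descent from the initial guess x0."""
    x = x0
    for _ in range(steps):
        x -= lr * df(x)
    return x


def multi_start(starts):
    """Run gradient descent from several initial guesses and keep the best result."""
    finals = [descend(x0) for x0 in starts]
    return min(finals, key=f)


if __name__ == "__main__":
    for x0 in (2.0, -2.0):
        x_end = descend(x0)
        print(f"start {x0:+.1f} -> x = {x_end:+.4f}, f = {f(x_end):+.4f}")
    best = multi_start([-2.0, -1.0, 0.0, 1.0, 2.0])
    print(f"multi-start best: x = {best:+.4f}, f = {f(best):+.4f}")
```

Each additional start corresponds to a full optimization run, which illustrates why global search becomes expensive when every function evaluation is an electromagnetic simulation.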

6.2 Challenges in deep learning methods

As widely covered in this review, the growing interest in artificial intelligence has accelerated research in nanophotonics owing to its unique characteristics. Once trained, AI models can generate optimized designs quickly, making them ideal for real-time applications. Moreover, they can be applied to a range of related design problems, offering flexibility that conventional optimization algorithms lack. However, a one-time simulation cost is still needed to generate the data used to train the model. As structures become more complex, more data is required. Although increasing the quantity of labeled data leads to improved network performance, a major drawback of this design approach is its significant computational expense, especially when applied to advanced AI models that require enormous datasets. This challenge becomes even more pronounced in fields like nanophotonics, where training neural networks demands an extensive number of electromagnetic simulations. These simulations are computationally intensive and highly time-consuming, making it difficult to efficiently train models without access to substantial computing resources. Consequently, while more data holds the promise of better performance, practical limitations related to computational power and time requirements present significant barriers to scaling up this method in cutting-edge applications. Recent studies have sought to reduce this one-time cost by leveraging advanced techniques such as generative AI or transfer learning. For example, in Ref. [57], a semi-supervised learning technique with a generative model was applied to reduce the number of simulations required. In Ref. [167], transfer learning was implemented to improve the efficacy of deep neural networks for electromagnetic metamaterials. Because these studies are very recent, there is high potential to reduce the computational cost further as AI algorithms develop.

Another concern is that the power of deep learning is limited in nanophotonics: while it can successfully inverse design devices at a one-time computational cost, the performance of the generated devices cannot exceed that of the designs in the training dataset and can only mimic it. To inverse design nanophotonic devices with even higher performance, an additional optimization process is required, which diminishes the advantages of deep learning methods.

In the future, deep learning methods can be further developed to improve the inverse design of nanophotonic devices. First, models such as physics-informed neural networks can reduce the time cost of electromagnetic simulations, allowing deep learning methods to design nanophotonic devices much faster. Second, advances in learning algorithms, such as transfer learning and continuous learning, and in model architectures, such as vision transformers, can enhance inverse design capabilities, paving the way for designing high-performance nanophotonic devices.

6.3 Inverse design in other research fields

While inverse design has been predominantly developed for applications in nanophotonics and optics, it has increasingly demonstrated its versatility and effectiveness across a range of scientific domains. Its scope has broadened to include areas such as advanced photonic system design [168], metastructure engineering within the microwave spectrum [169], sophisticated mechanical metamaterial design for tailored properties [170], and the search of materials with target functionalities [171]. These advancements in related research fields not only highlight the inverse design method’s exceptional adaptability, but also offer valuable insights and methodologies, inspiring innovative solutions and transformative breakthroughs in nanophotonic and optical design.

7 Conclusions

In this review, we summarized the latest advancements in inverse design techniques applied to nanophotonics, which merges nanotechnology and photonics. The first section covers the inverse design algorithms utilized in the design of nanophotonic devices, discussing these methods in detail along with their theoretical backgrounds and representative examples. The second section presents a range of inverse-designed nanophotonic devices, categorized by their functionalities, showcasing the versatility of inverse design in creating diverse and high-performance devices. We also highlight commercial and open-source tools that facilitate the inverse design process, as well as commercial foundries capable of fabricating these advanced devices. Finally, we explore the challenges that remain in inverse design methods, identifying key areas for future innovation.

Despite the achievements in nanophotonic devices obtained through various inverse design methods, there is still significant potential for further advancements. Current limitations in computational resources, fabrication feasibility, and design flexibility continue to pose challenges, but they also provide fertile ground for future exploration. Enhancing the efficiency of optimization algorithms, improving the accuracy of machine learning models for design prediction, and advancing nanofabrication technologies will be key factors in pushing the boundaries of what can be achieved with inverse design in nanophotonics. Additionally, as nanophotonic devices are increasingly adopted in real-world applications, new requirements and constraints will emerge. For example, designing devices that are robust against fabrication tolerances and environmental variations, and that are suitable for large-scale integration, will become increasingly critical. Moreover, expanding the scope of inverse design to account for multi-physics optimizations, such as thermal and mechanical considerations, will allow for the development of more versatile and reliable devices. Overall, while the field of inverse-designed nanophotonics has already demonstrated remarkable potential, the road ahead is filled with opportunities for innovation. The future of nanophotonics, driven by intelligent design integrating inverse design and deep learning methods, holds the promise of transforming the way we manipulate and utilize light at the nanoscale.

Corresponding author: Hamza Kurt, The School of Electrical Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Republic of Korea, E-mail: hamzakurt@kaist.ac.kr

Junhyeong Kim and Jae-Yong Kim contributed equally to this work.

Funding source: Ministry of Education

Award Identifier / Grant number: 4120200113769

Funding source: National Research Foundation of Korea

Award Identifier / Grant number: NRF-2022R1A2C1009773

Award Identifier / Grant number: RS-2024-00439005

Research funding: This work was supported by the National Research Foundation of Korea grant funded by the Korea government (MSIT) (RS-2024-00439005); National Research Foundation of Korea (NRF-2022R1A2C1009773); Ministry of Education, Republic of Korea (BK21 Four, No. 4120200113769); KAIST UP program.
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
Conflict of interest: Authors state no conflicts of interest.
Data availability: The datasets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

[1] C. Haffner, et al.., “All-plasmonic Mach–Zehnder modulator enabling optical high-speed communication at the microscale,” Nat. Photonics, vol. 9, no. 8, pp. 525–528, 2015. https://doi.org/10.1038/nphoton.2015.127.Search in Google Scholar

[2] Y. Li, Y. Zhang, L. Zhang, and A. W. Poon, “Silicon and hybrid silicon photonic devices for intra-datacenter applications: state of the art and perspectives,” Photonics Res., vol. 3, no. 5, pp. B10–B27, 2015. https://doi.org/10.1364/PRJ.3.000B10.Search in Google Scholar

[3] R. Blum, “Integrated silicon photonics for high-volume data center applications,” Opt. Interconnects, vol. 11286, pp. 141–149, 2020, https://doi.org/10.1117/12.2550326.Search in Google Scholar

[4] B. J. Shastri, et al.., “Photonics for artificial intelligence and neuromorphic computing,” Nat. Photonics, vol. 15, no. 2, pp. 102–114, 2021. https://doi.org/10.1038/s41566-020-00754-y.Search in Google Scholar

[5] C. Lian, C. Vagionas, T. Alexoudi, N. Pleros, N. Youngblood, and C. Ríos, “Photonic (computational) memories: tunable nanophotonics for data storage and computing,” Nanophotonics, vol. 11, no. 17, pp. 3823–3854, 2022. https://doi.org/10.1515/nanoph-2022-0089.Search in Google Scholar PubMed PubMed Central

[6] X.-Y. Xu and X.-M. Jin, “Integrated photonic computing beyond the von Neumann architecture,” ACS Photonics, vol. 10, no. 4, pp. 1027–1036, 2023. https://doi.org/10.1021/acsphotonics.2c01543.Search in Google Scholar

[7] N. Maring, et al.., “A versatile single-photon-based quantum computing platform,” Nat. Photonics, vol. 18, no. 6, pp. 603–609, 2024. https://doi.org/10.1038/s41566-024-01403-4.Search in Google Scholar

[8] E. C. Garnett, B. Ehrler, A. Polman, and E. Alarcon-Llado, “Photonics for photovoltaics: advances and opportunities,” ACS Photonics, vol. 8, no. 1, pp. 61–70, 2020. https://doi.org/10.1021/acsphotonics.0c01045.Search in Google Scholar PubMed PubMed Central

[9] C. Chen, et al.., “Zero-energy switchable radiative cooler for enhanced building energy efficiency,” J. Photonics Energy, vol. 14, no. 2, p. 028501, 2024. https://doi.org/10.1117/1.JPE.14.028501.Search in Google Scholar

[10] P. Cheng, Y. An, A. K. Y. Jen, and D. Lei, “New nanophotonics approaches for enhancing the efficiency and stability of perovskite solar cells,” Adv. Mater., vol. 36, no. 17, p. 2309459, 2024. https://doi.org/10.1002/adma.202309459.Search in Google Scholar PubMed

[11] G.-H. Lee, et al.., “Multifunctional materials for implantable and wearable photonic healthcare devices,” Nat. Rev. Mater., vol. 5, no. 2, pp. 149–165, 2020. https://doi.org/10.1038/s41578-019-0167-3.Search in Google Scholar PubMed PubMed Central

[12] S. Zhang, et al.., “Metasurfaces for biomedical applications: imaging and sensing from a nanophotonics perspective,” Nanophotonics, vol. 10, no. 1, pp. 259–293, 2020. https://doi.org/10.1515/nanoph-2020-0373.Search in Google Scholar

[13] H. Altug, S.-H. Oh, S. A. Maier, and J. Homola, “Advances and applications of nanophotonic biosensors,” Nat. Nanotechnol., vol. 17, no. 1, pp. 5–16, 2022. https://doi.org/10.1038/s41565-021-01045-5.Search in Google Scholar PubMed

[14] A. Barulin, D. D. Nguyen, Y. Kim, C. Ko, and I. Kim, “Metasurfaces for quantitative biosciences of molecules, cells, and tissues: sensing and diagnostics,” ACS Photonics, vol. 11, no. 3, pp. 904–916, 2024. https://doi.org/10.1021/acsphotonics.3c01576.Search in Google Scholar

[15] X. Miao, L. Yan, Y. Wu, and P. Q. Liu, “High-sensitivity nanophotonic sensors with passive trapping of analyte molecules in hot spots,” Light Sci. Appl., vol. 10, no. 1, p. 5, 2021. https://doi.org/10.1038/s41377-020-00449-7.Search in Google Scholar PubMed PubMed Central

[16] J. Xavier, D. Yu, C. Jones, E. Zossimova, and F. Vollmer, “Quantum nanophotonic and nanoplasmonic sensing: towards quantum optical bioscience laboratories on chip,” Nanophotonics, vol. 10, no. 5, pp. 1387–1435, 2021. https://doi.org/10.1515/nanoph-2020-0593.Search in Google Scholar

[17] E. Mohammadi, K. Tsakmakidis, A. N. Askarpour, P. Dehkhoda, A. Tavakoli, and H. Altug, “Nanophotonic platforms for enhanced chiral sensing,” ACS Photonics, vol. 5, no. 7, pp. 2669–2675, 2018. https://doi.org/10.1021/acsphotonics.8b00270.Search in Google Scholar

[18] A. Håkansson and J. Sánchez-Dehesa, “Inverse designed photonic crystal de-multiplex waveguide coupler,” Opt. Express, vol. 13, no. 14, pp. 5440–5449, 2005. https://doi.org/10.1364/OPEX.13.005440.Search in Google Scholar PubMed

[19] P. I. Borel, et al.., “Imprinted silicon-based nanophotonics,” Opt. Express, vol. 15, no. 3, pp. 1261–1266, 2007, https://doi.org/10.1364/oe.15.001261.Search in Google Scholar PubMed

[20] S. So, T. Badloe, J. Noh, J. Bravo-Abad, and J. Rho, “Deep learning enabled inverse design in nanophotonics,” Nanophotonics, vol. 9, no. 5, pp. 1041–1057, 2020. https://doi.org/10.1515/nanoph-2019-0474.Search in Google Scholar

[21] P. R. Wiecha, A. Arbouet, C. Girard, and O. L. Muskens, “Deep learning in nano-photonics: inverse design and beyond,” Photonics Res., vol. 9, no. 5, pp. B182–B200, 2021. https://doi.org/10.1364/PRJ.415960.Search in Google Scholar

[22] Q. Wang, M. Makarenko, A. Burguete Lopez, F. Getman, and A. Fratalocchi, “Advancing statistical learning and artificial intelligence in nanophotonics inverse design,” Nanophotonics, vol. 11, no. 11, pp. 2483–2505, 2022. https://doi.org/10.1515/nanoph-2021-0660.Search in Google Scholar PubMed PubMed Central

[23] Y. Sebbag, E. Talker, A. Naiman, Y. Barash, and U. Levy, “Demonstration of an integrated nanophotonic chip-scale alkali vapor magnetometer using inverse design,” Light Sci. Appl., vol. 10, no. 1, p. 54, 2021. https://doi.org/10.1038/s41377-021-00499-5.Search in Google Scholar PubMed PubMed Central

[24] J. Kim, et al.., “Inverse design of an on-chip optical response predictor enabled by a deep neural network,” Opt. Express, vol. 31, no. 2, pp. 2049–2060, 2023. https://doi.org/10.1364/OE.480644.Search in Google Scholar PubMed

[25] T. Lin, et al.., “Design of mechanically-tunable photonic crystal split-beam nanocavity,” Opt. Lett., vol. 40, no. 15, pp. 3504–3507, 2015. https://doi.org/10.1364/OL.40.003504.Search in Google Scholar PubMed

[26] J. H. Holland, Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence, Cambridge, MIT Press, 1992.10.7551/mitpress/1090.001.0001Search in Google Scholar

[27] R. Storn and K. Price, “Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces,” J. Glob. Opt., vol. 11, pp. 341–359, 1997. https://doi.org/10.1023/A:1008202821328.10.1023/A:1008202821328Search in Google Scholar

[28] S. Xiao, et al.., “Inverse design of a near-infrared metalens with an extended depth of focus based on double-process genetic algorithm optimization,” Opt. Express, vol. 31, no. 5, pp. 8668–8681, 2023. https://doi.org/10.1364/OE.484471.Search in Google Scholar PubMed

[29] Y. Wang, G. Wu, J. Zhang, X. Wu, G. Yuan, and J. Liu, “Genetic algorithm-enhanced design of ultra-broadband tunable terahertz metasurface absorber,” Opt. Laser Technol., vol. 170, p. 110262, 2024, https://doi.org/10.1016/j.optlastec.2023.110262.Search in Google Scholar

[30] R. Hernandez, et al.., “Directional silicon nano-antennas for quantum emitter control designed by evolutionary optimization,” J. Opt. Soc. Am. B, vol. 41, no. 2, pp. A108–A115, 2024. https://doi.org/10.1364/JOSAB.506085.Search in Google Scholar

[31] Z. Jin, et al.., “Complex inverse design of meta-optics by segmented hierarchical evolutionary algorithm,” ACS Nano, vol. 13, no. 1, pp. 821–829, 2019. https://doi.org/10.1021/acsnano.8b08333.Search in Google Scholar PubMed

[32] J. Kennedy and R. Eberhart, “Particle swarm optimization,” in Proceedings of ICNN’95-International Conference on Neural Networks, vol. 4, IEEE, 1995, pp. 1942–1948.10.1109/ICNN.1995.488968Search in Google Scholar

[33] M. Dorigo, “Optimization, learning and natural algorithms,” Ph. D. thesis, Politecnico di Milano, 1992.Search in Google Scholar

[34] C.-Y. Lee, Y. Liu, Y. Cheng, C. Lao, and Q.-F. Yang, “Inverse design of coherent supercontinuum generation using free-form nanophotonic waveguides,” APL Photonics, vol. 9, no. 6, 2024, https://doi.org/10.1063/5.0196434.Search in Google Scholar

[35] W. Chen, et al.., “Ultra-compact and low-loss silicon polarization beam splitter using a particle-swarm-optimized counter-tapered coupler,” Opt. Express, vol. 28, no. 21, pp. 30701–30709, 2020. https://doi.org/10.1364/OE.408432.Search in Google Scholar PubMed

[36] X. Guo, et al.., “Design of broadband omnidirectional antireflection coatings using ant colony algorithm,” Opt. Express, vol. 22, no. 104, pp. A1137–A1144, 2014. https://doi.org/10.1364/OE.22.0A1137.Search in Google Scholar PubMed

[37] T. W. Hughes, M. Minkov, I. A. Williamson, and S. Fan, “Adjoint method and inverse design for nonlinear nanophotonic devices,” ACS Photonics, vol. 5, no. 12, pp. 4781–4787, 2018. https://doi.org/10.1021/acsphotonics.8b01522.Search in Google Scholar

[38] M. H. Bakr, O. S. Ahmed, M. H. El Sherif, and T. Nomura, “Time domain adjoint sensitivity analysis of electromagnetic problems with nonlinear media,” Opt. Express, vol. 22, no. 9, pp. 10831–10843, 2014. https://doi.org/10.1364/OE.22.010831.Search in Google Scholar PubMed

[39] J. Gedeon, E. Hassan, and A. Calà Lesina, “Time-domain topology optimization of arbitrary dispersive materials for broadband 3d nanophotonics inverse design,” ACS Photonics, vol. 10, no. 11, pp. 3875–3887, 2023. https://doi.org/10.1021/acsphotonics.3c00572.Search in Google Scholar

[40] E. Khoram, et al.., “Nanophotonic media for artificial neural inference,” Photonics Res., vol. 7, no. 8, pp. 823–827, 2019. https://doi.org/10.1364/PRJ.7.000823.Search in Google Scholar

[41] C. M. Cisowski, M. C. Waller, and R. Bennett, “Toward nanophotonic optical isolation via inverse design of energy transfer in nonreciprocal media,” Phys. Rev. A, vol. 109, no. 4, p. 043533, 2024. https://doi.org/10.1103/PhysRevA.109.043533.Search in Google Scholar

[42] T. W. Hughes, M. Minkov, Y. Shi, and S. Fan, “Training of photonic neural networks through in situ backpropagation and gradient measurement,” Optica, vol. 5, no. 7, pp. 864–871, 2018. https://doi.org/10.1364/OPTICA.5.000864.Search in Google Scholar

[43] J. Carolan, et al.., “Universal linear optics,” Science, vol. 349, no. 6249, pp. 711–716, 2015. https://doi.org/10.1126/science.aab3642.Search in Google Scholar PubMed

[44] V. Nikkhah, A. Pirmoradi, F. Ashtiani, B. Edwards, F. Aflatouni, and N. Engheta, “Inverse-designed low-index-contrast structures on a silicon photonics platform for vector–matrix multiplication,” Nat. Photonics, pp. 1–8, 2024, https://doi.org/10.1038/s41566-024-01394-2.Search in Google Scholar

[45] T. Wu, M. Menarini, Z. Gao, and L. Feng, “Lithography-free reconfigurable integrated photonic processor,” Nat. Photonics, vol. 17, no. 8, pp. 710–716, 2023. https://doi.org/10.1038/s41566-023-01205-0.Search in Google Scholar

[46] W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photonics, vol. 15, no. 2, pp. 77–90, 2021. https://doi.org/10.1038/s41566-020-0685-y.Search in Google Scholar

[47] J. Jiang, M. Chen, and J. A. Fan, “Deep neural networks for the evaluation and design of photonic devices,” Nat. Rev. Mater., vol. 6, no. 8, pp. 679–700, 2021. https://doi.org/10.1038/s41578-020-00260-1.Search in Google Scholar

[48] M. Yuan, G. Yang, S. Song, L. Zhou, R. Minasian, and X. Yi, “Inverse design of a nano-photonic wavelength demultiplexer with a deep neural network approach,” Opt. Express, vol. 30, no. 15, pp. 26201–26211, 2022. https://doi.org/10.1364/OE.462038.Search in Google Scholar PubMed

[49] E. Adibnia, M. A. Mansouri-Birjandi, M. Ghadrdan, and P. Jafari, “A deep learning method for empirical spectral prediction and inverse design of all-optical nonlinear plasmonic ring resonator switches,” Sci. Rep., vol. 14, no. 1, p. 5787, 2024. https://doi.org/10.1038/s41598-024-56522-3.Search in Google Scholar PubMed PubMed Central

[50] T. Jahan, et al.., “Deep learning-driven forward and inverse design of nanophotonic nanohole arrays: streamlining design for tailored optical functionalities and enhancing accessibility,” Nanoscale, vol. 16, no. 35, pp. 16641–16651, 2024. https://doi.org/10.1039/D4NR03081H.Search in Google Scholar

[51] D. P. Kingma, “Auto-encoding variational bayes,” arXiv preprint arXiv:1312.6114, 2013, https://doi.org/10.48550/arXiv.1312.6114.Search in Google Scholar

[52] I. Goodfellow, et al.., “Generative adversarial nets,” Adv. Neural Inf. Process. Syst., vol. 27, 2014.Search in Google Scholar

[53] J. Sohl-Dickstein, E. Weiss, N. Maheswaranathan, and S. Ganguli, “Deep unsupervised learning using nonequilibrium thermodynamics,” in International Conference on Machine Learning, PMLR, 2015, pp. 2256–2265.Search in Google Scholar

[54] S. So and J. Rho, “Designing nanophotonic structures using conditional deep convolutional generative adversarial networks,” Nanophotonics, vol. 8, no. 7, pp. 1255–1261, 2019. https://doi.org/10.1515/nanoph-2019-0117.Search in Google Scholar

[55] Y. Tang, et al.., “Generative deep learning model for inverse design of integrated nanophotonic devices,” Laser Photonics Rev., vol. 14, no. 12, p. 2000287, 2020. https://doi.org/10.1002/lpor.202000287.Search in Google Scholar

[56] Z. Zhang, C. Yang, Y. Qin, H. Feng, J. Feng, and H. Li, “Diffusion probabilistic model based accurate and high-degree-of-freedom metasurface inverse design,” Nanophotonics, vol. 12, no. 20, pp. 3871–3881, 2023. https://doi.org/10.1515/nanoph-2023-0292.Search in Google Scholar PubMed PubMed Central

[57] J. Kim, et al.., “Semi-supervised learning leveraging denoising diffusion probabilistic models for the characterization of nanophotonic devices,” Laser Photonics Rev., p. 2300998, 2024, https://doi.org/10.1002/lpor.202300998.Search in Google Scholar

[58] R. S. Sutton, “Learning to predict by the methods of temporal differences,” Mach. Learn., vol. 3, pp. 9–44, 1988, https://doi.org/10.1007/bf00115009.Search in Google Scholar

[59] I. D. Lutz, et al.., “Top-down design of protein architectures with reinforcement learning,” Science, vol. 380, no. 6642, pp. 266–273, 2023. https://doi.org/10.2210/pdb8f4x/pdb.Search in Google Scholar

[60] C.-Y. Yang, C. Shiranthika, C.-Y. Wang, K.-W. Chen, and S. Sumathipala, “Reinforcement learning strategies in cancer chemotherapy treatments: a review,” Comput. Methods Programs Biomed., vol. 229, p. 107280, 2023, https://doi.org/10.1016/j.cmpb.2022.107280.Search in Google Scholar PubMed

[61] C. Li, P. Zheng, Y. Yin, B. Wang, and L. Wang, “Deep reinforcement learning in smart manufacturing: a review and prospects,” CIRP J. Manuf. Sci. Technol., vol. 40, pp. 75–101, 2023, https://doi.org/10.1016/j.cirpj.2022.11.003.Search in Google Scholar

[62] H. Hong, W. Kim, W. Kim, J.-M. Jeong, S. Kim, and S. S. Kim, “Machine learning-driven design optimization of buckling-induced quasi-zero stiffness metastructures for low-frequency vibration isolation,” ACS Appl. Mater. Interfaces, vol. 16, no. 14, pp. 17965–17972, 2024. https://doi.org/10.1021/acsami.3c18793.Search in Google Scholar PubMed PubMed Central

[63] C. Park, et al.., “Sample-efficient inverse design of freeform nanophotonic devices with physics-informed reinforcement learning,” Nanophotonics, vol. 13, no. 8, pp. 1483–1492, 2024. https://doi.org/10.1515/nanoph-2023-0852.Search in Google Scholar PubMed PubMed Central

[64] I. Sajedian, H. Lee, and J. Rho, “Double-deep Q-learning to increase the efficiency of metasurface holograms,” Sci. Rep., vol. 9, no. 1, p. 10899, 2019. https://doi.org/10.1038/s41598-019-47154-z.Search in Google Scholar PubMed PubMed Central

[65] D. Witt, J. Young, and L. Chrostowski, “Reinforcement learning for photonic component design,” APL Photonics, vol. 8, no. 10, 2023, https://doi.org/10.1063/5.0159928.Search in Google Scholar

[66] C. Yeung, B. Pham, Z. Zhang, K. T. Fountaine, and A. P. Raman, “Hybrid supervised and reinforcement learning for the design and optimization of nanophotonic structures,” Opt. Express, vol. 32, no. 6, pp. 9920–9930, 2024. https://doi.org/10.1364/OE.512159.Search in Google Scholar PubMed

[67] Z. Lu, et al.., “Broadband silicon photonic directional coupler using asymmetric-waveguide based phase control,” Opt. Express, vol. 23, no. 3, pp. 3795–3808, 2015. https://doi.org/10.1364/OE.23.003795.Search in Google Scholar PubMed

[68] Q. Yi, et al.., “Silicon MMI-based power splitter for multi-band operation at the 1.55 and 2 µm wave bands,” Opt. Lett., vol. 48, no. 5, pp. 1335–1338, 2023. https://doi.org/10.1364/OL.486428.Search in Google Scholar PubMed

[69] J. Kim, J.-Y. Kim, J. Yoon, H. Yoon, H.-H. Park, and H. Kurt, “Experimental demonstration of inverse-designed silicon integrated photonic power splitters,” Nanophotonics, vol. 11, no. 20, pp. 4581–4590, 2022. https://doi.org/10.1515/nanoph-2022-0443.Search in Google Scholar PubMed PubMed Central

[70] A. Y. Piggott, J. Petykiewicz, L. Su, and J. Vučković, “Fabrication-constrained nanophotonic inverse design,” Sci. Rep., vol. 7, no. 1, p. 1786, 2017. https://doi.org/10.1038/s41598-017-01939-2.Search in Google Scholar PubMed PubMed Central

[71] S. E. Hansen, G. Arregui, A. N. Babar, R. E. Christiansen, and S. Stobbe, “Inverse design and characterization of compact, broadband, and low-loss chip-scale photonic power splitters,” Mater. Quantum Technol., vol. 4, no. 1, p. 016201, 2024. https://doi.org/10.1088/2633-4356/ad2521.Search in Google Scholar

[72] H. Xie, et al.., “Inversely designed 1 × 4 power splitter with arbitrary ratios at 2-μm spectral band,” IEEE Photonics J., vol. 10, no. 4, pp. 1–6, 2018. https://doi.org/10.1109/jphot.2018.2863122.Search in Google Scholar

[73] J. Xu, Y. Liu, X. Guo, Q. Song, and K. Xu, “Inverse design of a dual-mode 3-dB optical power splitter with a 445 nm bandwidth,” Opt. Express, vol. 30, no. 15, pp. 26266–26274, 2022. https://doi.org/10.1364/oe.463274.Search in Google Scholar PubMed

[74] H. Ma, J. Huang, K. Zhang, and J. Yang, “Inverse-designed arbitrary-input and ultra-compact 1 × N power splitters based on high symmetric structure,” Sci. Rep., vol. 10, no. 1, p. 11757, 2020. https://doi.org/10.1038/s41598-020-68746-0.Search in Google Scholar PubMed PubMed Central

[75] J. Wen, et al.., “Inverse design of high efficiency and large bandwidth power splitter for arbitrary power ratio based on deep residual network,” Opt. Quantum Electron., vol. 56, no. 4, p. 512, 2024. https://doi.org/10.1007/s11082-023-06165-x.Search in Google Scholar

[76] S. Hong, et al.., “Inverse-designed taper configuration for the enhancement of integrated 1 × 4 silicon photonic power splitters,” Nanophotonics, vol. 13, no. 22, pp. 4127–4135, 2024. https://doi.org/10.1515/nanoph-2024-0295.Search in Google Scholar PubMed PubMed Central

[77] R. Yao, et al.., “Compact and low-insertion-loss 1 × N power splitter in silicon photonics,” J. Lightwave Technol., vol. 39, no. 19, pp. 6253–6259, 2021. https://doi.org/10.1109/jlt.2021.3098346.Search in Google Scholar

[78] Q. Xu, B. Schmidt, J. Shakya, and M. Lipson, “Cascaded silicon micro-ring modulators for WDM optical interconnection,” Opt. Express, vol. 14, no. 20, pp. 9431–9436, 2006. https://doi.org/10.1364/oe.14.009431.Search in Google Scholar PubMed

[79] X. Zheng, et al.., “A tunable 1 × 4 silicon CMOS photonic wavelength multiplexer/demultiplexer for dense optical interconnects,” Opt. Express, vol. 18, no. 5, pp. 5151–5160, 2010. https://doi.org/10.1364/oe.18.005151.Search in Google Scholar

[80] B. Naghdi and L. R. Chen, “Silicon photonic four-channel optical add-drop multiplexer enabled by subwavelength grating waveguides,” IEEE Photonics J., vol. 10, no. 4, pp. 1–10, 2018. https://doi.org/10.1109/jphot.2018.2857769.Search in Google Scholar

[81] D. Mu, et al.., “A four-channel DWDM tunable add/drop demultiplexer based on silicon waveguide Bragg gratings,” IEEE Photonics J., vol. 11, no. 1, pp. 1–8, 2019. https://doi.org/10.1109/jphot.2019.2897359.Search in Google Scholar

[82] T.-H. Yen and Y.-J. Hung, “Fabrication-tolerant CWDM (de) multiplexer based on cascaded Mach–Zehnder interferometers on silicon-on-insulator,” J. Lightwave Technol., vol. 39, no. 1, pp. 146–153, 2020, https://doi.org/10.1109/JLT.2020.3026314.Search in Google Scholar

[83] Q. Yi, et al.., “Silicon photonic flat-top WDM (de) multiplexer based on cascaded Mach-Zehnder interferometers for the 2 µm wavelength band,” Opt. Express, vol. 30, no. 15, pp. 28232–28241, 2022. https://doi.org/10.1364/oe.467473.Search in Google Scholar PubMed

[84] A. M. Taha, et al.., “Compact MMI-based AWGs in a scalable monolithic silicon photonics platform,” IEEE Photonics J., vol. 13, no. 4, pp. 1–6, 2021. https://doi.org/10.1109/jphot.2021.3099436.Search in Google Scholar

[85] X. Shen, C. Li, W. Zhao, H. Li, Y. Shi, and D. Dai, “Ultra-low-crosstalk silicon arrayed-waveguide grating (de) multiplexer with 1.6-nm channel spacing,” Laser Photonics Rev., vol. 18, no. 1, p. 2300617, 2024. https://doi.org/10.1002/lpor.202300617.Search in Google Scholar

[86] L. Su, A. Y. Piggott, N. V. Sapra, J. Petykiewicz, and J. Vuckovic, “Inverse design and demonstration of a compact on-chip narrowband three-channel wavelength demultiplexer,” ACS Photonics, vol. 5, no. 2, pp. 301–305, 2018. https://doi.org/10.1021/acsphotonics.7b00987.Search in Google Scholar

[87] J. Huang, et al.., “Implementation of on-chip multi-channel focusing wavelength demultiplexer with regularized digital metamaterials,” Nanophotonics, vol. 9, no. 1, pp. 159–166, 2020. https://doi.org/10.1515/nanoph-2019-0368.Search in Google Scholar

[88] R. Wu, F. Ding, F. Li, and Y. Liu, “Inverse-designed low-crosstalk CWDM (de) multiplexer assisted by photonic crystals,” J. Lightwave Technol., vol. 42, no. 14, pp. 4899–4905, 2024. https://doi.org/10.1109/jlt.2024.3385741.Search in Google Scholar

[89] A. Y. Piggott, J. Lu, K. G. Lagoudakis, J. Petykiewicz, T. M. Babinec, and J. Vučković, “Inverse design and demonstration of a compact and broadband on-chip wavelength demultiplexer,” Nat. Photonics, vol. 9, no. 6, pp. 374–377, 2015. https://doi.org/10.1038/nphoton.2015.69.Search in Google Scholar

[90] X. Deng, et al.., “Inverse design of a wavelength (de) multiplexer for 1.55-and 2-μm wavebands by using a hybrid analog-digital method,” J. Lightwave Technol., vol. 42, no. 15, pp. 5231–5240, 2024. https://doi.org/10.1109/jlt.2024.3386668.Search in Google Scholar

[91] X. Chen, C. Li, and H. K. Tsang, “Fabrication-tolerant waveguide chirped grating coupler for coupling to a perfectly vertical optical fiber,” IEEE Photonics Technol. Lett., vol. 20, no. 23, pp. 1914–1916, 2008. https://doi.org/10.1109/lpt.2008.2004715.Search in Google Scholar

[92] L. Cheng, S. Mao, X. Tu, and H. Fu, “Dual-wavelength-band grating coupler on 220-nm silicon-on-insulator with high numerical aperture fiber placed perfectly vertically,” J. Lightwave Technol., vol. 39, no. 18, pp. 5902–5909, 2021. https://doi.org/10.1109/jlt.2021.3090172.Search in Google Scholar

[93] N. Hoppe, et al.., “Ultra-efficient silicon-on-insulator grating couplers with backside metal mirrors,” IEEE J. Sel. Top. Quantum Electron., vol. 26, no. 2, pp. 1–6, 2019. https://doi.org/10.1109/jstqe.2019.2935296.Search in Google Scholar

[94] M. Dai, L. Ma, Y. Xu, M. Lu, X. Liu, and Y. Chen, “Highly efficient and perfectly vertical chip-to-fiber dual-layer grating coupler,” Opt. Express, vol. 23, no. 2, pp. 1691–1698, 2015. https://doi.org/10.1364/oe.23.001691.Search in Google Scholar

[95] Z. Zhao and S. Fan, “Design principles of apodized grating couplers,” J. Lightwave Technol., vol. 38, no. 16, pp. 4435–4446, 2020. https://doi.org/10.1109/jlt.2020.2992574.Search in Google Scholar

[96] D. Taillaert, P. Bienstman, and R. Baets, “Compact efficient broadband grating coupler for silicon-on-insulator waveguides,” Opt. Lett., vol. 29, no. 23, pp. 2749–2751, 2004. https://doi.org/10.1364/OL.29.002749.Search in Google Scholar

[97] A. Bozzola, L. Carroll, D. Gerace, I. Cristiani, and L. C. Andreani, “Optimising apodized grating couplers in a pure SOI platform to −0.5 dB coupling efficiency,” Opt. Express, vol. 23, no. 12, pp. 16289–16304, 2015. https://doi.org/10.1364/OE.23.016289.Search in Google Scholar PubMed

[98] M. Yang, et al.., “High-performance grating couplers on 220-nm thick silicon by inverse design for perfectly vertical coupling,” Sci. Rep., vol. 13, no. 1, p. 18112, 2023. https://doi.org/10.1038/s41598-023-45168-2.Search in Google Scholar PubMed PubMed Central

[99] J. Yoon, et al.., “Inverse design of a Si-based high-performance vertical-emitting meta-grating coupler on a 220 nm silicon-on-insulator platform,” Photonics Res., vol. 11, no. 6, pp. 897–905, 2023. https://doi.org/10.1364/prj.473978.Search in Google Scholar

[100] A. M. Hammond, J. B. Slaby, M. J. Probst, and S. E. Ralph, “Multi-layer inverse design of vertical grating couplers for high-density, commercial foundry interconnects,” Opt. Express, vol. 30, no. 17, pp. 31058–31072, 2022. https://doi.org/10.1364/OE.466015.Search in Google Scholar PubMed

[101] N. V. Sapra, et al.., “Inverse design and demonstration of broadband grating couplers,” IEEE J. Sel. Top. Quantum Electron., vol. 25, no. 3, pp. 1–7, 2019. https://doi.org/10.1109/jstqe.2019.2891402.Search in Google Scholar

[102] A. Michaels and E. Yablonovitch, “Inverse design of near unity efficiency perfectly vertical grating couplers,” Opt. Express, vol. 26, no. 4, pp. 4766–4779, 2018. https://doi.org/10.1364/OE.26.004766.Search in Google Scholar PubMed

[103] X. Tu, et al.., “Analysis of deep neural network models for inverse design of silicon photonic grating coupler,” J. Lightwave Technol., vol. 39, no. 9, pp. 2790–2799, 2021. https://doi.org/10.1109/jlt.2021.3057473.Search in Google Scholar

[104] S. Irfan, J.-Y. Kim, and H. Kurt, “Ultra-compact and efficient photonic waveguide bends with different configurations designed by topology optimization,” Sci. Rep., vol. 14, no. 1, p. 6453, 2024. https://doi.org/10.1038/s41598-024-53881-9.Search in Google Scholar PubMed PubMed Central

[105] C. Shang, et al.., “Inverse-designed lithium niobate nanophotonics,” ACS Photonics, vol. 10, no. 4, pp. 1019–1026, 2023. https://doi.org/10.1021/acsphotonics.3c00040.Search in Google Scholar

[106] J. Yang, M. A. Guidry, D. M. Lukin, K. Yang, and J. Vučković, “Inverse-designed silicon carbide quantum and nonlinear photonics,” Light Sci. Appl., vol. 12, no. 1, p. 201, 2023. https://doi.org/10.1038/s41377-023-01253-9.Search in Google Scholar PubMed PubMed Central

[107] Y. A. Vlasov and S. J. McNab, “Losses in single-mode silicon-on-insulator strip waveguides and bends,” Opt. Express, vol. 12, no. 8, pp. 1622–1631, 2004. https://doi.org/10.1364/OPEX.12.001622.Search in Google Scholar

[108] M. Bahadori, M. Nikdast, Q. Cheng, and K. Bergman, “Universal design of waveguide bends in silicon-on-insulator photonics platform,” J. Lightwave Technol., vol. 37, no. 13, pp. 3044–3054, 2019. https://doi.org/10.1109/jlt.2019.2909983.Search in Google Scholar

[109] E. Zhang, S. Yang, and L. Zhang, “General waveguide bend design based on cubic spline interpolation and inverse design,” J. Lightwave Technol., vol. 42, no. 13, pp. 4614–4625, 2024. https://doi.org/10.1109/jlt.2024.3370675.Search in Google Scholar

[110] H. Chung, J. Park, and S. V. Boriskina, “Inverse-designed waveguide-based biosensor for high-sensitivity, single-frequency detection of biomolecules,” Nanophotonics, vol. 11, no. 7, pp. 1427–1442, 2022. https://doi.org/10.1515/nanoph-2022-0012.Search in Google Scholar PubMed PubMed Central

[111] H. Sun, et al.., “Broadband and broad-angle polarization-independent metasurface for radar cross section reduction,” Sci. Rep., vol. 7, no. 1, p. 40782, 2017. https://doi.org/10.1038/srep40782.Search in Google Scholar PubMed PubMed Central

[112] Y. Fan, et al.., “Phase-controlled metasurface design via optimized genetic algorithm,” Nanophotonics, vol. 9, no. 12, pp. 3931–3939, 2020. https://doi.org/10.1515/nanoph-2020-0132.Search in Google Scholar

[113] M.-J. Haji-Ahmadi, V. Nayyeri, M. Soleimani, and O. M. Ramahi, “Pixelated checkerboard metasurface for ultra-wideband radar cross section reduction,” Sci. Rep., vol. 7, no. 1, p. 11437, 2017. https://doi.org/10.1038/s41598-017-11714-y.Search in Google Scholar PubMed PubMed Central

[114] S. So, et al.., “Multicolor and 3D holography generated by inverse-designed single-cell metasurfaces,” Adv. Mater., vol. 35, no. 17, p. 2208520, 2023. https://doi.org/10.1002/adma.202208520.Search in Google Scholar PubMed

[115] G. Jing, et al., “Neural network-based surrogate model for inverse design of metasurfaces,” Photonics Res., vol. 10, no. 6, pp. 1462–1471, 2022. https://doi.org/10.1364/prj.450564.

[116] L. Gao, X. Li, D. Liu, L. Wang, and Z. Yu, “A bidirectional deep neural network for accurate silicon color design,” Adv. Mater., vol. 31, no. 51, p. 1905467, 2019. https://doi.org/10.1002/adma.201905467.

[117] O. D. Miller, Photonic Design: From Fundamental Solar Cell Physics to Computational Inverse Design, Berkeley, University of California, 2012.

[118] H. Chung and O. D. Miller, “High-NA achromatic metalenses by inverse design,” Opt. Express, vol. 28, no. 5, pp. 6945–6965, 2020. https://doi.org/10.1364/oe.385440.

[119] H. Chung and O. D. Miller, “Tunable metasurface inverse design for 80% switching efficiencies and 144° angular deflection,” ACS Photonics, vol. 7, no. 8, pp. 2236–2243, 2020. https://doi.org/10.1021/acsphotonics.0c00787.

[120] M. Mansouree, H. Kwon, E. Arbabi, A. McClung, A. Faraon, and A. Arbabi, “Multifunctional 2.5D metastructures enabled by adjoint optimization,” Optica, vol. 7, no. 1, pp. 77–84, 2020. https://doi.org/10.1364/optica.374787.

[121] P. R. Wiecha and O. L. Muskens, “Deep learning meets nanophotonics: a generalized accurate predictor for near fields and far fields of arbitrary 3D nanostructures,” Nano Lett., vol. 20, no. 1, pp. 329–338, 2019. https://doi.org/10.1021/acs.nanolett.9b03971.

[122] I. Tanriover, D. Lee, W. Chen, and K. Aydin, “Deep generative modeling and inverse design of manufacturable free-form dielectric metasurfaces,” ACS Photonics, vol. 10, no. 4, pp. 875–883, 2022. https://doi.org/10.1021/acsphotonics.2c01006.

[123] M. Zhou, et al., “Inverse design of metasurfaces based on coupled-mode theory and adjoint optimization,” ACS Photonics, vol. 8, no. 8, pp. 2265–2273, 2021. https://doi.org/10.1021/acsphotonics.1c00100.

[124] Z. Wu, X. Huang, N. Yu, and Z. Yu, “Inverse design of a dielectric metasurface by the spatial coupled mode theory,” ACS Photonics, vol. 11, no. 8, pp. 3019–3025, 2024. https://doi.org/10.1021/acsphotonics.4c00171.

[125] Z. Liu, D. Zhu, S. P. Rodrigues, K.-T. Lee, and W. Cai, “Generative model for the inverse design of metasurfaces,” Nano Lett., vol. 18, no. 10, pp. 6570–6576, 2018. https://doi.org/10.1021/acs.nanolett.8b03171.

[126] R. Lin, Y. Zhai, C. Xiong, and X. Li, “Inverse design of plasmonic metasurfaces by convolutional neural network,” Opt. Lett., vol. 45, no. 6, pp. 1362–1365, 2020. https://doi.org/10.1364/ol.387404.

[127] J. Jiang and J. A. Fan, “Global optimization of dielectric metasurfaces using a physics-driven neural network,” Nano Lett., vol. 19, no. 8, pp. 5366–5372, 2019. https://doi.org/10.1021/acs.nanolett.9b01857.

[128] T. Chang, et al., “Universal metasurfaces for complete linear control of coherent light transmission,” Adv. Mater., vol. 34, no. 44, p. 2204085, 2022. https://doi.org/10.1002/adma.202204085.

[129] G. Zheng, H. Mühlenbernd, M. Kenney, G. Li, T. Zentgraf, and S. Zhang, “Metasurface holograms reaching 80% efficiency,” Nat. Nanotechnol., vol. 10, no. 4, pp. 308–312, 2015. https://doi.org/10.1038/nnano.2015.2.

[130] G.-Y. Lee, et al., “Metasurface eyepiece for augmented reality,” Nat. Commun., vol. 9, no. 1, pp. 1–10, 2018. https://doi.org/10.1038/s41467-018-07011-5.

[131] A. M. Hammond, A. Oskooi, M. Chen, Z. Lin, S. G. Johnson, and S. E. Ralph, “High-performance hybrid time/frequency-domain topology optimization for large-scale photonics inverse design,” Opt. Express, vol. 30, no. 3, pp. 4467–4491, 2022. https://doi.org/10.1364/oe.442074.

[132] M. Minkov, P. Sun, B. Lee, Z. Yu, and S. Fan, “GPU-accelerated photonic simulations,” Opt. Photonics News, vol. 35, no. 9, pp. 44–50, 2024. https://doi.org/10.1364/opn.35.9.000044.

[133] A. F. Oskooi, D. Roundy, M. Ibanescu, P. Bermel, J. D. Joannopoulos, and S. G. Johnson, “MEEP: a flexible free-software package for electromagnetic simulations by the FDTD method,” Comput. Phys. Commun., vol. 181, no. 3, pp. 687–702, 2010. https://doi.org/10.1016/j.cpc.2009.11.008.

[134] Flexcompute Inc., “Tidy3D Simulation Platform,” https://flexcompute.com/tidy3d [accessed: Jan. 13, 2025].

[135] HIPS, “Autograd: Efficiently computes derivatives of NumPy code,” https://github.com/HIPS/autograd [accessed: Jan. 13, 2025].

[136] Google, “JAX: Composable transformations of Python+NumPy programs,” https://github.com/google/jax [accessed: Jan. 13, 2025].

[137] PyGAD, “PyGAD - Python Genetic Algorithm!,” https://pygad.readthedocs.io [accessed: Jan. 13, 2025].

[138] PySwarms, “Welcome to PySwarms’s documentation!,” https://pyswarms.readthedocs.io [accessed: Jan. 13, 2025].

[139] Ansys Inc., “Lumerical,” https://www.lumerical.com [accessed: Jan. 13, 2025].

[140] Ansys Inc., “Getting Started with lumopt - Python API,” https://optics.ansys.com/hc/en-us/articles/360050995394-Getting-Started-with-lumopt-Python-API [accessed: Jan. 13, 2025].

[141] SPLayout, “Getting Started — SPLayout 0.5.14 documentation,” https://splayout.readthedocs.io [accessed: Jan. 13, 2025].

[142] COMSOL, “COMSOL - Software for Multiphysics Simulation,” https://www.comsol.com [accessed: Jan. 13, 2025].

[143] P. Hansen and L. Hesselink, “Accurate adjoint design sensitivities for nano metal optics,” Opt. Express, vol. 23, no. 18, pp. 23899–23923, 2015. https://doi.org/10.1364/oe.23.023899.

[144] E. Briones, et al., “Particle swarm optimization of nanoantenna-based infrared detectors,” Opt. Express, vol. 26, no. 22, pp. 28484–28496, 2018. https://doi.org/10.1364/oe.26.028484.

[145] R. E. Christiansen and O. Sigmund, “Compact 200 line MATLAB code for inverse design in photonics by topology optimization: tutorial,” J. Opt. Soc. Am. B, vol. 38, no. 2, pp. 510–520, 2021. https://doi.org/10.1364/josab.405955.

[146] J. Jiang, D. Sell, S. Hoyer, J. Hickey, J. Yang, and J. A. Fan, “Free-form diffractive metagrating design based on generative adversarial networks,” ACS Nano, vol. 13, no. 8, pp. 8872–8878, 2019. https://doi.org/10.1021/acsnano.9b02371.

[147] J. P. Hugonin and P. Lalanne, “Reticolo software for grating analysis,” arXiv preprint arXiv:2101.00901, 2021.

[148] J. Kim, S. Park, S. Yu, and N. Park, “Machine-engineered active disorder for digital photonics,” Adv. Opt. Mater., vol. 10, no. 7, p. 2102642, 2022. https://doi.org/10.1002/adom.202102642.

[149] S. Oh, et al., “Control of localization and optical properties with deep-subwavelength engineered disorder,” Opt. Express, vol. 30, no. 16, pp. 28301–28311, 2022. https://doi.org/10.1364/oe.461766.

[150] Y. Huang, Z. Zhen, Y. Shen, C. Min, and G. Veronis, “Optimization of photonic nanojets generated by multilayer microcylinders with a genetic algorithm,” Opt. Express, vol. 27, no. 2, pp. 1310–1325, 2019. https://doi.org/10.1364/oe.27.001310.

[151] Y. Zhu, Y. Chen, S. Gorsky, T. Shubitidze, and L. Dal Negro, “Inverse design of functional photonic patches by adjoint optimization coupled to the generalized Mie theory,” J. Opt. Soc. Am. B, vol. 40, no. 7, pp. 1857–1874, 2023. https://doi.org/10.1364/josab.491882.

[152] G. B. Hoffman, et al., “Improved broadband performance of an adjoint shape optimized waveguide crossing using a Levenberg-Marquardt update,” Opt. Express, vol. 27, no. 17, pp. 24765–24780, 2019. https://doi.org/10.1364/oe.27.024765.

[153] J. Liao, D. Huang, Y. Lu, Y. Li, and Y. Tian, “Low-loss and compact arbitrary-order silicon mode converter based on hybrid shape optimization,” Nanophotonics, vol. 13, no. 22, pp. 4137–4148, 2024. https://doi.org/10.1515/nanoph-2024-0301.

[154] T. Van Vaerenbergh, et al., “Wafer-level testing of inverse-designed and adjoint-inspired vertical grating coupler designs compatible with DUV lithography,” Opt. Express, vol. 29, no. 23, pp. 37021–37036, 2021. https://doi.org/10.1364/oe.433744.

[155] T. Van Vaerenbergh, et al., “Wafer-level testing of inverse-designed and adjoint-inspired dual layer Si-SiN vertical grating couplers,” J. Phys. Photonics, vol. 4, no. 4, p. 044001, 2022. https://doi.org/10.1088/2515-7647/ac943c.

[156] C. Sideris, A. Khachaturian, A. D. White, O. P. Bruno, and A. Hajimiri, “Foundry-fabricated grating coupler demultiplexer inverse-designed via fast integral methods,” Commun. Phys., vol. 5, no. 1, p. 68, 2022. https://doi.org/10.1038/s42005-022-00839-w.

[157] A. Y. Piggott, et al., “Inverse-designed photonics for semiconductor foundries,” ACS Photonics, vol. 7, no. 3, pp. 569–575, 2020. https://doi.org/10.1021/acsphotonics.9b01540.

[158] D. Vercruysse, N. V. Sapra, L. Su, R. Trivedi, and J. Vučković, “Analytical level set fabrication constraints for inverse design,” Sci. Rep., vol. 9, no. 1, p. 8999, 2019. https://doi.org/10.1038/s41598-019-45026-0.

[159] A. M. Hammond, A. Oskooi, S. G. Johnson, and S. E. Ralph, “Photonic topology optimization with semiconductor-foundry design-rule constraints,” Opt. Express, vol. 29, no. 15, pp. 23916–23938, 2021. https://doi.org/10.1364/oe.431188.

[160] M. F. Schubert, A. K. Cheung, I. A. Williamson, A. Spyra, and D. H. Alexander, “Inverse design of photonic devices with strict foundry fabrication constraints,” ACS Photonics, vol. 9, no. 7, pp. 2327–2336, 2022. https://doi.org/10.1021/acsphotonics.2c00313.

[161] H. Carfagno, et al., “Inverse designed couplers for use in gallium arsenide photonics,” ACS Photonics, vol. 10, no. 5, pp. 1286–1292, 2023. https://doi.org/10.1021/acsphotonics.2c01864.

[162] C. Dory, et al., “Inverse-designed diamond photonics,” Nat. Commun., vol. 10, no. 1, p. 3309, 2019. https://doi.org/10.1038/s41467-019-11343-1.

[163] K. Kwon, et al., “Photon-pair generation using inverse-designed thin-film lithium niobate mode converters,” APL Photonics, vol. 9, no. 5, 2024. https://doi.org/10.1063/5.0192026.

[164] M. Helbig, K. Deb, and A. Engelbrecht, “Key challenges and future directions of dynamic multi-objective optimisation,” in 2016 IEEE Congress on Evolutionary Computation (CEC), IEEE, 2016, pp. 1256–1261. https://doi.org/10.1109/CEC.2016.7743931.

[165] M. R. Sharifi, S. Akbarifard, K. Qaderi, and M. R. Madadi, “A new optimization algorithm to solve multi-objective problems,” Sci. Rep., vol. 11, no. 1, p. 20326, 2021. https://doi.org/10.1038/s41598-021-99617-x.

[166] S. Sharma and V. Kumar, “A comprehensive review on multi-objective optimization techniques: past, present and future,” Arch. Comput. Methods Eng., vol. 29, no. 7, pp. 5605–5633, 2022. https://doi.org/10.1007/s11831-022-09778-9.

[167] R. Peng, S. Ren, J. Malof, and W. J. Padilla, “Transfer learning for metamaterial design and simulation,” Nanophotonics, vol. 13, no. 13, pp. 2323–2334, 2024. https://doi.org/10.1515/nanoph-2023-0691.

[168] B. MacLellan, et al., “Inverse design of photonic systems,” Laser Photonics Rev., vol. 18, no. 5, p. 2300500, 2024. https://doi.org/10.1002/lpor.202300500.

[169] N. Mohammadi Estakhri, B. Edwards, and N. Engheta, “Inverse-designed metastructures that solve equations,” Science, vol. 363, no. 6433, pp. 1333–1338, 2019. https://doi.org/10.1126/science.aaw2498.

[170] X. Zheng, X. Zhang, T. T. Chen, and I. Watanabe, “Deep learning in mechanical metamaterials: from prediction and generation to inverse design,” Adv. Mater., vol. 35, no. 45, p. 2302530, 2023. https://doi.org/10.1002/adma.202302530.

[171] A. Zunger, “Inverse design in search of materials with target functionalities,” Nat. Rev. Chem., vol. 2, no. 4, p. 0121, 2018. https://doi.org/10.1038/s41570-018-0121.

Received: 2024-10-11

Accepted: 2024-12-18

Published Online: 2025-01-27

This work is licensed under the Creative Commons Attribution 4.0 International License.

https://doi.org/10.1515/nanoph-2024-0536

Keywords for this article

nanophotonics; silicon photonics; inverse design; optimization; artificial intelligence; deep learning
