Home Technology A learning based approach for designing extended unit cell metagratings
Article Open Access

A learning based approach for designing extended unit cell metagratings

  • Soumyashree S. Panda ORCID logo and Ravi S. Hegde ORCID logo EMAIL logo
Published/Copyright: December 8, 2021
Become an author with De Gruyter Brill

Abstract

The possibility of arbitrary spatial control of incident wavefronts with the subwavelength resolution has driven research into dielectric optical metasurfaces in the last decade. The unit-cell based metasurface design approach that relies on a library of single element responses is known to result in reduced efficiency attributed to the inadequate accounting of the coupling effects between meta-atoms. Metasurfaces with extended unit-cells containing multiple resonators can improve design outcomes but their design requires extensive numerical computing and optimizations. We report a deep learning based design methodology for the inverse design of extended unit-cell metagratings. In contrast to previous reports, our approach learns the metagrating spectral response across its reflected and transmitted orders. Through systematic exploration, we discover network architectures and training dataset sampling strategies that allow such learning without requiring extensive ground-truth generation. The one-time investment of model creation can then be used to significantly accelerate numerical optimization of multiple functionalities as demonstrated by considering the inverse design of various spectral and polarization dependent splitters and filters. The proposed methodology is not limited to these proof-of-concept demonstrations and can be broadly applied to meta-atom-based nanophotonic system design and in realising the next generation of metasurface functionalities with improved performance.

1 Introduction

State-of-the-art nanofabrication technologies with extraordinary lateral resolution and high stitching accuracy now enable the realization of wide area precision nanostructures. In combination with advanced deposition technologies which provide great control over film thickness and uniformity, close and precise stacking of nanostructured films is also possible. A wide range of material choices, e.g., plasmonic materials (Au, Ag etc.), high index dielectrics (Si, Ge, GaAs, SiN, TiO2 etc.), graphene and other 2D materials, phase change materials and transparent conducting oxides complement the large number of spatial degrees of freedom available in nanopatterning. The optical metasurface, a spatially heterogeneous array of nanoresonators is one example of the devices that are now within the reach of nanofabrication with the tantalizing possibility of full local control of an incident wavefront’s properties (amplitude, phase, spectrum or polarization state). Such exquisite control in conjunction with active tunability can enable a new class of ultra-thin, actively controllable, optical devices with a variety of functionalities like beam shaping [1], lensing [24], beam steering [57], polarizing [8, 9] and positively impact diverse applications like imaging [10], sensing [11], holography [12, 13] and optical computing [14].

However, for designing a broad class of nanophotonic systems, including photonic crystals, metasurfaces, and plasmonic structures, the motif of a unit cell with a single resonator is predominantly employed owing to its simplicity. For the optical metasurface, the so-called unit-cell approximation creates a library that maps the geometrical parameters of a resonator to the optical response of a periodic metasurface consisting of this meta-atom. The final design is then arrived at by simply stitching together various library elements. The limitations of this design approach have been recently explored in the nanophotonics literature [1518]. Gigli et al. [19] demonstrated the infeasibility of quasi-independent operation with weak crosstalk for nanoresonators arrayed with a sub-wavelength or even wavelength-scale distance. The limitations are most noteworthy in the case of beam shaping and lensing applications. For instance, the numerical calculations by Gigli et al. show that the efficiency of the Huygens’ elements remains below 40% for monochromatic operation, even for meta-gratings with periods significantly larger than the wavelength. This performance reduction is now well known in the field and is ascribed to inter-element electromagnetic coupling [2022].

To overcome these limitations, local periodization [22, 23] and global optimization techniques [21, 24] have been explored and found to give performance improvements. Specifically, metagratings capable of higher beam bending angles in transmission [2527] and reflection [28, 29] mode have been proposed. The idea behind these metagratings is to break the symmetry of illumination conditions and channelize light into diffraction orders by engineering their scattering patterns. Such extended unit-cell geometries, where high-index dielectric nanoantennas feature multiple geometry-tunable resonances [3033], have enabled selective enhancement/suppression of single diffraction order in both reflection and transmission modes. A second aspect is that the extension of the unit cell with multiple meta-atoms drastically enlarges the parameter space and consequently provides potential configurations with improved device performance. While extended-unit cell designs using the concept of meta-atom multiplication do not induce additional complexity for lithography-based fabrications, the substantially increased number of parameters makes the design methodology based on physical intuition and parameter sweep impractical. The presence of multiple diffraction orders in metagratings and the absence of easily discernible empirical relationships between geometries and optical properties make it an especially challenging problem. Multiple separate inclusions have an inherent frequency divergence that hinders the high diffraction efficiency from wide frequency and angle ranges, which are highly desirable for beam steering applications. Asymmetric nanoantenna separation is also strongly correlated to diffraction efficiency, which raises the challenge of stringent fabrication precision.

In this article, we propose an improved learning based nanophotonic structure discovery methodology for the constrained inverse design of extended unit-cell metagratings. Inverse design with adequate accounting of fabrication constraints is an optimization occurring in a constrained higher dimensional parameter space. The computational cost of such optimization scales exponentially with linear scaling in the parameter space dimensionality. This is often termed as the “curse of dimensionality” in the optimization literature [34]. This has opened a new computation-intensive front for nanophotonic research [3538]. Machine learning, in particular, deep learning [39, 40] (DL), is being increasingly investigated for inverse design problems in photonics [4145]. DL based inverse design [46] has been explored for a broader class of photonics problems [41, 47]: nanoresonators [48, 49], plasmonic structures [5052], metasurfaces/metamaterials [5066], integrated photonics [67, 68] and topological photonics [50], [51], [52, 61], [62], [63], [64], [65], [66, 69]. As DL begins to be increasingly applied in the inverse design of complex structures with high forward simulation load, feasible datasets may be “small” with potentially significant biasing due to the dataset construction. Thus, there is a need for workflows that not only emphasize efficient sample generation but also reduce the burden involved in learned model creation.

We consider the complex multi-variable geometry of rectangular unit-cells with multiple non-intersecting elliptical nanopillars [33] for wide-band multi-order functionalities. Previous reports [7073] have limited the structure discovery to a narrow-band wavelength-limited space (as reported by Inampudi and coworkers [55, 72, 74, 75]) or diffraction order limited space (as reported by Kiarashinejad and coworkers [73, 76, 77]). In creating good models, we found that a systematic study of network architectures and techniques to limit the amount of ground-truth training data was essential. The learning methodology typically employed in the literature has overwhelmingly favored simple fully-connected feedforward neural network architectures, randomly sampled training datasets and model performance estimation using testing sets sampled identically to the training dataset. The commonly used feedforward architectures suffer from the well-known issue of vanishing gradients which leads to a saturated prediction performance with increasing network depth. Residual layers implemented by Jiang and coworkers [78] and Yeung and coworkers [79] are adopted here for addressing this shortcoming along with Dense network architectures (DenseNet). Greedy sampling and centroid based sampling strategies are compared to random dataset sampling. The prediction performances of trained models on unseen data are investigated by building surrogate models and implementing them in surrogate-based evolutionary optimization.

The paper is organized as follows. Following this introduction, a detailed discussion of problem description and design approach is presented in Section 2 including problem encoding (subsection 2.2.1), ground truth generation (subsection 2.2.2) and learned model creation (subsection 2.2.3). Methods to estimate the global prediction capability of models via surrogate-assisted optimization are discussed in subsection 2.2.4. The results and discussion section (Section 3) begins with global prediction abilities of the surrogate models. Surrogate assisted optimizations are discussed in subsection 3.1 to design different spectral filters before concluding in Section 4.

2 Problem description and design approach

2.1 The extended unit-cell metagrating

The building block of the metagratings proposed in this work is an extended unit cell that is composed of two nonidentical non-intersecting elliptical TiO2 nanopillars. The configuration results in the coexistence of multipoles of high orders when arranged in a periodic fashion. A typical symmetric 2D grating, when subjected to normally incident polarized white light, splits the beams into different diffraction orders uniformly at angles depending on the wavelength and lattice constants (Figure 1A). However, engineering the multipolar interference in an extended unit-cell allows us to control the power distribution in different orders and wavelengths. Building on the seminal works of Shibanuma and coworkers [80], tunable control of directional scattering has been proposed with the help of cylindrical dimers [30], touching dimers [31], elliptical dimers [33] and prismoid-circular dimers [33]. The extended unit cell adapted in this work takes inspiration from the metasurface geometry proposed by Khaidarov and coworkers [33]. In their work, Khaidarov and coworkers considered a unit-cell that consists of asymmetric elliptical dimers in a single unit cell. They have demonstrated that such asymmetric arrangement of elliptical nano-antenna can control the power distribution into various diffraction orders. Such arrangements can help achieving a variety of wavelength and polarization-dependent functionalities like filtering [32], beam deflection [81], beam splitting etc.

Figure 1: 
The extended unit cell metagrating and its optical response. (A) Schematic of the extended unit-cell metagrating and scattering patterns of a metagrating dividing lights equally among orders when subjected to white light. Both transmitted and reflected diffraction orders are shown. (B) Comparison of diffraction efficiencies of three metagratings across different diffraction orders. The metagratings are constructed by periodically extending (i) first constituent meta-atom in a unit-cell, (ii) second constituent meta-atom in a unit-cell and (iii) both meta-atoms in an extended unit-cell. Peak diffraction efficiencies are marked. (C) E field distributions and (D) scattering power patterns corresponding to marked points in B are shown.
Figure 1:

The extended unit cell metagrating and its optical response. (A) Schematic of the extended unit-cell metagrating and scattering patterns of a metagrating dividing lights equally among orders when subjected to white light. Both transmitted and reflected diffraction orders are shown. (B) Comparison of diffraction efficiencies of three metagratings across different diffraction orders. The metagratings are constructed by periodically extending (i) first constituent meta-atom in a unit-cell, (ii) second constituent meta-atom in a unit-cell and (iii) both meta-atoms in an extended unit-cell. Peak diffraction efficiencies are marked. (C) E field distributions and (D) scattering power patterns corresponding to marked points in B are shown.

The optical behavior of a typical extended unit-cell metagrating is shown in Figure 1B-iii. The spectral behavior of two other metagratings designed by periodically extending only one of the constituent meta-atom is shown in Figure 1B-i and ii. Both constituent meta-atoms are observed to be giving a peak diffraction efficiency in (0,0)th transmission order at wavelength 450 and 560 nm, respectively, with negligible efficiency in other orders; whereas, the extended unit-cell gives diffraction peaks in the (0,0)th, (0,−1)th and (1,0)th orders at 450, 500 and 700 nm, respectively. The E-field distributions for these marked diffraction peaks are shown in Figure C-iFigure 1C-i–v. For single-atom unit cells (Figure 1C-i and ii), fundamental electric dipole modes can be observed inside the resonators which can be determined with the help of Mie theory in the limit of large index contrast [82]. However, for extended unit-cells (Figure C-iiiFigure 1C-iii–v), hybrid modes are observed which do not show an easily discernible relationship with individual particle resonators. This coexistence of multipoles leads to the asymmetrical farfield scattering patterns shown in Figure D-iFigure 1D-i–v.

2.2 Design approach

As noted in subsection 2.1, the behavior of an extended unit-cell meta-atom cannot be easily expressed in terms of the optical behavior of constituting resonators when considered in isolation. The proposed inverse design process is carried out by an optimization algorithm that searches the full geometrical parameter space of the extended unit-cell with the help of a learning based prediction model. Using a prediction model in place of an EM solver significantly accelerates the numerical optimization with reduced computational cost, but requires a one-time computational investment for ground-truth dataset generation and subsequent DNN model training. Due to the high dimensionality of the problem, special care is given in smart-sampling the parameter space and design of efficient network architectures. The DL enabled surrogate optimization process can be divided into three steps: (1) choosing a suitable problem encoding; (2) ground truth generation; (3) choosing a model architecture and learned model creation; and (4) inverse design using the learned model. In this section, each of these steps is described briefly with further details provided in the supporting document.

2.2.1 Problem encoding

The geometry considered here (Figure 1A) is an extended unit cell [32] metagrating consisting of two elliptical TiO2 nano-pillars on an SiO2 substrate. Each elliptical pillar can be represented by the following set of geometrical parameters: coordinates of the center (x and y coordinates), major and minor radii (r major and r minor, respectively) and angle of inclination (θ) (as shown in the inset of Figure 1A). The geometrical parameters of the two elliptical nanopillars; the lattice periodicities along the x and y directions (P x and P y , respectively); and, the metasurface thickness (t) form a 13-dimensional parameterized vector. Fixing the x and y position of the center of one of the ellipses to (0,0) allows us to reduce the dimensionality of the solution space by 2.

The individual parameters in the parameter vector are constrained to lie between their minimum and maximum values and the solution space is a region in the higher dimensional parameter space defined by these constraints. However, this solution space may also contain some structures that are infeasible for nanofabrication. Additionally, some parameter combinations may result in the intersection of the ellipses. Although the structures with intersecting ellipses may be feasible for fabrication, their optical response may greatly differ from the response of structures without such intersections. Therefore, subregions of the solution space are excluded from consideration which contains designs that are infeasible for nanofabrication (which violate lithographic constraints [83] like minimum feature size) and those that lead to intersecting ellipses. Specifically, the minimum value of major and minor radii was set to 50 nm and, to avoid redundancy, r major is always taken greater than r minor. To maintain adequate spacing between ellipses, the ellipses a first scaled to buffer ellipses (indicated by dotted blue lines in the inset of figure: schematic A) such that r buffer = r ellipse + 10 nm for both major and minor radii. Geometries with intersecting buffer ellipses are left out; which not only ensures a minimum gap of 20 nm between ellipses but also eliminates structures with intersecting ellipses. The buffer ellipses should also not intersect the lattice rectangle; thus providing adequate distance (20 nm) between neighboring unit cells.

The observations in this work are limited to visible wavelength (400–700 nm) and the periodicities of structures are set between 600 and 1000 nm. Only for the periodicities higher than 800 nm we get diffraction into the second order and that too for a shorter range of wavelengths (e.g., 400–500 nm for the largest periodicity of 1000 nm). For all other periodicities and wavelengths, diffraction is only up to the first order. Thus, we limit the observation up to the first diffraction order only. Therefore, we consider 18 diffraction orders (9 in transmission and 9 in reflection) for a single polarization at a given wavelength. Considering two linear polarizations (s and p) and discretizing the visible spectrum (λ = 400–700 nm) with 32 sample wavelengths with 10 nm spacing, we get 36 spectral responses at each of the 32 discrete wavelengths. These numbers constitute the “output” of any learned model. There are various choices for representing this “output”; the approach in this paper is to represent it as a 3D tensor of shape (18,2,32) (see Figure 1B). This choice was motivated by the observation that although the spectral responses of any particular order are smooth functions of wavelength, the splitting various orders at any given wavelength was not very smooth. Cascading them to a 1D tensor causes discontinuities at the points of succession; making the training performance suffer. A 3D tensor representation (Figure 1B) of dimensionality (18,2,32) as chosen in this work not only avoids those discontinuities but also helps to exploit the correlations among output parameters. Additionally, this encoding allows us to use a deconvolution layer toward the end of the network (as shown in Figure 1B) which helps in reducing the number of network trainable parameters. A detailed comparison between the two encoding choices is provided in Section S-1.1 of the supporting document.

2.2.2 Ground-truth generation

As discussed in 2.2.1, the design space is a continuous hypercube in the 11-dimensional continuous solution space (H). The imposed constraints reduce the solution space to a “feasible” continuous solution space (F) with irregular boundaries (Figure S-2C-i). First we generate a discrete version of the feasible solution space F s (Figure S-2C-ii) by adequately sampling F as follows: x, y − 100, 200, …, 700, r major, r minor − 50, 100, …, 450, θ − 0°, 45°, …, 135°, P x , P y − 600, 800, 1000 and t − 500, 600, 700, 800 with the lengths measured in nanometers. Once generated, samples are selected from F s and labeled for training the networks. Training and testing datasets include both the input (X) and the associated outputs (Y) (outputs are termed as “labels”). The labeling process (ground truth generation) is a computationally expensive process. While building a prediction model, a smart selection of datasets (i.e., choosing the X array) could lead to a reduction in this burden or lead to a better model for the same computational burden. Three criteria are needed to be considered when selecting samples (X) from the design space; informativeness, representativeness and diversity [84]. The conventional ways of sampling are the random sampling [85] (uniform sampling probabilities assigned to all the regions of the solution space) and Latin Hypercube (LHC) sampling [86] (stratifies the input probability distributions). However, due to the irregularities in the distribution of the infeasible designs, LHC sampling does not lead to a representative sampling in our case. In this work, we explore three smart sampling strategies: (1) greedy sampling, (2) k-means clustering and (3) k-medoids clustering to generate the train/test datasets and compare them with random sampling. See Section S-1.2 of the supporting document for further details.

2.2.3 Learned model creation

Three network architectures have been explored in this work and are compared with the most prevalent feedforward architecture; (1) ResNet architecture which is built by cascading multiple residual blocks (a residual block uses a single skip connection in a deep neural network); (2) DenseNet-I architecture where the network is built by cascading dense blocks (a single dense block consists of two skip layers); and (3) DenseNet-II architecture where skip layers are repeated throughout to connect the hidden layers. The architectures are shown in Figure S-2B of the supporting document. In each of these network architectures, the number of hidden layers is increased up to 24 to find the best network configuration. The results are documented in Section S-1.3 of the supporting file. Based on the result given in Figure S-1D, the number of layers is fixed to 13 for feedforward network architectures and 24 for other architectures for the rest of the article. Unless otherwise mentioned, the dense layers consist of 288 neurons.

An extensive comparison of performance was carried out for all combinations of network architectures and sampling strategies discussed above and are documented in Section S-2 of the supporting document. A DenseNet-II network architecture trained with samples selected with the help of k-means clustering strategy is observed to be having a greater prediction ability as compared to other combinations. This prediction model is referred to as “Surrogate-1” in the further part of this work. In order to showcase its efficacy, this model is compared with another prediction model “Surrogate-2” which uses a feed-forward network architecture trained with randomly selected samples.

2.2.4 Surrogate-assisted evolutionary optimization

A DE optimization is employed in this work for the inverse design of the metagratings where a randomly initialized set of solutions (called the “population”) are taken and nature inspired techniques like mutation and crossover are performed to produce a new set of solutions [87, 88]. In our previous works [89, 90], where DE was used to optimize nanophotonic structures, the optical response was obtained using a forward solver (S 4). In this work, we substitute the forward solver with the above discussed DNN based prediction models Surrogate-1 and Surrogate-2 and the associated DE optimization models are referred to as DESurrogate-1 and DESurrogate-2, respectively. Figure 2C shows the block diagram of the surrogate optimization. The DE optimization searches for a design that closely matches a targeted response (specified across orders, wavelengths and input polarization). The fitness (η) in the optimization is calculated by taking the mean of squared error between the obtained transmittance (s) and target spectra (τ) as given below.

(1) η = 1 2 N o N λ i = 1 N o j = 1 2 k = 1 N λ [ s ( i , j , k ) τ ( i , j , k ) ] 2 ,

where N o = 18 and N λ = 32 are the number of diffraction orders (both reflection and transmissions) and wavelength samples, respectively.

Figure 2: 
Design of experiments for DL based structure discovery of extended unit cell metagratings.
(A) Schematic of the extended unit cell metagrating and its optical response which results in splitting the incoming normally incident beam into various diffraction orders. The structure consists of two elliptical TiO2 nano-pillars on SiO2 substrate (Insets show the top and perspective views of a single unit cell and the geometrical parameters). (B) A “learned” deep neural network (DNN) model of the structure which predicts the power splitting in various orders over a wavelength range for a given member geometry. (C) Flowchart of learning based surrogate differential evolution (DE) optimization for extended unit-cell metagrating design.
Figure 2:

Design of experiments for DL based structure discovery of extended unit cell metagratings.

(A) Schematic of the extended unit cell metagrating and its optical response which results in splitting the incoming normally incident beam into various diffraction orders. The structure consists of two elliptical TiO2 nano-pillars on SiO2 substrate (Insets show the top and perspective views of a single unit cell and the geometrical parameters). (B) A “learned” deep neural network (DNN) model of the structure which predicts the power splitting in various orders over a wavelength range for a given member geometry. (C) Flowchart of learning based surrogate differential evolution (DE) optimization for extended unit-cell metagrating design.

3 Results and discussions

The numerical experiments were performed in the Keras DL platform with a Tensorflow backend on a workstation with an Intel™ i9-7920X CPU with an NVIDIA™ GeForce GTX 1080 GPU card with 128 GB memory. The source code for the implementation, datasets and saved models has been made publicly available at [91]. For ground-truth generation and exact fitness estimation of structures for use in evolutionary algorithms, the Stanford Stratified Structure Simulator [92] (S 4) was used, which uses the Rigorous Coupled Wave Analysis (RCWA) technique along with the S-matrix algorithm to solve Maxwell’s equations in layered periodic structures. In the S 4 solver, the number of basic functions parameters was set to 50. For the results presented in this section, the prediction error of a trained model (the mean of squared errors (MSE) between the true and predicted output values for members of the test set) is considered. Models with prediction error values less than 5e-4 were found to give a satisfactory performance (see Figure S-5 in the supporting document for more details).

An extensive comparison of performance was carried out for all combinations of network architectures and sampling strategies discussed above and are documented in Section S-2 of the supporting document. A DenseNet-II network architecture trained with samples selected from F s with the help of k-means clustering strategy (referred to as “Surrogate-1”) is statistically observed to be having a greater prediction ability as compared to other combinations. Figure 3A and B shows the prediction ability of Surrogate-1 on two unseen testing datasets of size 50 K selected from F s and (F − F s ), respectively. In order to showcase its efficacy, this prediction model is compared with another prediction model “Surrogate-2” which uses a feed-forward network architecture trained with randomly selected samples. The training dataset size is increased from 40 to 400 K for both configurations. All experiments were repeated for 10 times and the distribution of prediction error is plotted in Figure 3A-i and B-i. A clear advantage of prediction quality in Surrogate-1 is observed over Surrogate-2 with mean error, as well as the error variance across multiple runs, are both noticeably lower. Although increasing training dataset size (S train) beyond 80 K further reduces the prediction error in test set F s , in test set (F − F s ) it gives a saturated performance, thus making 80 K a suitable training dataset size. To illustrate, Figure 3A-ii and B-ii show two example structures drawn from both test sets and their actual and predicted optical response. The training dataset size was set to 80 K. Noticeable differences are observed in the prediction of different models with Surrogate-1 outperforming Surrogate-2.

Figure 3: 
Comparison of the prediction performance on unseen data for two combinations of sampling strategies and network architectures (k-means + DenseNet-I and Random + Feedforward) on two test sets of size 50,000 selected from (A) (F

s
) and (B) (F − F

s
), respectively. Statistical comparison of prediction error along with the prediction of one example structure is shown for both cases. (C) Comparison of both trained models’ performances as surrogates in DE optimization. Statistics of multiple runs of DE optimization (100 iterations) for a target design of a polarization-independent color filter (red) are shown. The best surrogate fitness (fitness predicted by surrogates; see Eq. (1)) and the actual fitness values at the end of multiple optimization runs are shown. (D) and (E) Typical progression of fitnesses (with iteration number) for DESurrogate-1 and DESurrogate-2, respectively. For surrogate optimizations, the actual fitnesses (solid lines) and corresponding surrogate fitnesses (dotted lines) are shown.
Figure 3:

Comparison of the prediction performance on unseen data for two combinations of sampling strategies and network architectures (k-means + DenseNet-I and Random + Feedforward) on two test sets of size 50,000 selected from (A) (F s ) and (B) (F − F s ), respectively. Statistical comparison of prediction error along with the prediction of one example structure is shown for both cases. (C) Comparison of both trained models’ performances as surrogates in DE optimization. Statistics of multiple runs of DE optimization (100 iterations) for a target design of a polarization-independent color filter (red) are shown. The best surrogate fitness (fitness predicted by surrogates; see Eq. (1)) and the actual fitness values at the end of multiple optimization runs are shown. (D) and (E) Typical progression of fitnesses (with iteration number) for DESurrogate-1 and DESurrogate-2, respectively. For surrogate optimizations, the actual fitnesses (solid lines) and corresponding surrogate fitnesses (dotted lines) are shown.

Further, both surrogate models are implemented in designing a surrogate assisted DE optimization. The DE optimization searches for a design that closely matches a targeted response (specified across orders, wavelengths and input polarization). In all the DE optimizations, the population size is fixed to 110 and the initial populations are randomly selected. The DE schemes implemented here are DE/rand/1/bin (mutant is derived through one difference operation between two random solutions and a binary crossover takes place between parent and mutant solution vector to generate the offspring) for exploration. A good convergence rate is observed with mutation rate (m r) = 0.4 and crossover probability (c p) = 0.5. All the DE runs are observed to be converging after 100 iterations, making it the termination criterion. The fitness (η) in the optimization is calculated as discussed in Eq. (1). Specifically, for illustration, the design of a polarization-independent “Red” color filter is considered under the condition of normal incidence. For this target, Eq. (1) assume the following form:

(2) η = 1 2 N λ j = 1 2 k = 1 N λ [ s ( 1 , j , k ) CMF red ( k ) ] 2 ,

where CMFred is the CIE color matching function for red color (sampled at 32 wavelengths).

Figure 3C shows the comparative performance of DESurrogate-1 and DESurrogate-2 for this target. Each DE optimization was repeated 10 times with random initializations and the statistics of the best fitnesses obtained at the end are plotted in Figure 3C. As the surrogate models use DNN predictions of optical responses, the predicted fitness may vary from actual fitness. Therefore, the actual fitness of the best solutions was also calculated for DESurrogate-1 and DESurrogate-2 and plotted. It can be observed that DESurrogate-1 outperforms DESurrogate-2 noticeably for the same training dataset size of 80 K.

The surrogate optimizations perform the search in an approximated fitness landscape. In regions of the parameter space where the prediction capability of the model is poor, there will be a large divergence between actual fitness and predicted fitness. Figure 3E and F shows two typical progressions of an optimization run as indicated by the best obtained fitness value at the end of each iteration for DESurrogate-1 and DESurrogate-2, respectively. The exact fitness of the best individual is also plotted to indicate the divergence between the exact and the approximate fitness landscapes at various junctures of the optimizations. It was observed that the optimizations were converging within 100 iterations. In the case of DESurrogate-1 and DESurrogate-2, the actual fitness is observed to be diverging from predicted fitness; suggesting the degradation in performance. This divergence is caused as the optimization makes the population evolve to solutions where the predicted fitness is low, but the prediction is inferior. As observed in Figure 3E and F, in case of DESurrogate-1, the divergence is less as compared to DESurrogate-2 suggesting a better global prediction ability which results in enhanced optimization capability of DESurrogate-1. Solutions with best fitness in the exact fitness landscape are considered as best fit solutions.

3.1 Design of spectral filters and color splitters

In this section, surrogate DE optimization is employed for the inverse design of the following functionalities (for normal incidence); (1) polarization-insensitive RGB color filters; (2) polarization-dependent RGB color filters; and, (3) RGB color splitters. The performance (spectral behavior and color purity) of best-fit solutions obtained with surrogate optimizations (DESurrogate-1 and DESurrogate-2) are compared.

For the polarization-insensitive spectral filters (schematic configuration shown in Figure 4A), Eq. (1) assume the following form:

(3) η = 1 2 N λ j = 1 2 k = 1 N λ [ s ( 1 , j , k ) CMF ( j , k ) ] 2 ,

where the target CMFs are same for both polarizations; i.e., CMF(1, k) = CMF(2, k). The results of the inverse design process for all three primary RGB filters are compared across DESurrogate-1 and DESurrogate-2 in Figure 4B–D. It is seen that the designs found by DESurrogate-1 outperform DESurrogate-2 with an improved level of polarization insensitivity and color purity.

Figure 4: 
Inverse design of polarization independent transmission mode spectral filters using DESurrogate-1 and DESurrogate-2.
(A) Configuration of an example spectral filter. (B), (C) and (D) Transmission spectra of best fit spectral filters obtained by DESurrogate-1 and DESurrogate-2 for designing polarization independent blue, green and red color filters. Inset patches show the colors represented by the spectra in CIE 1931 color space. The unit cell structures obtained with both techniques are shown in color coding for all cases.
Figure 4:

Inverse design of polarization independent transmission mode spectral filters using DESurrogate-1 and DESurrogate-2.

(A) Configuration of an example spectral filter. (B), (C) and (D) Transmission spectra of best fit spectral filters obtained by DESurrogate-1 and DESurrogate-2 for designing polarization independent blue, green and red color filters. Inset patches show the colors represented by the spectra in CIE 1931 color space. The unit cell structures obtained with both techniques are shown in color coding for all cases.

For the polarization-dependent spectral filters (schematic configuration shown in Figure 5A), the fitness is again given by Eq. (3), but the target CMF is different for each polarizations (i.e., CMF(1, k) ≠ CMF(2, k)). It is seen that the designs found by DESurrogate-1 outperform DESurrogate-2 with higher level of polarization dependence.

Figure 5: 
Inverse design of polarization dependent transmission mode spectral filters using DESurrogate-1 and DESurrogate-2.
(A) Configuration of an example spectral filter. (B), (C) and (D) Transmission spectra of best fit spectral filters obtained by DESurrogate-1 and DESurrogate-2 for designing polarization dependent (B) blue-green, (C) green-red and (D) red-blue color filters. Inset patches show the colors represented by the spectra in CIE 1931 color space. The unit cell structures obtained with all three techniques are shown in color coding for all cases.
Figure 5:

Inverse design of polarization dependent transmission mode spectral filters using DESurrogate-1 and DESurrogate-2.

(A) Configuration of an example spectral filter. (B), (C) and (D) Transmission spectra of best fit spectral filters obtained by DESurrogate-1 and DESurrogate-2 for designing polarization dependent (B) blue-green, (C) green-red and (D) red-blue color filters. Inset patches show the colors represented by the spectra in CIE 1931 color space. The unit cell structures obtained with all three techniques are shown in color coding for all cases.

For designing RGB color splitters for s-polarized incident light (schematic configuration shown in Figure 6A), the fitness term in Eq. (1) assume the following form:

(4) η = 1 3 N λ i = 1 3 k = 1 N λ [ s ( i , 1 , k ) CMF ( i , k ) ] 2 ,

where the target CMFs are different for each order. For an RGB color splitter where blue, green and red colors are transmitted in T 0,0, T 1,0 and T −1,0 orders, respectively (Figure 6A), CMF(1, k) = CMFblue(k), CMF(2, k) = CMFgreen(k) and CMF(3, k) = CMFred(k). The inverse design of this color splitter was carried out using DESurrogate-1 and DESurrogate-2 and the results are compared in Figure 6B. A clear advantage of DESurrogate-1 is observed over DESurrogate-2 in all cases.

Figure 6: 
Inverse design of transmission mode color splitters using DESurrogate-1 and DESurrogate-2.
(A) Configuration of an example color splitter. (B) Transmission spectra of color splitter designed for s-polarized incidence obtained with DESurrogate-1 and DESurrogate-1. Inset patches show the colors represented by the spectra. The unit cell structures obtained with all three techniques are shown in color coding.
Figure 6:

Inverse design of transmission mode color splitters using DESurrogate-1 and DESurrogate-2.

(A) Configuration of an example color splitter. (B) Transmission spectra of color splitter designed for s-polarized incidence obtained with DESurrogate-1 and DESurrogate-1. Inset patches show the colors represented by the spectra. The unit cell structures obtained with all three techniques are shown in color coding.

Comparing the spectral results of optimized structures in Figures 4, 5 and 6, it can be inferred that DESurrogate-1 is able to follow the target curve better than DESurrogate-2 resulting a better fitness; which indicates a better prediction ability of DESurrogate-1. From the comparison of these colors, it can be observed that in most of the cases, structures optimized with DESurrogate-1 showcase a better color purity as compared to DESurrogate-2; apart from some anomalies in Figure 6B (red). In this exceptional case, although DESurrogate-2 gives structures with better color purity, the transmission peak of structure given by DESurrogate-2 is observed to be considerably small (∼0.15).

The proposed method involves a one-time investment in model creation but significantly reduces the computational time of any optimization objective. The surrogate optimization routine adapted in this work involves a single call (for the best fit solution) to the electromagnetic forward solver in a single iteration; unlike traditional optimizers where the forward solvers evaluate the whole population. The surrogate optimization is found to speed-up the optimization process by “N pop” times; where “N pop” is the population size (110 in our case). A detailed account of timing at different junctures of model creation and surrogate optimization are given in Section S-4 of the supporting document. The proposed algorithm is thus useful as the initial investment in model creation can be amortized across several different optimization objectives.

4 Conclusions

In conclusion, we demonstrate the advantage offered by densely connected neural network architectures and judicious dataset sampling strategies for learning based nanophotonic structure discovery. In particular, we consider learned models for use in surrogate-assisted evolutionary optimization. The inverse design of optical devices to meet a targeted spectral performance is a general problem and we speculate that the findings of this study can be adopted in inverse design scenarios beyond the example chosen here. Beyond DE considered here, there is a vast array of evolutionary optimization methods used in photonics and it is anticipated that learning based acceleration could be of interest. Although the scope of this work was restricted to parameterized geometry representation and moderate input dimensionality, we speculate on the extension to other problems. Using the methods of dimensionality reduction via the use of autoencoder networks [73, 93], the use of nonparameterized geometry representation can broaden the applicability of this work.

Our previous works [66, 94], have explored different combinations of surrogate models and evolutionary computing by which one can trade off optimality and computational load. In this work, we have considered surrogate-only optimization. When compared with DERCWA, DESurrogate-1 dramatically reduces the computational load but underperforms in terms of optimality. Specifically, choosing surrogate models exclusively for the candidate selection step in evolutionary optimization is recommended in some situations [66, 94].


Corresponding author : Ravi S. Hegde, Department of Electrical Engineering, IIT Gandhinagar, Gandhinagar, 382355, India, E-mail:

Award Identifier / Grant number: SN/NM/NS-65/2016

Acknowledgments

This work was supported by Department of Science and Technology, Govt. of India through the extramural grant of Nanomission Project: SN/NM/NS-65/2 016. The authors would like to acknowledge the contribution of Harshul Tandan for assisting with the experiments in Section S-1.2.

  1. Author contribution: All the authors have accepted responsibility for the entire content of this submitted manuscript and approved submission.

  2. Research funding: None declared.

  3. Conflict of interest statement: The authors declare no conflicts of interest regarding this article.

References

[1] N. Yu, P. Genevet, M. A. Kats, et al.., “Light propagation with phase discontinuities: generalized laws of reflection and refraction,” Science, vol. 334, no. 6054, pp. 333–337, 2011. https://doi.org/10.1126/science.1210713.Search in Google Scholar PubMed

[2] M. Khorasaninejad, W. T. Chen, R. C. Devlin, J. Oh, A. Y. Zhu, and F. Capasso, “Metalenses at visible wavelengths: diffraction-limited focusing and subwavelength resolution imaging,” Science, vol. 352, no. 6290, pp. 1190–1194, 2016. https://doi.org/10.1126/science.aaf6644.Search in Google Scholar PubMed

[3] M. Khorasaninejad and F. Capasso, “Metalenses: versatile multifunctional photonic components,” Science, vol. 358, no. 6367, pp. 1–8, 2017. https://doi.org/10.1126/science.aam8100.Search in Google Scholar PubMed

[4] W. Liu, Z. Li, H. Cheng, et al.., “Metasurface enabled wide-angle Fourier lens,” Adv. Mater., vol. 30, no. 23, p. 1706368, 2018. https://doi.org/10.1002/adma.201706368.Search in Google Scholar PubMed

[5] Y. Yang, W. Wang, P. Moitra, I. I. Kravchenko, D. P. Briggs, and J. Valentine, “Dielectric meta-reflectarray for broadband linear polarization conversion and optical vortex generation,” Nano Lett., vol. 14, no. 3, pp. 1394–1399, 2014. https://doi.org/10.1021/nl4044482.Search in Google Scholar PubMed

[6] M. Q. Mehmood, S. Mei, S. Hussain, et al.., “Visible-frequency metasurface for structuring and spatially multiplexing optical vortices,” Adv. Mater., vol. 28, no. 13, pp. 2533–2539, 2016. https://doi.org/10.1002/adma.201504532.Search in Google Scholar PubMed

[7] M. Zhang, M. Pu, F. Zhang, et al.., “Plasmonic metasurfaces for switchable photonic spin–orbit interactions based on phase change materials,” Adv. Sci., vol. 5, no. 10, p. 1800835, 2018. https://doi.org/10.1002/advs.201800835.Search in Google Scholar PubMed PubMed Central

[8] N. M. Estakhri and A. Alu, “Manipulating optical reflections using engineered nanoscale metasurfaces,” Phys. Rev. B, vol. 89, no. 23, p. 235419, 2014. https://doi.org/10.1103/physrevb.89.235419.Search in Google Scholar

[9] X. Ma, M. Pu, X. Li, Y. Guo, and X. Luo, “All-metallic wide-angle metasurfaces for multifunctional polarization manipulation,” Opto-Electron. Adv., vol. 2, no. 3, pp. 180023–180031, 2019. https://doi.org/10.29026/oea.2019.180023.Search in Google Scholar

[10] D. Lee, J. Gwak, T. Badloe, S. Palomba, and J. Rho, “Metasurfaces-based imaging and applications: from miniaturized optical components to functional imaging platforms,” Nanoscale Adv., vol. 2, no. 2, pp. 605–625, 2020. https://doi.org/10.1039/c9na00751b.Search in Google Scholar PubMed PubMed Central

[11] W. Xu, L. Xie, and Y. Ying, “Mechanisms and applications of terahertz metamaterial sensing: a review,” Nanoscale, vol. 9, no. 37, pp. 13864–13878, 2017. https://doi.org/10.1039/c7nr03824k.Search in Google Scholar PubMed

[12] G. Zheng, H. Mühlenbernd, M. Kenney, G. Li, T. Zentgraf, and S. Zhang, “Metasurface holograms reaching 80% efficiency,” Nat. Nanotech., vol. 10, no. 4, pp. 308–312, 2015. https://doi.org/10.1038/nnano.2015.2.Search in Google Scholar PubMed

[13] Zi-L. Deng, J. Deng, X. Zhuang, et al.., “Diatomic metasurface for vectorial holography,” Nano Lett., vol. 18, no. 5, pp. 2885–2892, 2018. https://doi.org/10.1021/acs.nanolett.8b00047.Search in Google Scholar PubMed

[14] F. Zangeneh-Nejad, D. L. Sounas, A. Andrea, and R. Fleury, “Analogue computing with metamaterials,” Nat. Rev. Mater., vol. 6, no. 3, pp. 207–225, 2021. https://doi.org/10.1038/s41578-020-00243-2.Search in Google Scholar

[15] Z. Zhou, J. Li, R. Su, et al.., “Efficient silicon metasurfaces for visible light,” ACS Photonics, vol. 4, no. 3, pp. 544–551, 2017. https://doi.org/10.1021/acsphotonics.6b00740.Search in Google Scholar

[16] S. Sun, K.-Y. Yang, C.-M. Wang, et al.., “High-efficiency broadband anomalous reflection by gradient meta-surfaces,” Nano Lett., vol. 12, no. 12, pp. 6223–6229, 2012. https://doi.org/10.1021/nl3032668.Search in Google Scholar PubMed

[17] N. M. Estakhri and A. Alu, “Wave-front transformation with gradient metasurfaces,” Phys. Rev. X, vol. 6, no. 4, p. 041008, 2016. https://doi.org/10.1103/physrevx.6.041008.Search in Google Scholar

[18] M. I. Shalaev, J. Sun, A. Tsukernik, A. Pandey, K. Nikolskiy, and N. M. Litchinitser, “High-efficiency all-dielectric metasurfaces for ultracompact beam manipulation in transmission mode,” Nano Lett., vol. 15, no. 9, pp. 6261–6266, 2015. https://doi.org/10.1021/acs.nanolett.5b02926.Search in Google Scholar PubMed

[19] C. Gigli, Q. Li, P. Chavel, G. Leo, M. L. Brongersma, and P. Lalanne, “Fundamental limitations of huygens’ metasurfaces for optical beam shaping,” Laser Photonics Rev, vol. 15, no. 8, p. 2000448, 2021.10.1002/lpor.202000448Search in Google Scholar

[20] S. M. Kamali, E. Arbabi, A. Arbabi, and A. Faraon, “A review of dielectric optical metasurfaces for wavefront control,” Nanophotonics, vol. 7, no. 6, pp. 1041–1068, 2018. https://doi.org/10.1515/nanoph-2017-0129.Search in Google Scholar

[21] A. J. Ollanik, J. A. Smith, M. J. Belue, and M. D. Escarra, “High-efficiency all-dielectric Huygens metasurfaces from the ultraviolet to the infrared,” ACS photonics, vol. 5, no. 4, pp. 1351–1358, 2018. https://doi.org/10.1021/acsphotonics.7b01368.Search in Google Scholar

[22] H. Yu, B. Zhang, C. Bulin, R. Li, and R. Xing, “High-efficient synthesis of graphene oxide based on improved hummers method,” Sci. Rep., vol. 6, no. 1, pp. 1–7, 2016. https://doi.org/10.1038/srep36143.Search in Google Scholar PubMed PubMed Central

[23] K. E. Chong, I. Staude, A. James, et al.., “Polarization-independent silicon metadevices for efficient optical wavefront control,” Nano Lett., vol. 15, no. 8, pp. 5369–5374, 2015. https://doi.org/10.1021/acs.nanolett.5b01752.Search in Google Scholar PubMed

[24] H. Cai, S. Srinivasan, D. A. Czaplewski, et al.., “Inverse design of metasurfaces with non-local interactions,” npj Comput. Mater., vol. 6, no. 1, pp. 1–8, 2020. https://doi.org/10.1038/s41524-020-00369-5.Search in Google Scholar

[25] Y. Ra’di and A. Andrea, “Reconfigurable metagratings,” ACS Photonics, vol. 5, no. 5, pp. 1779–1785, 2018.10.1021/acsphotonics.7b01528Search in Google Scholar

[26] W. Liu and A. E. Miroshnichenko, “Beam steering with dielectric metalattices,” ACS Photonics, vol. 5, no. 5, pp. 1733–1741, 2017. https://doi.org/10.1021/acsphotonics.7b01217.Search in Google Scholar

[27] A. Wu, H. Li, J. Du, et al.., “Experimental demonstration of in-plane negative-angle refraction with an array of silicon nanoposts,” Nano Lett., vol. 15, no. 3, pp. 2055–2060, 2015. https://doi.org/10.1021/nl5049516.Search in Google Scholar PubMed

[28] Zi-L. Deng, J. Deng, X. Zhuang, et al.., “Facile metagrating holograms with broadband and extreme angle tolerance,” Light: Sci. Appl., vol. 7, no. 1, pp. 1–8, 2018. https://doi.org/10.1038/s41377-018-0075-0.Search in Google Scholar PubMed PubMed Central

[29] Z.-L. Deng, S. Zhang, and G. P. Wang, “A facile grating approach towards broadband, wide-angle and high-efficiency holographic metasurfaces,” Nanoscale, vol. 8, no. 3, pp. 1588–1594, 2016. https://doi.org/10.1039/c5nr07181j.Search in Google Scholar PubMed

[30] T. Shibanuma, T. Matsui, R. Tyler, et al.., “Experimental demonstration of tunable directional scattering of visible light from all-dielectric asymmetric dimers,” ACS Photonics, vol. 4, no. 3, pp. 489–494, 2017. https://doi.org/10.1021/acsphotonics.6b00979.Search in Google Scholar

[31] S. Tan, Y. Wang, Z.-L. Deng, et al.., “All-dielectric kissing-dimer metagratings for asymmetric high diffraction,” Adv. Opt. Mater., vol. 7, no. 24, p. 1901389, 2019.10.1002/adom.201901389Search in Google Scholar

[32] C. Liu, S. A. Maier, and G. Li, “Genetic-algorithm-aided meta-atom multiplication for improved absorption and coloration in nanophotonics,” ACS Photonics, vol. 7, no. 7, pp. 1716–1722, 2020. https://doi.org/10.1021/acsphotonics.0c00266.Search in Google Scholar

[33] E. Khaidarov, H. Hao, R. Paniagua-Domínguez, et al.., “Asymmetric nanoantennas for ultrahigh angle broadband visible light bending,” Nano Lett., vol. 17, no. 10, pp. 6267–6272, 2017. https://doi.org/10.1021/acs.nanolett.7b02952.Search in Google Scholar PubMed

[34] R. Bellman, Dynamic Programming, reprint edition, Mineola, N.Y, Dover Publications Inc., 2003.Search in Google Scholar

[35] S. D. Campbell, D. Sell, R. P. Jenkins, E. B. Whiting, J. A. Fan, and H. D. Werner, “Review of numerical optimization techniques for meta-device design,” Opt. Mater. Express, vol. 9, no. 4, pp. 1842–1863, 2019. https://doi.org/10.1364/ome.9.001842.Search in Google Scholar

[36] W. Ma, Z. Liu, Z. A. Kudyshev, A. Boltasseva, W. Cai, and Y. Liu, “Deep learning for the design of photonic structures,” Nat. Photon., vol. 15, no. 2, pp. 77–90, 2021. https://doi.org/10.1038/s41566-020-0685-y.Search in Google Scholar

[37] K. Yao, R. Unni, and Y. Zheng, “Intelligent nanophotonics: merging photonics and artificial intelligence at the nanoscale,” Nanophotonics, vol. 8, no. 3, pp. 339–366, 2019. https://doi.org/10.1515/nanoph-2018-0183.Search in Google Scholar PubMed PubMed Central

[38] J. Zhou, B. Huang, Z. Yan, and J.-C. G. Bünzli, “Emerging role of machine learning in light-matter interaction,” Light: Sci. Appl., vol. 8, no. 1, pp. 1–7, 2019. https://doi.org/10.1038/s41377-019-0192-4.Search in Google Scholar PubMed PubMed Central

[39] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning (Adaptive Computation and Machine Learning Series), Cambridge, MA, MIT Press, 2017, pp. 321–359.Search in Google Scholar

[40] Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, no. 7553, pp. 436–444, 2015. https://doi.org/10.1038/nature14539.Search in Google Scholar PubMed

[41] R. S. Hegde, “Deep learning: a new tool for photonic nanostructure design,” Nanoscale Adv., vol. 2, no. 3, pp. 1007–1023, 2020. https://doi.org/10.1039/c9na00656g.Search in Google Scholar PubMed PubMed Central

[42] Z. Liu, D. Zhu, L. Raju, and W. Cai, “Tackling photonic inverse design with machine learning,” Adv. Sci., vol. 8, no. 5, p. 2002923, 2021. https://doi.org/10.1002/advs.202002923.Search in Google Scholar PubMed PubMed Central

[43] O. Khatib, S. Ren, M. Jordan, and W. J. Padilla, Deep Learning the Electromagnetic Properties of Metamaterials—A Comprehensive review Advanced Functional Materials, Wiley Online Library, 2021, p. 2101748.10.1002/adfm.202101748Search in Google Scholar

[44] A. F. Koenderink, A. Alu, and P. Albert, “Nanophotonics: shrinking light-based technology,” Science, vol. 348, no. 6234, pp. 516–521, 2015. https://doi.org/10.1126/science.1261243.Search in Google Scholar PubMed

[45] S. Molesky, Z. Lin, A. Y. Piggott, W. Jin, J. Vucković, and A. W. Rodriguez, “Inverse design in nanophotonics,” Nat. Photon., vol. 12, no. 11, pp. 659–670, 2018. https://doi.org/10.1038/s41566-018-0246-9.Search in Google Scholar

[46] J. Peurifoy, Y. Shen, L. Jing, et al.., “Nanophotonic particle simulation and inverse design using artificial neural networks,” Sci. Adv., vol. 4, no. 6, p. eaar4206, 2018. https://doi.org/10.1126/sciadv.aar4206.Search in Google Scholar PubMed PubMed Central

[47] J. Jiang, M. Chen, and J. A. Fan, “Deep neural networks for the evaluation and design of photonic devices,” Nat. Rev. Mater., vol. 6, pp. 679–700, 2021. https://doi.org/10.1038/s41578-020-00260-1.Search in Google Scholar

[48] O. Hemmatyar, S. Abdollahramezani, Y. Kiarashinejad, M. Zandehshahvar, and A. Ali, “Full color generation with fano-type resonant hfo 2 nanopillars designed by a deep-learning approach,” Nanoscale, vol. 11, no. 44, pp. 21266–21274, 2019. https://doi.org/10.1039/c9nr07408b.Search in Google Scholar PubMed

[49] I. Sajedian, T. Badloe, and J. Rho, “Optimisation of colour generation from dielectric nanostructures using reinforcement learning,” Opt. Express, vol. 27, no. 4, pp. 5874–5883, 2019. https://doi.org/10.1364/oe.27.005874.Search in Google Scholar PubMed

[50] I. Malkiel, M. Mrejen, A. Nagler, U. Arieli, L. Wolf, and H. Suchowski, “Plasmonic nanostructure design and characterization via deep learning,” Light: Sci. Appl., vol. 7, no. 1, pp. 1–8, 2018. https://doi.org/10.1038/s41377-018-0060-7.Search in Google Scholar PubMed PubMed Central

[51] J. Baxter, A. C. Lesina, J.-M. Guay, W. Arnaud, P. Berini, and L. Ramunno, “Plasmonic colours predicted by deep learning,” Sci. Rep., vol. 9, no. 1, pp. 1–9, 2019. https://doi.org/10.1038/s41598-019-44522-7.Search in Google Scholar PubMed PubMed Central

[52] T. Zhang, J. Wang, L. Qi, et al.., “Efficient spectrum prediction and inverse design for plasmonic waveguide systems based on artificial neural networks,” Photonics Res., vol. 7, no. 3, pp. 368–380, 2019. https://doi.org/10.1364/prj.7.000368.Search in Google Scholar

[53] Y. Chen, J. Zhu, Y. Xie, N. Feng, and Q. H. Liu, “Smart inverse design of graphene-based photonic metamaterials by an adaptive artificial neural network,” Nanoscale, vol. 11, no. 19, pp. 9749–9755, 2019. https://doi.org/10.1039/c9nr01315f.Search in Google Scholar PubMed

[54] Z. Liu, D. Zhu, S. P. Rodrigues, K.-T. Lee, and W. Cai, “Generative model for the inverse design of metasurfaces,” Nano Lett., vol. 18, no. 10, pp. 6570–6576, 2018. https://doi.org/10.1021/acs.nanolett.8b03171.Search in Google Scholar PubMed

[55] Z. Liu, D. Zhu, K.-T. Lee, A. S. Kim, L. Raju, and W. Cai, “Compounding meta-atoms into metamolecules with hybrid artificial intelligence techniques,” Adv. Mater., vol. 32, no. 6, p. 1904790, 2020. https://doi.org/10.1002/adma.201904790.Search in Google Scholar PubMed

[56] W. Ma, F. Cheng, and Y. Liu, “Deep-learning-enabled on-demand design of chiral metamaterials,” ACS Nano, vol. 12, no. 6, pp. 6326–6334, 2018. https://doi.org/10.1021/acsnano.8b03569.Search in Google Scholar PubMed

[57] J. Jiang and J. A. Fan, “Simulator-based training of generative neural networks for the inverse design of metasurfaces,” Nanophotonics, vol. 9, no. 5, pp. 1059–1069, 2020.10.1515/nanoph-2019-0330Search in Google Scholar

[58] J. Jiang and J. A. Fan, “Global optimization of dielectric metasurfaces using a physics-driven neural network,” Nano Lett., vol. 19, no. 8, pp. 5366–5372, 2019. https://doi.org/10.1021/acs.nanolett.9b01857.Search in Google Scholar PubMed

[59] F. Wen, J. Jiang, and J. A. Fan, “Robust freeform metasurface design based on progressively growing generative networks,” ACS Photonics, vol. 7, no. 8, pp. 2098–2104, 2020. https://doi.org/10.1021/acsphotonics.0c00539.Search in Google Scholar

[60] R. Pestourie, Y. Mroueh, T. V. Nguyen, P. Das, and S. G. Johnson, “Active learning of deep surrogates for pdes: application to metasurface design,” npj Comput. Mater., vol. 6, no. 1, pp. 1–7, 2020. https://doi.org/10.1038/s41524-020-00431-2.Search in Google Scholar

[61] C. C. Nadell, B. Huang, J. M. Malof, and W. J. Padilla, “Deep learning for accelerated all-dielectric metasurface design,” Opt. Express, vol. 27, no. 20, pp. 27523–27535, 2019. https://doi.org/10.1364/oe.27.027523.Search in Google Scholar PubMed

[62] T. Qiu, X. Shi, J. Wang, et al.., “Deep learning: a rapid and efficient route to automatic metasurface design,” Adv. Sci., vol. 6, no. 12, p. 1900128, 2019. https://doi.org/10.1002/advs.201900128.Search in Google Scholar PubMed PubMed Central

[63] Z. A. Kudyshev, V. AlexanderKildishev, V. M. Shalaev, and A. Boltasseva, “Machine-learning-assisted metasurface design for high-efficiency thermal emitter optimization,” Appl. Phys. Rev., vol. 7, no. 2, p. 021407, 2020. https://doi.org/10.1063/1.5134792.Search in Google Scholar

[64] Z. A. Kudyshev, A. V. Kildishev, V. M. Shalaev, and A. Boltasseva, “Machine learning–assisted global optimization of photonic devices,” Nanophotonics, vol. 10, no. 1, pp. 371–383, 2021.10.1515/9783110710687-028Search in Google Scholar

[65] L. Li, H. Ruan, C. Liu, et al.., “Machine-learning reprogrammable metasurface imager,” Nat. Commun., vol. 10, no. 1, pp. 1–8, 2019. https://doi.org/10.1038/s41467-019-09103-2.Search in Google Scholar PubMed PubMed Central

[66] R. Hegde, “Sample-efficient deep learning for accelerating photonic inverse design,” OSA Continuum, vol. 4, no. 3, pp. 1019–1033, 2021. https://doi.org/10.1364/osac.420977.Search in Google Scholar

[67] M. H. Tahersima, K. Kojima, T. Koike-Akino, et al.., “Deep neural network inverse design of integrated photonic power splitters,” Sci. Rep., vol. 9, no. 1, pp. 1–9, 2019. https://doi.org/10.1038/s41598-018-37952-2.Search in Google Scholar PubMed PubMed Central

[68] A. M. Hammond, E. Potokar, and R. M. Camacho, “Accelerating silicon photonic parameter extraction using artificial neural networks,” OSA Continuum, vol. 2, no. 6, pp. 1964–1973, 2019. https://doi.org/10.1364/osac.2.001964.Search in Google Scholar

[69] L. Pilozzi, F. A. Farrelly, G. Marcucci, and C. Conti, “Machine learning inverse problem for topological photonics,” Commun. Phys., vol. 1, no. 1, pp. 1–7, 2018. https://doi.org/10.1038/s42005-018-0058-8.Search in Google Scholar

[70] D. Sell, J. Yang, S. Doshay, R. Yang, and J. A. Fan, “Large-angle, multifunctional metagratings based on freeform multimode geometries,” Nano Lett., vol. 17, no. 6, pp. 3752–3757, 2017. https://doi.org/10.1021/acs.nanolett.7b01082.Search in Google Scholar PubMed

[71] V. Neder, Y. Ra’di, A. Andrea, and P. Albert, “Combined metagratings for efficient broad-angle scattering metasurface,” ACS photonics, vol. 6, no. 4, pp. 1010–1017, 2019. https://doi.org/10.1021/acsphotonics.8b01795.Search in Google Scholar PubMed PubMed Central

[72] S. Inampudi and H. Mosallaei, “Neural network based design of metagratings,” Appl. Phys. Lett., vol. 112, no. 24, p. 241102, 2018. https://doi.org/10.1063/1.5033327.Search in Google Scholar

[73] Y. Kiarashinejad, S. Abdollahramezani, and A. Ali, “Deep learning approach based on dimensionality reduction for designing electromagnetic nanostructures,” npj Comput. Mater., vol. 6, no. 1, pp. 1–12, 2020. https://doi.org/10.1038/s41524-020-0276-y.Search in Google Scholar

[74] M. Chen, J. Jiang, and J. A. Fan, “Design space reparameterization enforces hard geometric constraints in inverse-designed nanophotonic devices,” ACS Photonics, vol. 7, no. 11, pp. 3141–3151, 2020. https://doi.org/10.1021/acsphotonics.0c01202.Search in Google Scholar

[75] Z. Liu, Z. Zhu, and W. Cai, “Topological encoding method for data-driven photonics inverse design,” Opt. Express, vol. 28, no. 4, pp. 4825–4835, 2020. https://doi.org/10.1364/oe.387504.Search in Google Scholar

[76] M. Zandehshahvar, Y. Kiarashi, M. Zhu, H. Maleki, T. Brown, and A. Adibi, “Manifold learning for knowledge discovery and intelligent inverse design of photonic nanostructures: breaking the geometric complexity,” arXiv preprint arXiv:2102.04454, 2021.10.1021/acsphotonics.1c01888Search in Google Scholar

[77] J. Trisno, H. Wang, H. T. Wang, R. J. H. Ng, S. D. Rezaei, and J. K. W. Yang, “Applying machine learning to the optics of dielectric nanoblobs,” Adv. Photon. Res., vol. 1, no. 2, p. 2000068, 2020. https://doi.org/10.1002/adpr.202000068.Search in Google Scholar

[78] J. Jiang and J. A. Fan, “Multiobjective and categorical global optimization of photonic structures based on resnet generative neural networks,” Nanophotonics, vol. 10, no. 1, pp. 361–369, 2021. https://doi.org/10.1515/9783110710687-027.Search in Google Scholar

[79] C. Yeung, Ju-M. Tsai, B. King, et al.., “Multiplexed supercell metasurface design and optimization with tandem residual networks,” Nanophotonics, vol. 10, no. 3, pp. 1133–1143, 2021. https://doi.org/10.1515/nanoph-2020-0549.Search in Google Scholar

[80] T. Shibanuma, P. Albella, and S. A. Maier, “Unidirectional light scattering with high efficiency at optical frequencies based on low-loss dielectric nanoantennas,” Nanoscale, vol. 8, no. 29, pp. 14184–14192, 2016. https://doi.org/10.1039/c6nr04335f.Search in Google Scholar PubMed

[81] K. D. Donda and R. Hegde, “Optimal design of beam-deflectors using extended unit-cell metagratings,” Prog. Electromagn. Res. M, vol. 77, pp. 83–92, 2019.10.2528/PIERM18092801Search in Google Scholar

[82] S. A. Mann, R. R. Grote, R. M. Osgood, and J. A. Schuller, “Dielectric particle and void resonators for thin film solar cell textures,” Opt. Express, vol. 19, no. 25, pp. 25729–25740, 2011. https://doi.org/10.1364/oe.19.025729.Search in Google Scholar

[83] A. Y. Piggott, J. Petykiewicz, L. Su, and J. Vučković, “Fabrication-constrained nanophotonic inverse design,” Sci. Rep., vol. 7, no. 1, pp. 1–7, 2017. https://doi.org/10.1038/s41598-017-01939-2.Search in Google Scholar PubMed PubMed Central

[84] D. Wu, “Pool-based sequential active learning for regression,” IEEE Trans. Neural Netw. Learn. Syst., vol. 30, no. 5, pp. 1348–1359, 2018. https://doi.org/10.1109/TNNLS.2018.2868649.Search in Google Scholar PubMed

[85] R. Schmidt, M. Voigt, M. Pisaroni, et al., “General introduction to Monte Carlo and multi-level Monte Carlo methods,” in Uncertainty Management for Robust Industrial Design in Aeronautics, Cham, Springer, 2019, pp. 265–278.10.1007/978-3-319-77767-2_16Search in Google Scholar

[86] R. Schmidt, M. Voigt, and R. Mailach, “Latin hypercube sampling-based Monte Carlo simulation: extension of the sample size and correlation control,” in Uncertainty Management for Robust Industrial Design in Aeronautics, Cham, Springer, 2019, pp. 279–289.10.1007/978-3-319-77767-2_17Search in Google Scholar

[87] R. Storn and K. Price, “Differential evolution–a simple and efficient heuristic for global optimization over continuous spaces,” J. Glob. Optim., vol. 11, no. 4, pp. 341–359, 1997. https://doi.org/10.1023/a:1008202821328.10.1023/A:1008202821328Search in Google Scholar

[88] P. Rocca, G. Oliveri, and A. Massa, “Differential evolution as applied to electromagnetics,” IEEE Antennas Propag. Mag., vol. 53, no. 1, pp. 38–49, 2011. https://doi.org/10.1109/map.2011.5773566.Search in Google Scholar

[89] S. P. Soumyashree and S. H. Ravi, “Transmission-mode all-dielectric metasurface color filter arrays designed by evolutionary search,” J. Nanophotonics, vol. 14, no. 1, p. 016014, 2020.10.1117/1.JNP.14.016014Search in Google Scholar

[90] S. P. Soumyashree, H. S. Vyas, and S. H. Ravi, “Robust inverse design of all-dielectric metasurface transmission-mode color filters,” Opt. Mater. Express, vol. 10, no. 12, pp. 3145–3159, 2020.10.1364/OME.409186Search in Google Scholar

[91] S. S. Panda and R. S. Hegde, Learning Models for Metagratinngs Design, 2021. Available at: https://bitbucket.org/soumyashree_1311/learning-models-for-metagrating-design.Search in Google Scholar

[92] V. Liu and S. Fan, “S4: a free electromagnetic solver for layered periodic structures,” Comput. Phys. Commun., vol. 183, no. 10, pp. 2233–2244, 2012. https://doi.org/10.1016/j.cpc.2012.04.026.Search in Google Scholar

[93] M. Zandehshahvar, Y. Kiarashi, M. Chen, R. Barton, and A. Ali, “Inverse design of photonic nanostructures using dimensionality reduction: reducing the computational complexity,” Opt. Lett., vol. 46, no. 11, pp. 2634–2637, 2021. https://doi.org/10.1364/ol.425627.Search in Google Scholar

[94] S. H. Ravi, “Photonics inverse design: pairing deep neural networks with evolutionary algorithms,” IEEE J. Sel. Top. Quantum Electron., vol. 26, no. 1, pp. 1–8, 2019.10.1109/JSTQE.2019.2933796Search in Google Scholar


Supplementary Material

The online version of this article offers supplementary material (https://doi.org/10.1515/nanoph-2021-0540).


Received: 2021-09-20
Accepted: 2021-11-22
Published Online: 2021-12-08

© 2021 Soumyashree S. Panda and Ravi S. Hegde, published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 11.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/nanoph-2021-0540/html
Scroll to top button