#### Research Article Guangshun Qiao\* and Yulin Lu # Operating Efficiency in the Capital-Intensive Semiconductor Industry: A Nonparametric Frontier Approach https://doi.org/10.1515/econ-2022-0050 received January 31, 2023; accepted October 05, 2023 **Abstract:** This article uses a nonparametric production frontier approach to investigate the operating efficiency differences by the impacts of capital expenditure and business model in the global semiconductor industry. Handling the impact of capital expenditure as a fixed input by the directional distance estimator, this study compares the operating efficiencies in the global semiconductor industry between the integrated device manufacturers and the fabless and foundry firms over 1999–2018. The estimation results indicate that the operating efficiencies do vary in the semiconductor by the business model. The vertically integrated manufacturers dominate the semiconductor industry, and the capital-intensive manufacturers operate more efficiently than the assetlight fabless firms on average. **Keywords:** semiconductor industry, CAPEX, integrated device manufacturer, nonparametric frontier, directional distance estimator ### 1 Introduction Integrated circuits (ICs) are essential components of virtually all modern electronic devices. Since Bell laboratories invented the transistors in 1947 and Texas Instruments released the first working IC in 1958, the semiconductor industry, which is the aggregate of companies engaged in the design and fabrication of semiconductor devices or IC chips, has been at the forefront of the digital economy for decades. From laptop to smartphone and artificial intelligence (AI), semiconductor devices are present in nearly all aspects of modern technology. The personal computer revolution in the 1970–1980s was a result of advances in semiconductor technology, such as the Intel 8,008 microprocessor (Ceruzzi, 1996). In the development and expansion of the World Wide Web revolution in the 1990s, application-specific integrated circuits played a significant role in enabling fast and efficient networking and data processing, contributing to the growth of web-based technologies and consumer electronics (Makimoto, 2002). The rise of the smartphone in the 2010s was supported by higher-performance system-on-a-chip (SoC), such as the Apple A series and Qualcomm Snapdragon series. ChatGPT, latterly the most popular deep learning workload, required significant computational power and was trained on Nvidia¹ graphics processing units (GPUs). Due to the growth in emerging technologies such as AI, cloud computing, Internet of Things, 5G networks, autonomous vehicles, industrial automation, and renewable energy systems, the needs for more powerful, energy-efficient, and miniaturized semiconductor devices have been consistently increasing. The 2023 Semiconductor Industry Association (SIA) factbook reported that the global semiconductor sales reached the highest-ever annual total of \$574 million in 2022. McKinsey's preliminary forecast shows that the global semiconductor industry is poised to become a trillion-dollar industry by 2030 (Burkacky et al., 2022). ## 1.1 Structural Change History in the Semiconductor Industry The semiconductor industry has a long history of structural change. Prior to the 1980s, a few integrated device Yulin Lu: School of Accounting, Jilin Business and Technology College, Jilin, 130507, China <sup>\*</sup> Corresponding author: Guangshun Qiao, School of Finance and Trade, Wenzhou Business College, Wenzhou, 325035, China, e-mail: qiaoquangshun@wzbc.edu.cn <sup>1</sup> Nvidia Corporation (Nasdaq: NVDA) is a leading technology company specializing in the design and development of GPUs and AI computing solutions. <sup>2</sup> https://www.semiconductors.org/the-2023-sia-factbook-your-source-for-semiconductor-industry-data/. manufacturers (IDMs), such as Intel, Infineon,<sup>3</sup> ST,<sup>4</sup> and Texas Instruments, were the dominant players in the semiconductor industry. These IDMs have in-house capabilities to perform all of the production processes (e.g., research and design [R&D], front-end fabrication, and back-end assembly and test [A&T]). For example, as a leading IC manufacturer, Intel has several fab production sites (located in the US, Ireland, Israel, etc.), A&T sites (located in the US, China, Malaysia, Vietnam, etc.), and tens of thousands employees and partners all over the world. The vertical integration of managing the entire production process internally allows Intel to have greater control over quality, intellectual property (IP), and the ability to optimize the manufacturing process for specific needs (Malone, 2014). The semiconductor industry is renowned for its rapid technological advancements. This dynamic field continually pushes the boundaries of innovation, driving progress in areas of miniaturization, performance improvements, power efficiency, and emerging technologies. For example, the semiconductor industry had experienced significant shrinks in process technology nodes,<sup>5</sup> from around 130 nm in 2000, to 32 nm in 2010, and 7 nm in 2020 (Flamm, 2017). However, due to physical constraints, manufacturing challenges, heat dissipation, and power consumption issues, semiconductors with ever-expanding complexity approach the limits of Moore's law<sup>6</sup> (Mack, 2011). The expenses of building a semiconductor fabrication facility had increased from around \$1 billion in the 2000s to more than \$10 billion nowadays (e.g., see Ibrahim et al., 2014; Lambrechts et al., 2018). The substantial increase in fab construction costs became prohibitive for almost all the IC suppliers. It stimulated business model innovation in the semiconductor industry and gave birth to the fabless–foundry business model in the mid-1980s (Sarma & Sun, 2017). In the fabless-foundary business model, fabless companies dedicate their time to IC design and brand operation, while pure-play foundries devote themselves to frontend fabrication, and a third group of companies are allotted for back-end outsourced semiconductor assembly and test (OSAT) operations. By specializing in IC design with brand operation, fabless companies can concentrate their efforts and resources on creating differentiated and competitive chips where technological innovation meets strategic marketing, leveraging the manufacturing capabilities of semiconductor foundries and OSATs (Hurtarte et al., 2011). At the same time, foundries and OSATs take care of the actual manufacturing process, fabrication, and quality control, allowing fabless companies to focus on their core competencies, resulting in a more flexible semiconductor business ecosystem. A milestone of the vertical disintegration in the semiconductor industry is the establishment of the Taiwan Semiconductor Manufacturing Company (TSMC) in 1987. TSMC introduced the concept of a pure-play semiconductor foundry, specializing solely in the manufacturing of ICs and committed to be a long-term non-competitive partner with the fabless firms. By continuously investing in advanced process technologies, TSMC quickly established itself as a leader in semiconductor manufacturing. Presently, TSMC is the first company to commercialize the 7nm process technology<sup>7</sup> and the largest dedicated semiconductor foundry worldwide.8 Its advanced process technologies, highly automated manufacturing facilities, the ability to scale production, and a strong focus on customer satisfaction have helped TSMC to build long-term relationships with its customers, enabling it to capture a significant share of the highend semiconductor market and making it a crucial player in the semiconductor ecosystem (e.g., see Hsieh et al., 2002). ### 1.2 CAPEX Plays a Crucial Role The fabless–foundry business model has significantly changed the structure of the semiconductor value chain over the last **<sup>3</sup>** Infineon Technologies AG (Infineon) is Germany's largest semiconductor manufacturer. It was established on April 1, 1999, as a spinoff from Siemens AG, taking over Siemens' semiconductor operations including its research, development, manufacturing, and sales activities. Infineon has been a leader in developing and producing high-quality power MOSFETs (metal-oxide-semiconductor field-effect transistors) for various applications. **<sup>4</sup>** STMicroelectronics (ST) is a global semiconductor company headquartered in Geneva, Switzerland. It was formed in 1987 through a merger between the semiconductor divisions of Italy's STET (Società Finanziaria Telefonica) and France's Thomson Semiconducteurs. **<sup>5</sup>** Historically, the process technology node (also process node or simply node) was named after the physical gate length of a transistor. Shrinks in process node offer numerous advantages, including increased transistor density, improved performance, enhanced power efficiency, cost reduction, miniaturization, advanced functionality, and manufacturing advancements. **<sup>6</sup>** Moore's law was first observed in 1965 and later revised in 1975 by Gorden Moore, the co-founder and chairman emeritus of Intel. Moore's law states that the number of components per IC doubles about every 2 years. **<sup>7</sup>** The 7 nm process has helped TSMC win major customers such as Apple (A series Bionic chips), Qualcomm (Snapdragon 800-series chipsets), and Nvidia (RTX 30 series GPUs). **<sup>8</sup>** TSMC had invested more than \$16 Billion at its Fab 18, a new fab in the Southern Taiwan Science Park for producing 5nm and 3nm process technology. few decades and has been a topic of wide interest (e.g., see Adner & Kapoor, 2010; Macher et al., 2007; Sarma & Sun, 2017). Many factors might affect production costs and productivity in the innovation-driving IC industry, such as capital allocation (Brown et al., 2005), technological capability (Park et al., 2021; Sher & Yang, 2005), business model (Shin et al., 2017), device type (Park et al., 2018), institutional factor (Gugler & Siebert, 2007; Lu et al., 2013; Walheer & He, 2020), and foreign competition (Henderson & Scott, 2018). The major challenges in the capital-intensive semiconductor industry are the heavy capital expenditure (CAPEX) for cleanroom and costly equipment for front-end fabrication and back-end A&T procedures. Cleanrooms must be free of all airborne particles, which requires advanced filtration systems, controlled air flow, and rigorous cleaning procedures. Additionally, due to the complexity of the manufacturing processes and the need for precision and accuracy in the production of semiconductor chips, the equipment used in front-end fabrication and back-end A&T is costly (Monch et al., 2012). For example, extreme ultraviolet lithography (EUV)<sup>9</sup> is a critical technology for advanced semiconductor manufacturing processes. Each EUV machine made by ASML<sup>10</sup> costs around \$200 million or even higher. The high costs associated with CAPEX can be a significant barrier for new companies looking to enter the semiconductor industry, as well as a challenge for existing companies looking to expand their manufacturing capacity (Powell et al., 2015). Another challenge in the semiconductor industry is the cyclicality of demand (Rastogi et al., 2011; Tan & Mathews, 2010). The semiconductor industry is highly dependent on end-market demand, which can be volatile and subject to rapid shifts. It is difficult for companies to accurately forecast demand and plan production capacity, which might lead to costly overproduction or underproduction. In order to achieve full-capacity utilization, the foundries and OSATs seek to optimize productivity by serving many fabless companies, while even IDMs are renting their idle capacity to competitors to reduce the financial burden. It also triggered many of the IDMs to start outsourcing manufacturing partially from the dedicated foundries and became fab-lite<sup>11</sup> (Saha, 2015). In comparison, the fabless companies, mostly startups or spin-offs that are getting rid of the burden in setting up, maintaining, and upgrading fabs, are more flexible to integrate within local knowledge networks and focus in less crowded niche markets, specialized applications, or innovative technologies to explore technological diversity and comparative advantage, and compete with the IDMs. Besides the huge equipment expenditures, R&D costs for developing leading-edge products such as microprocessors and radiofrequency devices also raise steadily. With the shrinking of process node, technological complexity and design complexity increase exponentially. The slow progress in node technology requires continuous investments in both R&D and advanced fabrication facilities. Based on the 2023 SIA factbook, US semiconductor companies accounted for sales totaling \$275 billion in 2022, or 48% of the global market. At the same time, US semiconductor firms also invested \$58.8 billion in R&D, the highest in history to remain competitive in the industry. The uncertainty of R&D investments and the IP protection by incumbents set high barriers to entry and favor the success of large IDMs, such as Intel, ST, and Texas Instruments, which are able to make risky investments and thus have a higher chance to foresee and lead the technology evolutions. #### 1.3 Trade-offs Between Business Models There has been a long-lasting debate on which business model is operating more efficiently, or which business model is more likely to dominate the semiconductor industry. On the one hand, the reduced barriers to entry by vertical specialization drastically reduces the burden of CAPEX and ensures the domination of new markets by the fabless design houses. The entry of new fabless companies, most of which are spinoffs from industry incumbents, spur innovation and propel the diversification of products in various applications (Pellens & Della Malva, 2018). Furthermore, vertical disintegration in the semiconductor value chain is accompanied and twisted by the trend of industry globalization (Brown et al., 2005). Since the 1990s, fabless firms have had substantial shares or even dominated in most of the fastest-growing market segments (Balconi & Fontana, 2011). Foundries are also becoming technology transferors rather than merely manufacturing capacity providers in the semiconductor value chain (Li et al., 2011). In addition, the collaboration between the asset-light fabless and the pure-play foundry provides more robust protection of IP rights (Sarma & Sun, 2017). When the fabless firms pass on their design blueprints to <sup>9</sup> EUV uses extreme ultraviolet light with a wavelength of near 13.5 nm to create intricate patterns on silicon wafers, allowing for the production of smaller and more complex ICs. <sup>10</sup> Advanced semiconductor material lithography (ASML) is a Dutch multinational corporation and the sole supplier in the world of EUV photolithography machines used to manufacture the most advanced chips, targeting 5 and 3 nm process nodes. <sup>11</sup> Fab-lite, also called fab-light, refers to a semiconductor company that retains some in-house fabrication facilities but also relies on outsourcing a large portion of its production to external foundries. pure-play foundries, the threats of replication and the risk of IP theft are relatively low, comparing with the early years when fabless firms' ICs could only be manufactured by their rival IDMs. On the other hand, despite a trend toward vertical specialization driven by the entry of fabless firms, the vertically integrated IDMs have continued to persist and coexist with the fabless entrants in the semiconductor industry. Dibiaggio (2007) and Monteverde (1995) credit the efficiency of IDMs to the internalization of transaction costs. Ernst (2005), Macher (2006), and Kapoor and Adner (2012) hold the knowledge-based view that the IDMs achieve performance advantages when technological developments involve complex problems. Kapoor (2013) proposed and found that the incumbents who persist with vertical integration increase their emphasis on systemic innovations. Due to the inherently increasing complexity of the semiconductor supply chain, currently there does not exist an adequate reference model for the semiconductor industry, and more appropriate and state-of-the-art models are in great demand to analyze the semiconductor supply chain (Monch et al., 2012). The semiconductor industry is commonly characterized as both technology-intensive and capital-intensive. Much research on the topic of structural change in the semiconductor industry emphasizes the evolution of technology (Chen et al., 2019; Cho, 2020; Hwang & Choung, 2014; Shin et al., 2017). The impacts of capital investments in the semiconductor industry have not been discussed adequately. Besides the research shown earlier that focus on analyzing the impacts of technology evolution in the semiconductor industry, this study plans to emphasize the feature of capital intensive in comparing the trade-offs of business model in the semiconductor industry. This study applies up-to-date econometric methods to handle the impacts of CAPEX as a lump sum fixed input and explores the differences of operating efficiency by business model in the highly dynamic semiconductor industry in the past two decades. The methodological contributions of article will be discussed in the next subsections. #### 1.4 Brief Literature Review Taking advantage of a flexible functional form, data envelopment analysis (DEA) is one of the most popular approaches for efficiency estimation. There are rich records for performance evaluation in the semiconductor industry using the DEA approach. For instance, Kozmetsky and Yue (1998) examined the cost efficiency of 56 IC companies worldwide and showed that US, Japanese, South Korean, and Taiwanese IC companies had become the major participants in the global semiconductor industry in the early 1990s. Lu and Hung (2010) compared the managerial performance efficiency of 48 leading vertically disintegrated firms in Taiwan's IC value chain and noted that fabless companies perform better than foundries and OSATs. Jang et al. (2016) measured the cumulative change in R&D efficiency of 49 global leading fabless companies and noted that during the period 2007–2013, the overall R&D efficiency declined slightly. Li et al. (2019) explored 64 major Chinese enterprises in the semiconductor industry and found that the most significant factor limiting future improvements to innovation efficiency was a low level of scale efficiency. One common problem of these studies, among others, such as Lu et al. (2013), Hsu (2015), Hung et al. (2014), and Tsai et al. (2017), is the slow convergence rate of the nonparametric DEA estimator. 12 Accompanied by the increasing numbers of input and output dimensions, the convergence rate in DEA estimation is decreasing sharply. In cases when the observations are restricted to a small number either by geographic boundary or by business model boundary, the issue of slow convergence rate in DEA estimation may become severe and critical. For example, the research of Kuo and Yang (2012), Lu and Hung (2010), and Wu et al. (2006) used a small number of 38–39 companies to evaluate the performance of the fabless corporations in Taiwan, while in some extreme cases, such as Chen and Chen (2011), Hung and Lu (2008), Lin et al. (2019), and Liu and Wang (2008), the studies contained only 10-25 companies. The effective parametric sample size of the DEA estimators in these approaches was very small, which might lead to unconvincing results.13 #### 1.5 Methodological Features The main methodological feature in this study, after highlighting the slow convergence rate in DEA estimation, is to provide an empirical example of choosing the appropriate estimation methods in analyzing the operational efficiencies of the semiconductor industry, aiming to gain a faster convergence rate and hence a lower order of estimation error. **<sup>12</sup>** The convergence rate quantifies how fast the estimation error decreases when increasing the sample size n. Generally, the convergence rate in a linear regression is $n^{-\frac{1}{2}}$ , while the convergence rate of a nonparametric estimator is slower than its parametric counterpart. **13** For example, Hung and Lu (2008) used the DEA to estimate efficiencies of 25 companies by 4 input and 3 output variables. The effective parametric sample size of Hung and Lu (2008) was only $25\frac{2}{4+3} \approx 3$ . Appendix A explains how to calculate the effective parametric sample size. The semiconductor industry is, indeed, a highly globalized industry. For example, Silicon Valley, located in the southern San Francisco Bay Area of California, US, renowned for its collaborative ecosystem that fosters entrepreneurship and innovation, has attracted a cluster of famous fabless semiconductor companies, such as Nvidia, Qualcomm, 14 AMD, 15 and Xilinx.<sup>16</sup> While the US has a strong presence in the fabless semiconductor sector, there are also fabless companies based in other countries such as the UK, Israel, Japan, and China. In order to support the growth of the fabless semiconductor industry on a global scale, foundries and OSATs are located in various regions around the world to cater to the global demand for semiconductor production and ensure a diverse supply chain for semiconductor manufacturing. Hence, this study considers the deeply globalized semiconductor value chain as an aggregated industry and collects data on 470 semiconductor companies all over the world with 5,136 observations in 1999–2018. The global database not only provides a worldwide perspective of the semiconductor industry, but also gains a faster convergence rate in DEA estimation. Furthermore, this study uses a dimensionality reduction technique to further improve the convergence rate. Another methodological feature in this article is to treat the CAPEX as a fixed input by using the directional distance measure. CAPEX is a kind of lump sum investment, which involves significant upfront expenses that are expected to yield long-term benefits. By allocating funds toward acquiring new technology, upgrading infrastructure, and expanding facilities, companies can enhance productivity, improve operational efficiency, and leverage the latest technological advancements to stay competitive in the market. Although CAPEX decisions play a crucial role in determining the level of productivity and technology within an organization, in the short run, CAPEX is not under managers' direct control. The directional distance estimator provides a convenient method to distinguish the non-discretionary fixed input CAPEX with other variables. The setting of this method will be discussed further in the next section. Another methodological feature in this article is to investigate the impacts of the business model in the semiconductor industry through a conditional nonparametric frontier approach. Recent developments in nonparametric ### 1.6 Findings and Organization This article aims to follow the Daraio et al.'s (2020) approach to shed light on disentangling the impact of capital investments and comparing the technical efficiencies between IDMs and vertically disintegrated fabless and foundry firms in the semiconductor industry. The estimation results indicate that CAPEX plays a crucial role in the semiconductor industry and vertically integrated manufacturers dominate the industry. Since semiconductor companies heavily rely on CAPEX to acquire advanced equipment, to establish and upgrade manufacturing facilities, and to develop cuttingedge technologies, the capital-intensive IDMs and OSATs operate more efficiently than the asset-light fabless firms on average. It is worth noting that such kind of operating efficiency is probably not attributed to management improvement, but generated from subsidies or M&A (VerWey, 2019). In fostering a healthy and competitive semiconductor ecosystem, we suggest to tilt the subsidies of the semiconductor industry towards the fabless sector to encourage more innovation and diversification. This article is organized as follows. Section 2 explains the nonparametric frontier framework and discusses the diagnostics and test statistics to choose a suitable estimator in this research. Section 3 introduces the data and defines the variables. Section 4 presents the empirical results and discusses the effect of capital investment and business model in the semiconductor industry. Section 5 concludes. # 2 Methodology ### 2.1 DEA Approach The economic theory of efficiency in production can be traced to Farrell (1957). Attributed by its flexibility and adaptability, DEA (Charnes et al., 1978) is considered the frontier estimation (Daraio & Simar, 2014; Daraio et al., 2020) provide tools to analyze the operating efficiencies in the semiconductor industry under various types of constraints such as capital investments and the business model. While the heterogeneity by CAPEX is treated as a fixed input variable by the directional distance estimator (Daraio et al., 2020), the heterogeneity by the business model is handled by the conditional efficiency estimators (Daraio & Simar, 2007). In addition, separability test recommended by Simar and Wilson (2020) is applied to choose the optimal conditions considering both the heterogeneities by the business model and time. <sup>14</sup> Qualcomm holds significant IPs and patents related to Code Division Multiple Access (CDMA) technology and has established itself as a leader in mobile technology with its Snapdragon processors. <sup>15</sup> Advanced Micro Devices, Inc. (AMD), headquartered in Santa Clara, is a major fabless semiconductor company known for its CPUs. AMD competes with Intel in the computer processor market. <sup>16</sup> Xilinx, Inc., located in San Jose, is a well-known fabless company specializing in field-programmable gate arrays (FPGAs) and SoCs. mainstream approach in frontier analysis for assessing technical efficiency.<sup>17</sup> A large and growing literature has developed on the application of the DEA approach in the semiconductor industry (e.g., see Jang et al., 2016; Li et al., 2019; Sueyoshi & Ryu, 2020; Tsai et al., 2017; Zhou et al., 2020). This article follows the DEA approach and applies the latest methodological advancements of Bădin et al. (2012), Daraio et al. (2020), and Simar and Wilson (2020) to address the impacts of CAPEX, business model, and time in the ever-evolving semiconductor industry. Production theory primarily examines how the production process works within a firm to combine p inputs to achieve the desired level of q outputs and analyzes the factors that affect production decisions. The idea in the DEA approach is to estimate the efficiency score of a production plan (x, y), or the distance from (x, y) to the boundary of the production set $\Psi = \{(x, y) \in \mathbb{R}^{p+q}_+ | x \text{ can produce } y\}$ . As a nonparametric approach, DEA does not require explicit assumptions about the underlying production function, allowing for a flexible data-driven analysis. Hence, we select the DEA approach in handling the efficiency estimation of the semiconductor industry. #### 2.2 Directional Distance Measure There are four kinds of commonly used efficiency measures in the DEA approach, namely, input-oriented Debreu–Farrel measure, out-oriented Debreu–Farrel measure, hyperbolic measure (Wilson, 2012), and directional distance measure (Chambers et al., 1998). The input-, output-, and hyperbolic oriented measures are radial measures that allow for only nonnegative values. In contrast, the directional distance measure is an additive measure. The directional distance measure is given by: $$\beta(x, y | d_x, d_y, \Psi) = \sup\{\beta | (x - \beta d_x, y + \beta d_y) \in \Psi\}, (2.1)$$ which projects (x, y) onto the technology in a specified direction $(-d_x, d_y)$ . The directional distance measure $\beta(x, y||d_x, d_y, \Psi)$ nests the input- and output-oriented measure as a special case by setting the direction vector $(d_x, d_y)$ as (x, 0) and (0, y), respectively. The directional distance measure allows for negative values of x and y, as it adds the feasible quantities to a unit's output and simultaneously subtracts proportional quantities from its input. The choices of the directions $d_x$ and $d_y$ are also flexible. Some direction can be set equal to zero to represent a non-discretionary input or output (Simar & Vanhems, 2012). This feature is used to proxy CAPEX in the semiconductor industry in this study, by categorizing CAPEX into a fixed input that is not under managers' direct control in the short run. #### 2.3 Estimation of the Frontier The attainable set $\Psi$ is unobserved. Nonparametric methods such as DEA and free disposal hull (FDH) are developed to estimate the unobservable production set $\Psi$ . The FDH estimator $\widehat{\Psi}_{\text{FDH}}$ is defined as: $$\widehat{\Psi}_{\text{FDH}} = \bigcup_{X_i, Y_i \in S_n} \{ (x, y) \in \mathbb{R}_+^{p+q} | x \ge X_i, y \le Y_i \}, \quad (2.2)$$ where $S_n = \{(X_i, Y_i)\}$ . FDH estimator $\widehat{\beta}_{\text{FDH}}(x, y | d_x, d_y, \Psi)$ is obtained by replacing $\Psi$ with $\widehat{\Psi}_{\text{FDH}}$ . DEA estimator $\widehat{\Psi}_{\text{DEA}}$ is the convex hull of $\widehat{\Psi}_{\text{FDH}}$ (Banker et al., 1984). 18 The trade-off between FDH and DEA is not trivial. Simar and Wilson (2015) summarized that the FDH and DEA estimators converge to limiting distributions at rates of $n^{\frac{1}{p+q}}$ and $n^{\frac{2}{p+q+1}}$ , respectively. The convergence rate for the FDH and the DEA estimator slows down with the increasing of dimensionality p+q. To minimize the estimation error empirically, we can either increase the sample size n or decrease the total dimensions of p+q. If the sample size n is restricted to a small number by real-world constraints, including the market scale or scope of the industry, geographical or political restrictions, and the high cost of data collection, dimension reduction such as the principal component analysis (PCA) may become an attractive solution. Appendix A provides an introduction of PCA. After the diagnostics of dimension reduction, a test of convexity is recommended for the trade-off between $\widehat{\Psi}_{FDH}$ and $\widehat{\Psi}_{DEA}$ (Kneip et al., 2015). If the null hypothesis of convexity is rejected, the FDH estimator is the only consistent estimator. Alternatively, if the null hypothesis is not rejected, the DEA estimator might be preferred. However, the test of convexity (Kneip et al., 2015; Kneip et al., 2016) depends on randomly split the original sample into two independent subsamples for the bias term calculations. This study applies a **<sup>17</sup>** Lampe and Hilgers (2015) surveyed 4,782 publications on performance measurement in 1978–2012 and found that 4,021 were for DEA, and 761 were for stochastic frontier analysis (SFA). **<sup>18</sup>** To be more precise, the notation DEA is for the variable-returns-to-scale DEA in this study. **<sup>19</sup>** The estimation error will increase with the slowdown of convergence rate. This is known as the curse of dimensionality in nonparametric estimation. bootstrap algorithm (Simar & Wilson, 2020) for the convexity test to overcome this issue. factors. This article uses the pure efficiency to measure the impacts of business model and time in the semiconductor industry. ### 2.4 Conditional Efficiency Measures There exist factors such as the business model, constraints of technology and regulatory, and differences in ownership, which are beyond control of the manager but may influence the production process. These factors are denoted as environmental factors $Z \in \mathbb{R}^r$ . Daraio and Simar (2005) proposed to investigate the joint behavior of (X, Y, Z) in probability terms by defining the conditional attainable set as $\Psi^z$ = $\{(x,y) \in \mathbb{R}^{p+q}_+|x \text{ can produce } y \text{ when } Z=z\}$ . Note that $\Psi = \bigcup_{z \in Z} \Psi^z$ . The probability distribution of (X, Y) conditional on Z = z can be written as: $$H_{X,Y|Z}(x, y|z) = \text{Prob}(X \le x, Y \ge y|Z = z).$$ (2.3) A conditional directional distance measure is given by: $$\beta(x, y|d_x, d_y, z) = \sup\{\beta|H_{X,Y|Z}(x - \beta d_x, y + \beta d_y|z)$$ $$> 0\}.$$ (2.4) Plugging a nonparametric estimator of $H_{X,Y|Z}(\cdot)$ into equation (2.4) can derive the estimation of the conditional efficiency score accordingly.<sup>20</sup> #### 2.5 Second Stage Analysis In a particular case, Z has no impact on the boundaries of the $\Psi^z$ and $\Psi^z = \Psi$ . Simar and Wilson (2007, 2011) called it the separability condition and argued that if the separability condition is not hold, naive regression in a second-stage analysis may provide inconsistent estimation. Alternatively, Bădin et al. (2012, 2014) suggested a flexible nonparametric location-scale model $\beta(X, Y|Z=z) = \mu(z) + \sigma(z)\varepsilon$ in a secondstage regression, where $\mu(z)$ measures the average effect of z and $\sigma(z)$ provides additional information on the dispersion of the efficiency distribution. Bădin et al. (2012) derived the pure efficiency from the location-scale model as: $$\varepsilon(z) = \frac{\beta(x, y|z) - \mu(z)}{\sigma(z)}.$$ (2.5) The pure efficiency in equation (2.5) provides a measure of inefficiency whitened from the main effect of the environmental # 3 Data and Variable Specification The data are collected from the Sub-Industry of Semiconductors in the Compustat database. In order to provide a global perspective for the semiconductor value chain, we combined data from both the Compustat North America database and the Compustat Global database to cover companies in the industry worldwide. As the semiconductor industry is famous for being a cyclical industry (e.g., see Rastogi et al., 2011; Tan & Mathews, 2010), we gather 20 years of data in 1999-2018 to cover a sufficient period with multiple business cycles in the industry. We exclude liquid crystal display manufacturers, light-emitting diode manufacturers, and photovoltaic producers from the dataset, limiting the sample to only IC manufacturers in a narrow sense. Hence, the panel data include 5,136 observations from 470 unique companies in the global semiconductor industry in 1999–2018. The reason for the data to begin in 1999 is twofold. First, the global semiconductor value chain has been preliminarily established in the late 1990s since the inception of the fabless-foundry business model in the late 1980s. Since 1999, there are plenty of available annual reports for the fabless and foundry firms on the open market. Second, the year 1999 is a suitable starting point to observe the development trend in the global semiconductor industry. Two years after the 1997 Asian financial crisis, the semiconductor industry is in a golden decade without massive exogenous shocks until the 2008 financial crisis. Identifying the inputs and outputs of the production function has always been a subject of controversy, either in parametric or in nonparametric frontier estimations, without exception in the semiconductor industry. Hence, we sort the most commonly used variables in 37 empirical studies, which apply the DEA approach in the semiconductor industry. Besides a few variables chosen for specific topics, the commonly used variables in these articles are highly concentrated into two input and two output categories. The first input category measures variable inputs, including labor, raw material, R&D, sales, and marketing expenditure, while the second input category measures fixed assets. Therefore, we specify p = 5 inputs (labor, measured by the number of employees $[X_1]$ ; COGS $[X_2]$ ; R&D expenditure $[X_3]$ ; sales and marketing expenditure $[X_4]$ ; and fixed assets, measured by property, plant, and equipment [PP&E] $[X_f]$ ). Note that we distinguish the notation of <sup>20</sup> Appendix B introduces a fast and efficient computation of the directional distance measure. the fixed input $X_f$ from the other variable inputs $X_1$ , $X_2$ , $X_3$ , and $X_4$ . Comparably, the first output category measures revenue and the second output category measures the market value of the firms. Hence, we specify q=2 outputs (total revenue $[Y_1]$ ; and shareholders' equity, measured by common ordinary equity $[CEQ][Y_2]$ ). For the output variable $Y_2$ , we use shareholders' equity instead of the market value of a firm, because the variable of market value is suffering from missing data in the Compustat database, and the variable shareholders' equity is also a widely used proxy for the value of a firm. The same dataset had been used in Qiao and Wang (2021), but in this approach, we add the variable $X_f$ to emphasize the impact of CAPEX, and measure it as a fixed input by the directional distance estimator to achieve more robust results. Table 1 gives the summary statistics for the variables in 1999–2018 pooled data. In order to provide a uniform standard across years, all the variables except $X_1$ are expressed in US\$ millions, and their values have been adjusted to 2018 US dollar by GDP deflator. The distribution of all the variables is heavily skewed to the right, owning to the domination of several semiconductor giants in the market. Besides inputs and outputs, we specify r=2 environmental variables (business model $[Z_1]$ ; and time, measured by the years 1999–2018 $[Z_2]$ ). The business model $Z_1$ is a discrete variable, which categorizes the business models of fabless, IDM, foundry, and A&T into three groups. The first group contains the fabless, which are labor-intensive for IC design. The second group contains foundries and OSATs, that are capital-intensive for fabrication. The third group contains IDMs that are both labor-intensive and capital-intensive. Alternatively, $Z_2$ can either be a continuous variable or a discrete one. If $Z_2$ is treated as a continuous variable, choosing the optimal time windown for $Z_2$ is critical, which will be discussed further in the next section. Table 2 breaks down the 5,136 observations by the business model. It is no surprise that over half of the companies are fabless. As the barriers to entry, which rely heavily on CAPEX, are much lower for fabless than for the others, fabless companies spring up like the mushrooms in the late 1990s to the early 2000s. At the same time, the number of firms operating in other kinds of business models remains relatively stable. After the golden decade of fast growth in the semiconductor industry come to an end in the mid-2000s (e.g., see Flamm, 2017), the proportions of firms in each business model are gradually fixed. Around 60% of the firms are fabless, while 20% of the firms are IDMs and the rest 20% are either front-end wafer fabs or back-end OSATs. # 4 Empirical Results Most nonparametric estimators suffer from the curse of dimensionality. Based on the three diagnostics introduced in Appendix A, the necessity for dimension reduction is unambiguous. With seven dimensions (p = 5 and q = 2) in the original data, the effective parametric sample size m for annual data is small, no matter using FDH or DEA estimators. We calculate the values of the largest eigenvalue of the moment matrices of $\mathbf{XX}'$ and $\mathbf{YY}'$ to the corresponding sum of eigenvalues to be $R_x = 91.19\%$ and $R_y = 98.31\%$ , indicating high correlations among the inputs and among the outputs. Therefore, dimension reduction can reduce estimation error. A slight difference in processing PCA for the directional Table 1: Summary statistics for 1999–2018 pooled data | Variable | Min | Q1 | Median | Mean | Q3 | Max | |------------|-------|--------|---------|----------|---------|-----------| | $X_1$ | 0.001 | 0.160 | 0.486 | 3.082 | 2.011 | 107.600 | | $X_2$ | 0.001 | 24.170 | 88.008 | 475.252 | 301.645 | 18226.000 | | $X_3$ | 0.000 | 4.302 | 18.330 | 160.217 | 67.001 | 13543.000 | | $X_4$ | 0.549 | 5.885 | 20.087 | 125.406 | 67.185 | 1982.015 | | $X_f$ | 0.005 | 6.065 | 27.787 | 554.060 | 174.405 | 48976.000 | | $Y_1$ | 0.003 | 47.283 | 161.799 | 1064.110 | 563.655 | 70848.000 | | $Y_2$ | 0.175 | 44.279 | 151.749 | 1114.748 | 487.730 | 74563.000 | | Obs. | 5,136 | | | | | | | Uniq. Obs. | 470 | | | | | | Note. $X_1$ denotes the labor, $X_2$ denotes the cost of goods sold (COGS), $X_3$ denotes the R&D expenditure, $X_4$ denotes the sales and marketing expenditure, $X_f$ denotes the fixed assets, $Y_1$ denotes the revenue, $Y_2$ denotes the shareholders' equity, Obs. denotes the observations, and uniq. obs. denotes the companies. The unit of $X_1$ is thousand employees, and units of other variables are US\$ million. All values have been adjusted to 2018 US\$ by GDP deflator. Uniq. Obs. Table 2: Number of companies by business model Year ΑII **IDM** Foundry A&T **Fabless** Obs. 5,136 1,063 3,017 Obs. denotes the observations, and uniq. obs. denotes the companies. distance estimator is that PCA is only on the variable inputs and outputs, but not on the fixed input $X_f$ . Hence, after PCA, there remain three dimensions, namely, $\widetilde{X}$ (PCA from $X_1$ to $X_4$ ), $X_f$ , and $\widetilde{Y}$ (PCA from $Y_1$ to $Y_2$ ). The following analyses are based on data with PCA. Among studies that use the nonparametric frontier approach to estimate efficiency and benchmark performance of firms in the semiconductor industry, the vast majority choose the DEA estimator, without comparing the pros and cons between the FDH estimator and the DEA estimator. The DEA estimator is probably a better choice without dimension reduction, as the slower convergence rate of FDH estimator may increase measurement error rapidly with increasing dimensions. However, it is worth to reevaluate the trade-off between the FDH and DEA estimators with dimension reduction. The drawback of the DEA estimator is imposing convexity on the production set Ψ, while the FDH estimator is free of this assumption. The test of convexity is applied to measure this Table 3 provides the results of the convexity test.<sup>21</sup> At 95% confidence level, the null hypotheses of convexity are rejected for over 80% of the 20 years' annual data, except 3 21 We use the FEAR package (Wilson, 2008) for the convexity test. Table 3: Results of convexity test | Year | N | Statistic | <i>p</i> -value | | |------|-----|-----------|-----------------|--| | 1999 | 125 | 2.222 | 0.011 | | | 2000 | 149 | 1.594 | 0.006 | | | 2001 | 155 | 1.853 | 0.040 | | | 2002 | 213 | 3.138 | 0.005 | | | 2003 | 241 | 2.901 | 0.001 | | | 2004 | 264 | 3.440 | 0.000 | | | 2005 | 260 | 3.238 | 0.000 | | | 2006 | 267 | 3.651 | 0.000 | | | 2007 | 269 | 3.915 | 0.003 | | | 2008 | 278 | 3.227 | 0.008 | | | 2009 | 290 | 2.162 | 0.057 | | | 2010 | 300 | 2.890 | 0.006 | | | 2011 | 298 | 2.102 | 0.088 | | | 2012 | 301 | 1.014 | 0.174 | | | 2013 | 313 | 1.989 | 0.020 | | | 2014 | 302 | 3.552 | 0.001 | | | 2015 | 288 | 1.452 | 0.041 | | | 2016 | 283 | 2.053 | 0.018 | | | 2017 | 275 | 4.831 | 0.000 | | | 2018 | 265 | 4.963 | 0.000 | | We use 100 splits and 1,000 bootstrap replications. years (2009, 2011, and 2012) in the hyperbolic measure. Simar and Vanhems (2012) linked the directional distance measure with the hyperbolic measure by a monotonic transformation, so that the results in Table 3 are also valid for the directional distance estimator. Hence, we choose the FDH estimator. In order to consider a discrete environmental variable such as the business model $Z_1$ for the estimator in equation (2.4), the separability condition needs to be examined. We use a bootstrap algorithm (Simar & Wilson, 2020) for the separability test on the discrete environmental variable $Z_1$ . The first portion in Table 4 show the separability test results with respect to the business model $Z_1$ . Though the test statistics $\tau_1$ and $\tau_2$ not always give the same results, there is strong evidence to reject the separability condition. In other words, each of the three business models in semiconductor industry has a unique production frontier. For the environmental variable $Z_2$ , which represents the years 1999–2018, there is flexibility to treat it either as a discrete variable or as a continuous variable (Mastromarco & Simar, 2015). To treat $Z_2$ as a discrete variable, the 20 <sup>22</sup> In the directional distance measure, step [5] in Simar and Wilson (2020, p. 293) which is originally designed for the radial measures should be revised as $X_i^* = X_i - \widehat{\beta}_i \times d_x + \beta_i^* \times d_x$ and $Y_i^* = Y_i + \widehat{\beta}_i \times d_x$ $d_y - \beta_i^* \times d_y$ , where the directions of $d_x$ and $d_y$ are the sample mean of X and Y. **Table 4:** Test of separability conditional on $Z_1$ and $Z_2$ | | $ au_1$ | | $ au_2$ | | | | |--------------------------------|-----------|-----------------|-----------|-----------------|--|--| | | Statistic | <i>p</i> -value | Statistic | <i>p</i> -value | | | | Conditional on $Z_1$ | | | | | | | | Fabless vs IDM | 7.567 | 0.000 | 0.992 | 0.925 | | | | Fabless vs OSAT | 4.126 | 0.000 | 1.000 | 0.000 | | | | IDM vs OSAT | 3.883 | 0.000 | 0.870 | 0.715 | | | | Conditional on $Z_2$ | | | | | | | | Pooled vs optimal time | 3.230 | 0.000 | 0.983 | 0.000 | | | | Conditional on $Z_1$ and $Z_2$ | | | | | | | | 2-Year groups | 26.273 | 0.000 | 1.000 | 0.000 | | | | 4-Year groups | 18.572 | 0.000 | 1.000 | 0.000 | | | | 5-Year groups | 17.242 | 0.000 | 1.000 | 0.000 | | | | Optimal time | 19.891 | 0.000 | 1.000 | 0.000 | | | $Z_1$ denotes the business model, and $Z_2$ denotes the time. We use 10 splits and 1,000 bootstrap replications. $\tau_1$ is the averaging of the statistics across 10 splits. $\tau_2$ is the Kolmogorov–Smirnov statistic obtained earlier. years of 1999–2018 can be splitted into ten 2-year groups (two adjacent years as a group), five 4-year groups (four adjacent years as a group), or four 5-year groups (five adjacent years as a group). In this case, the separability test with respect to $Z_2$ is similar to the separability test with respect to $Z_1$ . Although $Z_2$ can be treated as 20 individual years, it is not recommended. From the economic point of view, the technology life cycle and business cycle in the semiconductor industry are typically 2–4 years (Tan & Mathews, 2010), so it is more natural to assume a uniform production frontier for the semiconductor companies within one industry cycle. From the econometric point of view, since there are 100–300 observations in each year, the effective parametric sample size $m = n^{\frac{2}{3}}$ for the directional distance measure in each year is very small. In either way, the approach to treat $Z_2$ as 20 individual years is less attractive. Alternatively, treating $Z_2$ as a continuous variable, the optimal time window of $Z_2$ needs to be fixed in advance. The optimal bandwidth is h=5.5 for fabless, IDM, and pooled data, implying that the smoothing window of year t is [t-5,t+5], while the optimal bandwidth is h=7.5 for OSAT with the smoothing window of year t to be [t-7,t+7] (Appendix C explains how to calculate the optimal bandwidth). The second portion and the third portion in Table 4 show the separability test results with respect to the optimal time $Z_2$ , and with respect to the business model $Z_1$ and the time $Z_2$ , respectively. In any case, the separability conditions are strongly rejected. Hence, we will estimate the efficiency scores defined in equation (2.4) with separated production frontiers per the constraints of both $Z_1$ and $Z_2$ . Table 5 shows the summary of the efficiency scores conditional on both the business model $Z_1$ and time $Z_2$ . Whether the time $Z_2$ is treated as a discrete variable or a continuous variable, the distributions of the efficiency scores are skewed to the right in all kinds of business models, especially for the fabless firms, implying more intensive competition for the fabless. Nevertheless, on conditions that $Z_2$ is treated as a discrete variable, the first quartiles are either equal to zero or very close to zero, no matter how the years are grouped. It is a sign that the measurement error by slow convergence rate still exists (e.g., see the second diagnostic in Appendix A). Thus, to treat $Z_2$ as a continuous variable is preferred, rewarding a larger subsample size and a faster convergence rate, and hence more accurate estimates. Figure 1 visualizes the trends of the annual mean efficiencies by the business model and year. Treating $Z_2$ either as a discrete variable or as a continuous variable, the curves of the annual mean efficiencies for the fabless firms are above the curves for the other business models. This phenomenon is more visible in the bottom-right panel, where $Z_2$ is defined as a continuous variable. As a higher efficiency score infers a lower technical efficiency in the directional distance measure, the curves in Figure 1 imply that the fabless firms are operating less efficiently on average. Figure 1 also demonstrates that if $Z_2$ is a discrete variable, the differences of operating efficiency among business models are fading away over time. Driving by increasing complexity of ICs, technological convergence, and time-tomarket and cost optimization pressure, the fab-lite model became popular in the semiconductor industry in late 1990s and early 2000s, and has continued to evolve and gain prominence. Some IDMs and fabless companies are forming strategic partnerships and collaborations to leverage each other's strengths. Furthermore, associations such as SEMI (Semiconductor Equipment and Materials International), SIA and ESIA (European Semiconductor Industry Association), contribute to the establishment of industry standards and provide platforms for networking and collaboration among industry stakeholders. The boundary between IDMs and fabless companies is becoming increasingly ambiguous. M&A activities in the semiconductor industry also contribute to the blurring of boundaries between IDMs and fabless companies. The industry has seen numerous mergers, acquisitions, and strategic partnerships aimed at expanding product portfolios, accessing new markets, and driving innovation. On the one hand, fabless giant Qualcomm has made over 41 acquisitions and 108 investments, such as the acquisition of Atheros (a leading provider of wireless networking Table 5: Summary statistics of the efficiency scores | Sample set | Sample size | Min | Q1 | Median | Mean | Q3 | Max | |---------------|-------------|-------|-------|--------|-------|-------|--------| | 2-year groups | | | | | | | | | Fabless | 3,017 | 0.000 | 0.000 | 0.023 | 0.097 | 0.085 | 13.859 | | IDM | 1,063 | 0.000 | 0.000 | 0.002 | 0.046 | 0.033 | 1.943 | | OSAT | 1,056 | 0.000 | 0.000 | 0.000 | 0.041 | 0.029 | 1.167 | | 4-year groups | | | | | | | | | Fabless | 3,017 | 0.000 | 0.005 | 0.036 | 0.129 | 0.107 | 15.171 | | IDM | 1,063 | 0.000 | 0.000 | 0.009 | 0.067 | 0.061 | 1.939 | | OSAT | 1,056 | 0.000 | 0.000 | 0.005 | 0.064 | 0.055 | 1.901 | | 5-year groups | | | | | | | | | Fabless | 3,017 | 0.000 | 0.009 | 0.042 | 0.147 | 0.115 | 15.179 | | IDM | 1,063 | 0.000 | 0.000 | 0.012 | 0.074 | 0.069 | 2.377 | | OSAT | 1,056 | 0.000 | 0.000 | 0.008 | 0.074 | 0.071 | 2.000 | | Optimal time | | | | | | | | | Fabless | 3,017 | 0.000 | 0.027 | 0.073 | 0.258 | 0.175 | 19.151 | | IDM | 1,063 | 0.000 | 0.006 | 0.043 | 0.131 | 0.171 | 2.087 | | OSAT | 1,056 | 0.000 | 0.004 | 0.036 | 0.142 | 0.146 | 2.660 | **Figure 1:** Mean $\beta$ conditional on business model and year. solutions) in 2011 and CSR (a England-based fabless known for its wireless chips) in 2015, continuously diversifying its product portfolio and strengthening its position in the mobile and networking markets. The merger of Avago and Broadcom in 2015 combined Avago's expertise in analog and mixed-signal semiconductor solutions with Broadcom's strengths in connectivity and networking technologies. On the other hand, IDM giant Intel has so far acquired more than 90 companies, including the acquisition of Altera Corporation (a prominent manufacturer of programmable logic devices) in 2015. This acquisition enabled Intel to integrate Altera's FPGA technology with its processors, offering customized solutions for various applications. Another interesting discovery is that the curves of different business models in the bottom-right panel of Figure 1 tend to converge in 2008, the year of global finanical crisis. That is, under a severe condition such as the 2008 financial crisis, the differences in operating efficiency become unconspicuous among business models. However, it is important to note that different business models still play a role in determining how companies weather the crisis. While the differences in operating efficiency might not be as apparent during extreme conditions, they can still influence a company's ability to adapt, survive, and eventually recover. As the economy recovered from the 2008 crisis, the differences in operating efficiency among business models once again became noticeable in the semiconductor industry. Figure 2 shows the annual mean efficiency curves by business model and optimal time, with 95% confidence interval. The confidence interval is derived by new central limit theorem (Kneip et al., 2015, p. 409). The variances for the fabless firms are higher compared with the IDMs or OSATs, implying higher risks and uncertainties for the fabless business model. As the separability condition does not hold (Table 4), we use a flexible nonparametric location-scale model for a **Figure 2:** Mean $\beta$ with 95% confidence interval. second-stage regression. The pure efficiency can be derived from equation (2.5). In practice, we obtain $\hat{\mu}(z)$ by regressing $\hat{\beta}(x,y|z)$ on the environmental variable z. Similarly, we obtain $\hat{\sigma}(z)$ by regressing the squared residuals of the preceding regression on z. The upper panel in Figure 3 illustrates the pure efficiency $\hat{\varepsilon}(z_1,z_2)$ that cleanses efficiency scores from the influence of both the environmental factors $Z_1$ and $Z_2$ . In the upper panel, the curves of $\hat{\varepsilon}(z_1,z_2)$ by different business model twist together with no clear structures, similar to white noise vibrating at small values around zero. The lower panel in Figure 3 illustrates the pure efficiency $\hat{\varepsilon}(z_2)$ that cleanses efficiency scores from the influence of only $Z_2$ , but not the business model $Z_1$ . In the lower panel, the curves of $\hat{\varepsilon}(z_2)$ demonstrate clear separation by business models. Since $\hat{\varepsilon}(z_2)$ only cleanses the influence of time, the lower panel in Figure 3 maintains the structure of the differences in technical efficiency by the business model in Figures 1 and 2. The contrast between the upper and lower panels in Figure 3 provides further evidence that the technical efficiencies do vary in the semiconductor Figure 3: Pure efficiency. by the business model, and the asset-light fabless firms are operating less efficiently on average in the past two decades. # **5 Summary** The semiconductor industry is famous for the high barriers to entry, especially in the capital-intensive manufacturing portion. The incumbent IDMs, benefitted by the economy of scale and protected by the economic moat by huge CAPEX, have dominated the semiconductor industry since the onset of the industry in the 1960s. However, wagering on novel technologies and processes with the everexpanding complexity of ICs becomes a weighty burden even for the giant IDMs. The fabless—foundry business model alleviates the financial risks of capital investment, reduces the barriers to entry, accelerats technology iterations, and leads to a flourishing of fabless design houses for various applications. This study compares the operating efficiencies between the IDMs and the fabless-foundry business models to shed light on which business model will be the market trend and dominate the semiconductor industry in the long run. Based on the capital-intensive feature of the semiconductor industry, this study chooses a directional distance measure to handle the constraint of CAPEX. At the same time, conditional FDH estimators are used to handle the effects of business model and time in the nonparametric frontier approach. The empirical results provide clear evidence that the IDMs are operating more efficiently, while fabless firms are operating less efficiently by and large. A secondstage nonparametric location-scale regression is used to further check the robustness of the finding. By setting different conditions in the second-stage regression, we show strong evidence that the technical efficiencies do vary in the semiconductor by the business model, and the asset-light fabless firms are operating less efficiently comparing with the capital-intensive IDMs, foundries, and OSATs on average. Nevertheless, there are several limitations in this study. The method is mainly econometric. Criteria for PCA, the separability condition, the optimal bandwidth, etc. are statistical, lacking solid ground in economics. The dynamics in the semiconductor industry had not been fully measured, which may need more advanced econometric methods and data with more technical details for further research. Though the fabless-foundry business model encourages entrance of the fabless startups, the CAPEX barriers accompanying with technical barriers still limit the fields and applications for the fabless firms to growth and development. The IDMs, having more room to optimize the operation and lead the technology development with a strategic product roadmap by vertical integration, will continuously dominate the semiconductor industry in the foreseeable future. At the same time, the fabless—foundry business model is an important complement of the IDMs to explore a broader scope in the semiconductor industry. In fostering a healthy and competitive semiconductor ecosystem, we suggest to tilt the subsidies of the semiconductor industry toward the fabless sector to promote innovation, to support entrepreneurship and startups, to leverage specialization and expertise, and to encourage more diversification. **Funding information**: No funding was used for this article. **Conflict of interest**: The authors declare no known competing financial interests or personal relationships that could have appeared to influence this article's results. **Article note:** As part of the open assessment, reviews and the original submission are available as supplementary files on our website. **Data availability statement**: The data used in this study are available from the corresponding author on request. ### References Adner, R., & Kapoor, R. (2010). Value creation in innovation ecosystems: How the structure of technological interdependence affects firm performance in new technology generations. *Strategic Management Journal*, *31*, 306–333. Bădin, L., Daraio, C., & Simar, L. (2010). Optimal bandwidth selection for conditional efficiency measures: A data-driven approach. *European Journal of Operational Research*, 201, 633–640. Bădin, L., Daraio, C., & Simar, L. (2012). How to measure the impact of environmental factors in a nonparametric production model. *European Journal of Operational Research*, 223, 818–833. Bădin, L., Daraio, C., & Simar, L. (2014). Explaining inefficiency in nonparametric production models: The state of the art. *Annals of Operations Research*, 214, 5–30. Balconi, M., & Fontana, R. (2011). Entry and innovation: An analysis of the fabless semiconductor business. Small Business Economics, 37, 87–106. Banker, R. D., Charnes, A., & Cooper, W. W. (1984). Some models for estimating technical and scale inefficiencies in data envelopment analysis. *Management Science*, 30, 1078–1092. Brown, C., Linden, G., & Macher, J. (2005). Offshoring in the semiconductor industry: A historical perspective. *Brookings Trade Forum*, 279–333. - Burkacky, O., Dragon, J., & Lehmann, N. (2022). *The semiconductor decade:* A trillion-dollar industry. McKinsey & Company, 1. - Ceruzzi, P. (1996). From scientific instrument to everyday appliance: The emergence of personal computers, 1970-77. *History and Technology, an International Journal*, *13*(1), 1–31. - Chambers, R. G., Chung, Y. H., & Färe, R. (1998). Profit, directional distance functions, and Nerlovian efficiency. *Journal of Optimization Theory and Applications*, *98*, 351–364. - Charnes, A., Cooper, W. W., & Rhodes, E. (1978). Measuring the efficiency of decision making units. European Journal of Operational Research, 2, 429–444. - Chen, Y. S., & Chen, B. Y. (2011). Applying DEA, MPI, and grey model to explore the operation performance of the Taiwanese wafer fabrication industry. *Technological Forecasting & Social Change*, 78, 536–546. - Chen, T. C., Guo, D. Q., Chen, H. M., & Wei, T. T. (2019). Effects of R&D intensity on firm performance in Taiwanas semiconductor industry. *Economic Research-Ekonomska istraživanja*, 32(1), 2377–2392. - Cho, Y. (2020). The effects of knowledge assets and path dependence in innovations on firm value in the Korean semiconductor industry. Sustainability, 12(6), 2319. - Daraio, C., & Simar, L. (2005). Introducing environmental variables in nonparametric frontier models: A probabilistic approach. *Journal Productivity Analysis*, 24, 93–121. - Daraio, C., & Simar, L. (2007). Advanced robust and nonparametric methods in efficiency analysis. New York: Springer Science & Business Media, LLC. - Daraio, C., & Simar, L. (2014). Directional distances and their robust versions: Computational and testing issues. *European Journal of Operational Research*, 237, 358–369. - Daraio, C., Simar, L., & Wilson, P. W. (2020). Fast and efficient computation of directional distance estimators. *Annals of Operations Research*, 288, 805–835. - Dibiaggio, L. (2007). Design complexity, vertical disintegration and knowledge organization in the semiconductor industry. *Industrial and Corporate Change*, *16*, 239–267. - Ernst, D. (2005). Limits to modularity: Reflections on recent developments in chip design. *Industry and Innovation*, 12, 303–335. - Farrell, M. J. (1957). The measurement of productive efficiency. *Journal of the Royal Statistical Society*, 120, 253–281. - Flamm, K. (2017). Has Moore's law been repealed? An economist's perspective. *Computing in Science & Engineering*, 19, 29–40. - Gugler, K., & Siebert, R. (2007). Market power versus efficiency effects of mergers and research joint ventures: Evidence from the semiconductor industry. *The Review of Economics and Statistics*, 89(4), 645–659. - Hall, P., Racine, J. S., & Li, Q. (2004). Cross-validation and the estimation of conditional probability densities. *Journal of the American Statistical Association*, *99*, 1015–1026. - Henderson, J., & Scott, A. J. (2018). The growth and internationalisation of the American semiconductor industry: Labour processes and the changing spatial organisation of production. In: *The Development of High Technology Industries* (pp. 37–79). Routledge. - Hsieh, Y., Lin, N., & Chiu, H. (2002). Virtual factory and relationship marketing - a case study of a Taiwan semiconductor manufacturing company. *International Journal of Information Management*, 22, 109–126. - Hsu, L. C. (2015). Using a decision-making process to evaluate efficiency and operating performance for listed semiconductor companies. *Technological and Economic Development of Economy*, 21, 301–331. - Hung, S. W., He, D. S., & Lu, W. M. (2014). Evaluating the dynamic performances of business groups from the carry-over perspective: A case study of Taiwanas semiconductor industry. *Omega*, 46, 1–10. - Hung, S. W., & Lu, W. M. (2008). The comparative productivity efficiency of Taiwan's integrated circuits packaging/testing firms. *Information* Systems and Operational Reserach, 46, 189–198. - Hurtarte, J. S., Wolsheimer, E. A., & Tafoya, L. M. (2011). *Understanding fabless IC technology.* Elsevier. - Hwang, H., & Choung, J. (2014). The co-evolution of technology and institutions in the catch-up process: The case of the semiconductor industry in Korea and Taiwan. *The Journal of Development Studies*, 50, 1240–1260. - Ibrahim, K., Chik, M. A., & Hashim, U. (August 2014). Horrendous capacity cost of semiconductor wafer manufacturing. In: *2014 IEEE International Conference on Semiconductor Electronics (ICSE2014)* (pp. 329–331). IEEE. - Jang, H., Lee, S., & Suh, E. (2016). A comparative analysis of the change in R&D efficiency: A case of R&D leaders in the technology industry. *Technology Analysis & Strategic Management*, 28, 886–900. - Jeong, S., Park, B. U., & Simar, L. (2010). Nonparametric conditional efficiency measures: asymptotic properties. *Annals of Operations Research*, 173, 105–122. - Kapoor, R. (2013). Persistence of integration in the face of specialization: How firms navigated the winds of disintegration and shaped the architecture of the semiconductor industry. *Organization Science*, 24, 1195–1213. - Kapoor, R., & Adner, R. (2012). What firms make vs what they know: How firms' production and knowledge boundaries affect competitive advantage in the face of technological change. *Organization Science*, 23, 1227–1248. - Kneip, A., Simar, L., & Wilson, P. W. (2015). When bias kills the variance: Central limit theorems for DEA and FDH efficiency scores. *Econometric Theory*, 31, 394–422. - Kneip, A., Simar, L., & Wilson, P. W. (2016). Testing hypotheses in non-parametric models of production. *Journal of Business and Economic Statistics*, 34, 435–456. - Kozmetsky, G., & Yue, P. (1998). Comparative performance of global semiconductor companies. *Omega*, 26, 153–175. - Kuo, M. H., & Yang, C. (2012). Does intellectual capital matter? Assessing the profitability and marketability of IC design companies. *Quality & Ougntity*. 46. 1865–1881. - Lambrechts, W., Sinha, S., Abdallah, J. A., & Prinsloo, J. (2018). Extending Mooreas law through advanced semiconductor design and processing techniques. CRC Press. - Lampe, H. W., & Hilgers, D. (2015). Trajectories of efficiency measurement: A bibliometric analysis of DEA and SFA. *European Journal of Operational Research*, 240(1), 1–21. - Li, H., He, H., Shan, J., & Cai, J. (2019). Innovation efficiency of semiconductor industry in China: A new framework based on generalized three-stage DEA analysis. *Socio-Economic Planning Sciences*, *66*, 136, 148 - Li, Q., Lin, J., & Racine, J. S. (2013). Optimal bandwidth selection for nonparametric conditional distribution and quantile functions. *Journal of Business & Economic Statistics*, 31, 57–65. - Li, Y., Huang, M., & Chen, D. (2011). Semiconductor industry value chain: Characters' technology evolution. *Industrial Management & Data Systems*, 111, 370–390. - Lin, F., Lin, S. W., & Lu, W. M. (2019). Dynamic eco-efficiency evaluation of the semiconductor industry: A sustainable development perspective. *Environ Monit Assess*, 191, 1–16. - Liu, F., & Wang, P. (2008). DEA Malmquist productivity measure: Taiwanese semiconductor companies. International Journal of Production Economics, 112, 367-379. - Lu, W. M., & Hung, S. W. (2010). Assessing the performance of a vertically disintegrated chain by the DEA approach - A case study of Taiwanese semiconductor firms. International Journal of Production Research, 48, 1155-1170. - Lu, W. M., Wang, W. K., & Lee, H. L. (2013). The relationship between corporate social responsibility and corporate performance: Evidence from the US semiconductor industry. International Journal of Production Research, 51, 5683-5695. - Macher, J. T. (2006). Technological development and the boundaries of the firm: A knowledge-based examination in semiconductor manufacturing, Management Science, 52, 826-843. - Macher, J. T., Mowery, D. C., & Di Minin, A. (2007). The "non-globalization" of innovation in the semiconductor industry. California Management Review, 50, 217-242. - Mack, C. A. (2011). Fifty years of Moore's law. IEEE Transactions on Semiconductor Manufacturing, 24, 202-207. - Makimoto, T. (2002). The hot decade of field programmable technologies. In: 2002 IEEE International Conference on Field-Programmable Technology, 2002. (FPT). Proceedings (pp. 3-6). IEEE. - Malone, M. S. (2014). The Intel trinity: How Robert Noyce, Gordon Moore, and Andy Grove Built the World as most important company. Harper - Mastromarco, C., & Simar, L. (2015). Effect of FDI and time on catching up: New insights from a conditional nonparametric frontier analysis. Journal of Applied Econometrics, 30(5), 826-847. - Mönch, L., Fowler, J. W., & Mason, S. J. (2012). Production planning and control for semiconductor wafer fabrication facilities: Modeling, analysis, and systems (p. 52). Springer Science & Business Media. - Monteverde, K. (1995). Technical dialog as an incentive for vertical integration in the semiconductor industry. Management Science, 41, - Park, J. H., Chung, H., Kim, K. H., Kim, J. J., & Lee, C. (2021). The impact of technological capability on financial performance in the semiconductor industry. Sustainability, 13(2), 489. - Park, J. H., Kook, S. H., Im, H., Eum, S., & Lee, C. (2018). Fabless semiconductor firms' financial performance determinant factors: Product platform efficiency and technological capability. Sustainability, 10(10), 3373. - Pellens, M., & Della Malva, A. (2018). Corporate science, firm value, and vertical specialization: Evidence from the semiconductor industry. Industrial and Corporate Change, 27(3), 489-505. - Powell, D. M., Fu, R., Horowitz, K., Basore, P. A., Woodhouse, M., & Buonassisi, T. (2015). The capital intensity of photovoltaics manufacturing: Barrier to scale and opportunity for innovation. Energy & Environmental Science, 8(12), 3395-3408. - Qiao, G., & Wang, Z. A. (2021). Vertical integration vs specialization: A nonparametric conditional efficiency estimate for the global semiconductor industry. Journal of Productivity Analysis, 56(2-3), 139-150. - Rastogi, A. P., Fowler, J. W., Carlyle, W. M., Araz, O. M., Maltz, A., & Büke, B. (2011). Supply network capacity planning for semiconductor manufacturing with uncertain demand and correlation in - demand considerations. International Journal of Production Economics, 134(2), 322-332. - Saha, S. K. (2015). Emerging business trends in the microelectronics industry. Open Journal of Business and Management, 4(1), 105-113. - Sarma, S., & Sun, L. (2017). The genesis of fabless business model: Institutional entrepreneurs in an adaptive ecosystem. Asia Pacific Journal of Management, 34, 587-617. - Sher, P. J., & Yang, P. Y. (2005). The effects of innovative capabilities and R&D clustering on firm performance: The evidence of Taiwanese semiconductor industry. Technovation, 25(1), 33-43. - Shin, N., Kraemer, K., & Dedrick, J. (2017). R&D and firm performance in the semiconductor industry. Industry and Innovation, 24, 280-297. - Simar, L., & Vanhems, A. (2012). Probabilistic characterization of directional distances and their robust versions. Journal of Econometrics, 166 342-354 - Simar, L., & Wilson, P. W. (2007). Estimation and inference in two-stage, semi-parametric models of productive efficiency. Journal of Econometrics, 136, 31-64. - Simar, L., & Wilson, P. W. (2011). Two-stage DEA: Caveat emptor. Journal Productivity Analysis, 36, 205-218. - Simar, L., & Wilson, P. W. (2015). Statistical approaches for non-parametric frontier models: A guided tour. International Statistical Review, 83, 77-110. - Simar, L., & Wilson, P. W. (2020). Hypothesis testing in nonparametric models of production using multiple sample splits. Journal Productivity Analysis, 53, 287-303. - Sueyoshi, T., & Ryu, Y. (2020). Performance assessment of the semiconductor industry: Measured by DEA environmental assessment. Energies, 13(22), 5998. - Tan, H., & Mathews, J. A. (2010). Cyclical industrial dynamics: The case of the global semiconductor industry. Technological Forecasting & Social Change, 77, 344-353. - Tsai, C. H., Wu, H. Y., Chen, I. S., Chen, J. K., & Ye, R. W. (2017). Exploring benchmark corporations in the semiconductor industry based on efficiency. Journal of High Technology Management Research, 28, 188-207. - VerWey, J. (2019). Chinese semiconductor industrial policy: Past and present. J. Int'l Com. & Econ., 1. - Walheer, B., & He, M. (2020). Technical efficiency and technology gap of the manufacturing industry in China: Does firm ownership matter?. World Development, 127, 104769. - Wilson, P. W. (2008). FEAR 1.0: A software package for frontier efficiency analysis with R. Socio-Economic Planning Sciences, 42, 247-254. - Wilson, P. W. (2012). Asymptotic properties of some non-parametric hyperbolic efficiency estimators. Exploring research frontiers in contemporary statistics and econometrics: A Festschrift for Léopold Simar (pp. 115-150). - Wilson, P. W. (2018). Dimension reduction in nonparametric models of production, European Journal of Operational Research, 267, 349-367. - Wu, W. Y., Tsai, H. J., Cheng, K. Y., & Lai, M. (2006). Assessment of intellectual capital management in Taiwanese IC design companies: Using DEA and the Malmquist productivity index. R&D Management, 36, 531-545. - Zhou, X., Chen, H., Chai, J., Wang, S., & Lev, B. (2020). Performance evaluation and prediction of the integrated circuit industry in China: A hybrid method. Socio-Economic Planning Sciences, 69, 100712. # **Appendix** # **A Principal Component Analysis** PCA is a mapping $\Psi: R_+^{p+q} \mapsto R_+^{1+1}$ . In matrix notation, $p \times n$ matrix $\mathbf{X}$ and $q \times n$ matrix $\mathbf{Y}$ are transformed to $1 \times n$ matrices $\Lambda'_{x_1}\mathbf{X}$ and $\Lambda'_{y_1}\mathbf{Y}$ by pre-multiplying the first eigenvector $\Lambda_{x_1}$ and $\Lambda_{y_1}$ of the moment matrices $\mathbf{X}\mathbf{X}'$ and $\mathbf{Y}\mathbf{Y}'$ , respectively. Unfortunately, there is no theorem that precisely identifies situations where dimension reduction should be used. Wilson (2018) provides three diagnostics for empirical research. The first diagnostic is to compute the effective parametric sample size m, in case of n observations in a non-parametric estimation. Setting $m^{\frac{1}{2}}=n^{\kappa}$ , and thus, $m \approx \lfloor n^{2\kappa} \rceil$ , where the convergence rate $\kappa = \frac{1}{p+q}$ for FDH estimator and $\kappa = \frac{2}{p+q+1}$ for DEA estimator, and $\lfloor a \rceil$ denotes the integer nearest a. Hence, the criterion of judging the minimum sample size m in parametric estimation can be used as reference in judging the minimum sample size n in non-parametric estimation. **Figure A1:** Optimal time window of $Z_2$ by LSCV. A second diagnostic is to consider the proportion of n observations that yield efficiency scores equal to one. Since the, FDH estimator converges slower than the DEA estimator, a robust diagnostic for the curse of dimensionality should use the FDH efficiency estimator. If more than 25-50% of the observations yield efficiency scores equal to one, the estimation results are not convincing. A third diagnostic is to determine the ratios $R_{\rm r}$ and $R_{\rm v}$ of the largest eigenvalue of the moment matrices XX' and YY' to the corresponding sum of eigenvalues for XX' and $\mathbf{Y}\mathbf{Y}'$ . The ratios of $R_{\mathbf{Y}}$ and $R_{\mathbf{y}}$ provide measures of how close the corresponding moment matrices are to rank-one. For example, if $R_x = 0.9$ , then the matrix with dimension reduction $\Lambda'_{x1}X$ contains 90% of the independent linear information in the original matrix X. In practice, Wilson (2018) proposed standardizing the matrices X and Y before PCA to ensure that the inputs or outputs have the same scale, in case of excessive number of inputs or outputs. # **B** Computation of the Directional Distance Measure This study follows the Daraio et al. (2020) approach to compute the directional distance measure using the FDH estimator. First, use the Hadamard component-wise division of vectors $\oslash$ to do a monotonic transformation of the data as: $$X^* = X \oslash d_X$$ and $Y^* = Y \oslash d_Y$ . Then, the directional distance estimator in equation (2.1) can be expressed explicitly as: $$\begin{split} \beta(x,y|d_x,d_y) &= \sup\{\beta > 0 | H_{n,X^*Y^*|X_f}(x^* - \beta,y^* \\ &+ \beta|x_f) > 0\}, \\ &= \max_{\{i|X_{f,i} \leq x_f\}} [\min\{x^* - X_i^*,Y_i^* - y^*\}], \end{split} \tag{A1}$$ where *n* is the sample size, $i \in \{1, 2, ..., n\}$ , and $X_f$ is the fixed input. Similarly, the conditional directional distance estimator in equation (2.4) can be expressed as: $$\beta(x, y|d_x, d_y, z) = \max_{\{i|X_{f,i} \le x_f, |Z_i - z| \le h\}} [\min\{x^* - X_i^*, Y_i^* - y^*\}],$$ (A2) where Z denotes a vector of environmental variables. ### C Kernel Method A nonparametric estimator of $H_{X,Y|Z}(\cdot)$ in equation (2.3) can be obtained by standard kernel smoothing, i.e., $$\widehat{H}_{X,Y|Z}(x,y|z) = \frac{\sum_{i=1}^{n} \mathbf{I}(X_{i} \leq x, Y_{i} \geq y) K\left(\frac{Z_{i}-z}{h}\right)}{\sum_{i=1}^{n} K\left(\frac{Z_{i}-z}{h}\right)},$$ where $K(\cdot)$ is a kernel function and h is a vector of bandwidths $h = (h_1, ..., h_r)$ . Bădin et al. (2010, 2012), Hall et al. (2004), Jeong et al. (2010), and Li et al. (2013) had discussed extensively how to choose the optimal bandwidth h by least-square cross-validation (LSCV). In this study, the optimal bandwidth h of the time $Z_2$ is calculated by minimizing $$\frac{\sum_{i=1}^{n}\sum_{j\neq i}^{n}\left|\mathbf{I}(\widetilde{x}_{i}\leq\widetilde{x}_{j},x_{f,i}\leq x_{f,j},\widetilde{y}_{i}\geq\widetilde{y}_{j})-\frac{\frac{1}{n}\sum_{k\neq i}^{n}\mathbf{I}(\widetilde{x}_{k}\leq\widetilde{x}_{j},x_{f,k}\leq x_{f,j},\widetilde{y}_{k}\geq\widetilde{y}_{j})K_{h}(z_{i},z_{k})}{\frac{1}{n-1}\sum_{k\neq i}^{n}K_{h}(z_{i},z_{k})}\right|^{2}}{n(n-1)}$$ where $K_h(t) = \frac{1}{h}K(\frac{t}{h})$ . The patterns in Figure A1 represent the optimal time window by the business model using LSCV.