Home Automated Prediction of Preservation Index in Library Environments through Multiple Regression Analysis
Article Open Access

Automated Prediction of Preservation Index in Library Environments through Multiple Regression Analysis

  • Eka Ratri Noor Wulandari ORCID logo EMAIL logo , Hafrida Rahmah ORCID logo , Salnan Ratih Asriningtias ORCID logo , Iwan Permadi ORCID logo , Heri Prayitno ORCID logo , Pitoyo Widhi Atmoko ORCID logo and Pipit Tunjungsari ORCID logo
Published/Copyright: June 16, 2025

Abstract

This study presents a novel predictive model employing multiple linear regression to enhance the calculation of the Preservation Index (PI) for library collections. Building upon and expanding previous research on Internet of Things (IoT)-based temperature and humidity monitoring in library rooms, this investigation incorporates critical environmental variables: temperature, humidity, and light exposure. Data was collected over a three-month period using advanced IoT systems deployed in three distinct library environments, each presenting unique preservation challenges. The model’s development involved a meticulous process of interpolating existing PI data to achieve finer gradations, followed by analysis through a sophisticated multiple regression framework. Results indicate that while light exposure’s contribution to the model’s predictive capability was minimal, the overall model precision was remarkably high, demonstrated by a low Mean Absolute Percentage Error (MAPE) of 2.615. This research underscores the predominant influence of temperature and humidity in PI forecasting while also providing nuanced insights into the subtle effects of light exposure on collection preservation. A correlation heatmap is included to elucidate the interrelationships among variables, offering a visual complement to the statistical findings and enhancing the interpretation of complex data interactions. This study contributes to the advancement of preservation strategies in library science, potentially informing more effective environmental control practices in diverse library settings.

1 Introduction

The preservation of library collections is an enduring concern in the field of archival science, where environmental factors play a pivotal role in determining the longevity and integrity of stored materials (Camuffo 2019; Goswami 2018). UNESCO estimates that more than 7 billion items in library collections worldwide require preservation (Manernova 2015; Mishra 2017). This shows the magnitude of the challenge libraries face in maintaining and preserving their collections for future generations, where factors such as environmental conditions, biological threats, and chemical reactions continue to threaten the survival of library materials (Ahmad et al 2024; Garside and Bradford 2024). The library industry also faces financial loss due to damage to library collections, a phenomenon that reflects serious challenges in collection preservation and management (Yano et al 2013; Yousuf Ali 2017; Zaveri 2014).

Climate change and air pollution have significantly accelerated the degradation of library materials (Turhan et al 2019), while existing manual monitoring systems show limitations in anticipating dynamic changes in environmental conditions (Diulio et al 2022). This situation is exacerbated by the increasing risk of biological damage, especially mould growth on collections in libraries without adequate climate control (Derksen et al. 2024), resulting in the loss of rare and historic collections that cannot be replaced. Further impacts of this problem are a significant increase in restoration and conservation costs (Mishra 2017), as well as decreased accessibility of information for researchers and the public as damaged collections become inaccessible or even disappear completely (Coppola et al. 2020), thus threatening the continuity of knowledge transfer and cultural heritage for future generations.

The Preservation Index (PI) has served as an important metric, offering predictions on the lifespan of paper-based materials under various environmental conditions (Diulio et al 2019). This metric has become an invaluable tool in library and archive collection management, allowing professionals to assess and optimize their preservation strategies through an in-depth understanding of how various environmental factors can affect the durability of paper-based materials (Balocco et al. 2016). Historically, these predictions have predominantly focused on two primary variables – temperature and humidity – which are known to significantly influence the degradation processes (Li and Tang 2013; Persson 1989). Based on the existing literature, research on the preservation of library materials shows a complex interplay between various environmental factors, prediction models, and monitoring technologies. Calabrese et al. (2018) developed an integrated predictive system that uses non-invasive techniques to monitor and predict material changes over time, while the use of Internet of Things (IoT) sensors has become a promising solution in modern monitoring contexts.

The use of IoT for hygrothermal monitoring in libraries and archival facilities is by no means new. Research by Asinelli et al. (2018) utilized a simple Arduino-based device to remotely monitor the indoor environment of a cultural heritage site in Spain, while Monti et al. (2019) developed a strategically placed multisensor system to monitor library storage spaces in a university. Although adoption was initially slow due to infrastructure limitations (Masenya 2020), a recent study showed the transition from passive monitoring to proactive control with Node MCUs (Sharath 2024) and the ability to identify correlations between hygrothermal conditions and structural damage (Dolińska et al 2025). Research by Ni et al. (2025) reported key challenges that include limited Wi-Fi connectivity, sensor accuracy outside the 20–80 % RH range, and the need for non-invasive installation methods in historic buildings. On the other hand, research by Khean et al. (2023) reported huge benefits including continuous monitoring that can reduce energy consumption by 50 % while preventing mould growth and damage to valuable materials. With the advent of IoT, capabilities for environmental monitoring have dramatically improved, allowing for the continuous and precise collection of environmental data (Wulandari et al 2023).

Recent research in library preservation reveals a complex landscape that includes both digital and physical preservation challenges. Studies show a growing trend towards in-house digital preservation practices over outsourced solutions, as institutions seek greater control over their digital assets (Ahmad et al 2024), while big data preservation in digital libraries faces the challenge of sustainable storage technologies (Bhat 2018). Traditional preservation concerns remain, including environmental, biological, and chemical factors affecting physical materials (Mishra 2017), alongside emerging challenges in the preservation of personal digital heritage (Nagy and Kiszl 2020). Modern preservation methods are evolving to address these dual challenges, with recent studies highlighting the importance of integrating traditional and innovative preservation techniques in public libraries (Kumar 2024) and developing new methodologies for data preservation in contemporary library environments (Beskaravainaya and Mitroshin 2024).

Although various studies have examined methods of preserving library collections and developing predictive models to assess collection condition, there is still a gap in comprehensively integrating environmental variables into these models. Previous studies tend to focus on single factors or analyze environmental variables in isolation, so there are no predictive models that integrate the simultaneous effects of temperature, humidity, and light exposure to produce a more accurate assessment of the condition and preservation needs of library collections. Filling this gap is important to improve the accuracy and effectiveness of library collection preservation efforts in the face of dynamic environmental change challenges. By developing more comprehensive predictive models, libraries can optimize strategies and resources to ensure the long-term preservation of their collections.

Recent research in library science reveals several interrelated trends shaping the development of the field. Ahmad et al (2024) highlighted the shift towards in-house digital preservation and hybrid library management, while Pasqui (2024) emphasized the evolution of digital curation practices to meet the Findability, Accessibility, Interoperability, and Reusability (FAIR) principles. Regarding technological advancements, Bhat (2018) identified the limitations of current storage technologies for big data preservation, indicating the need for innovative solutions including the integration of sensors and IoT. Abad-Segura et al. (2020) underlined the importance of environmental sustainability in digital transformation strategies, especially in response to the challenges of global climate change. In this context, the development of more accurate and comprehensive predictive models for the preservation of library collections is becoming increasingly relevant, as it can help libraries adapt to dynamic environmental changes and overcome preservation challenges in the digital age.

This research aims to demonstrate that an enhanced predictive model, which incorporates environmental variables such as temperature, humidity, and light exposure, can improve the accuracy of preservation assessments, potentially offering libraries a more powerful tool to manage their collections sustainably. The implication of this research is the enhancement of libraries’ ability to perform collection preservation more accurately and sustainably through the use of predictive models that consider environmental factors (temperature, humidity, and light exposure), thereby optimizing preservation efforts and saving resources in the long run. The main contribution of this research is to provide new insights into how the comprehensive integration of environmental variables into predictive models can improve the accuracy of preservation assessments of library collections. The findings of this study are expected to assist libraries in developing more effective and efficient preservation strategies, as well as encourage further research in this area.

The scope of this paper is to investigate a multiple regression approach for calculating the Preservation Index (PI) of library collections, integrating environmental variables such as temperature, humidity, and light exposure. In this work, the model is applied to a dataset collected from three distinct library environments over a three-month period. Specifically, data from these environments, which vary in their exposure to natural light and controlled climate conditions, are analyzed to assess their impact on the PI. This research aims to demonstrate that a refined predictive model, incorporating these variables, can enhance the accuracy of preservation assessments, potentially offering libraries a more robust tool for managing their collections sustainably.

This study contributes to the field by providing a detailed predictive model of the Preservation Index (PI) using IoT systems across multiple library environments, which has not been thoroughly explored before. The novel integration of environmental variables in a combined room model offers a comprehensive approach to understanding and managing library preservation conditions more effectively.

2 Methodology

2.1 Data Collection

The study utilized empirical data collected over a three-month period at Brawijaya University Library. Environmental variables critical to preservation – temperature, humidity, and light exposure – were continuously recorded to capture the dynamic conditions within the library environment. This comprehensive dataset provides a solid foundation for analyzing the effects of environmental factors on material preservation (Camuffo 2019; Li and Tang 2013).

In this study, the environmental conditions within the library were meticulously monitored using an advanced Internet of Things (IoT) system. The system comprised DHT11 sensors known for their reliable measurement of temperature and humidity. These sensors were strategically placed in three distinct library rooms to ensure comprehensive data collection across varied conditions. A Raspberry Pi was employed as the central data processing unit, chosen for its ability to handle complex algorithms and manage large datasets effectively. This setup facilitated continuous data recording, with the Raspberry Pi programmed to collect readings from the sensors at one-minute intervals over a three-month period. The data was transmitted over a secured Wi-Fi network, allowing for real-time data access and remote monitoring by researchers. This frequent sampling rate provided a robust dataset, critical for analyzing the effects of environmental conditions on the library’s preservation index. Figure 1 below illustrates the complete configuration of the IoT system, showing the placement of sensors, their connection to the Raspberry Pi, and the subsequent data flow to the storage and analysis systems, thereby providing a visual representation of the systematic approach taken in capturing and processing the environmental data essential for this study.

Figure 1: 
The IoT monitoring system.
Figure 1:

The IoT monitoring system.

Data was collected during operational hours, from 8:00 a.m. to 5:00 p.m. on working days, to ensure that the environmental conditions measured reflected the library spaces’ typical daily usage patterns. Each room was monitored sequentially over a one-month period to avoid data overlap and ensure precise attribution of environmental influences to specific areas. Measurements were captured at one-minute intervals, offering a high-resolution view of environmental fluctuations.

On certain occasions, the start of data logging devices was delayed due to operational constraints, resulting in start times slightly later than the scheduled 8:00 a.m. start time. These delays were mainly due to the manual setup needed each day to activate the monitoring systems, causing initial recordings to begin at varied times, such as 8:03 a.m. or 8:12 a.m. Despite these staggered starts, the recording consistently finished at the intended time of 5:00 p.m.

Efforts were made to document and account for all instances of delayed starts, ensuring that the analysis remained robust. This approach allowed us to maintain the integrity of the data collection process, with detailed logging of any deviations from the planned schedule. These deviations were minimal and did not significantly impact the overall dataset or the conclusions drawn from it.

Three distinct rooms within the Universitas Brawijaya Library were utilized, each with unique characteristics affecting the internal environmental conditions. The White Label Room featured 22 active windows providing ample natural light, supplemented by a combination of fluorescent and downlight spot lamps, designed to evenly distribute light without causing damage to materials. The Red Label Room, used primarily for specialized collections, included 14 active windows and utilized solely downlight spot lamps to enhance visibility while preserving the integrity of the materials. The Thesis Storage Room was equipped with 10 active windows and a similar lighting setup to ensure consistent light levels across all reading and storage areas.

In this study, we concentrated exclusively on monitoring and analyzing the internal environmental conditions of library environments, such as temperature, humidity, and light exposure inside different rooms. External environmental variables, including outdoor temperature and humidity, were not measured as they were outside the scope of our investigation. This methodological choice was deliberate to isolate the effects of internal microclimatic conditions on the Preservation Index of library materials, ensuring that our analysis directly reflects the impact of controlled internal factors. This focus is intended to enhance the applicability of our results to indoor environmental management practices for library preservation.

To better represent the spatial characteristics of the monitored rooms, Figure 2 presents the internal layout of the White Label Room, including its orientation to the cardinal directions and location of windows and doors. The reading areas are located in the center of the room, surrounded by shelves that are mostly aligned perpendicular to window positions.

Figure 2: 
Spatial layout of the White Label Room.
Figure 2:

Spatial layout of the White Label Room.

As shown in Figure 3, the room receives significant daylight from the eastern and southern sides, which affects both the temperature and light exposure throughout the day. These spatial features were considered in the environmental analysis and model interpretation.

Figure 3: 
Photograph of the White Label Room.
Figure 3:

Photograph of the White Label Room.

In our analysis, we combined data from three different rooms within the same library building – assuming that the environmental conditions are uniformly consistent across these spaces. This assumption is based on the fact that all rooms are situated in close proximity within the same building, under the same Heating, Ventilation, and Air Conditioning (HVAC) system, and have similar structural designs and exposure to environmental factors. Statistical analyses were conducted on the aggregated data to assess the overall impact of environmental conditions on the Preservation Index of library materials. This approach allows us to leverage a larger dataset and enhance the reliability of our findings by reducing variability due to small sample sizes in individual rooms.

An advanced Internet of Things (IoT) system was employed to monitor and record environmental conditions within the library accurately. This subsection details the setup and functionality of the system used.

2.2 Interpolation of Preservation Index Data

The Preservation Index (PI) values were derived from interval data provided in established preservation reference books. Interpolation techniques were applied to the interval PI data to create a continuous dataset suitable for statistical analysis. This process involved using methods like spline or linear interpolation to estimate PI values for each corresponding set of environmental measurements, thus filling gaps between the known values (Harim et al. 2020; Lai and Kaplan 2022; Renka 1987).

2.3 Regression Analysis

A regression model was developed to predict the PI based on the interpolated data and the measured environmental variables (Deepika and Channegowda 2020; Kareem et al 2022). The equation defines the model:

(1) P I = β 0 + β 1 T e m p e r a t u r e + β 2 H u m i d i t y + β 3 L i g h t

where β0, β1, β2, β3β0, β1, β2, and β3 are the coefficients estimated by the regression. The model was trained using a portion of the dataset and validated with the remaining data to assess its predictive accuracy and reliability.

2.4 Comparative Analysis

The effectiveness of each regression model in predicting PI was compared against the interpolated PI values from the book. Statistical measures such as mean absolute error (MAE) were used to quantify the accuracy and responsiveness of each method. This comparative analysis aims to identify the most reliable method for calculating the Preservation Index in library environments, considering the practical applicability of each method under real-world conditions.

3 Results and Analysis

3.1 Environmental Conditions in the Monitored Rooms

To better understand the microenvironmental context in which the preservation models were applied, this section presents trends in temperature, relative humidity, and light exposure observed in each room during the monitoring periods.

3.2 Temperature Trends

Figure 4 presents the daily average temperature trends observed in each monitored library room during separate data collection periods: White Label Room, Red Label Room, and Thesis Storage Room. Despite being measured in different months, the graphs are displayed using the same temporal format to enable direct visual comparison.

Figure 4: 
Daily average temperature trends in each monitored library room.
Figure 4:

Daily average temperature trends in each monitored library room.

The White Label Room exhibited relatively stable temperature fluctuations, while the Red Label Room showed a more pronounced daytime rise, likely due to higher solar exposure. The Thesis Storage Room exhibited the lowest overall temperature trend, possibly attributed to its shaded orientation and air circulation pattern.

These differences in thermal dynamics are important to consider when interpreting the regression results, as temperature variation has a direct impact on the Preservation Index.

3.3 Humidity Trends

Figure 5 presents the daily average humidity recorded in each of the monitored rooms during their respective data collection periods. The White Label Room showed consistently high humidity levels, ranging between 78 and 82 %, with minimal daily variation. The Red Label Room, in contrast, experienced more fluctuation, possibly due to greater air exchange from open windows or inconsistent airflow control. The Thesis Storage Room maintained moderate humidity levels around 60–65 %, indicating more stable environmental control or better insulation. These variations in humidity levels may influence the Preservation Index and are relevant when assessing the relative stability of each environment.

Figure 5: 
Daily average relative humidity trends in each monitored library room.
Figure 5:

Daily average relative humidity trends in each monitored library room.

3.4 Light Trends

Figure 6 illustrates the daily average light exposure levels recorded in the White Label, Red Label, and Thesis Storage Rooms. Among the three rooms, the White Label Room recorded the highest and most variable light exposure, reflecting its larger windows and minimal shading. The Red Label Room had consistently low light levels, which may be attributed to the use of curtains or internal room placement. The Thesis Storage Room exhibited moderate and relatively stable light exposure, suggesting a balanced combination of natural and artificial lighting.

Figure 6: 
Daily average light exposure in the monitored rooms.
Figure 6:

Daily average light exposure in the monitored rooms.

These environmental light differences are relevant for preservation considerations, as high and fluctuating exposure to light can accelerate fading, deterioration, and material aging, especially for paper-based archival collections.

3.5 Red Label Room

Figure 7 shows a Pearson correlation heatmap that helps us understand how temperature, humidity, and light exposure relate to each other in the Red Label Room of the library. The analysis reveals a strong negative relationship between temperature and humidity, with a correlation coefficient of −0.76. This means that when temperature goes up, humidity tends to go down significantly in this room. On the other hand, light exposure does not seem to have much connection with either temperature or humidity, showing very weak correlations of −0.10 and −0.09, respectively. This suggests that light changes happen independently of the area’s temperature or humidity changes.

Figure 7: 
Pearson correlation for Red Label collection room.
Figure 7:

Pearson correlation for Red Label collection room.

The regression model developed for the Preservation Index (PI) in the Red Label Room is summarized by the equation:

(2) Y R E D = 87.92 + 0.704 θ R E D + 0.2826 η R E D 0.0004 λ R E D

where,

YRED represents the preservation index (PI), θ RED is the temperature, η RED is the humidity, and λ RED is the light exposure in the Red Label Room. The intercept of 87.92 suggests a high baseline PI under optimal conditions where temperature, humidity, and light are at their reference levels, when all the independent variables are set to zero. The coefficients associated with temperature (θ RED ) and humidity (η RED ) are −0.7043 and −0.2826, respectively, indicating that increases in these variables lead to a decrease in the PI. Specifically, a one-unit increase in temperature is associated with a 0.7043 unit decrease in the PI, highlighting the sensitivity of preservation conditions to temperature fluctuations. Similarly, a one-unit increase in humidity results in a 0.2826 decrease in the PI, affirming the detrimental impact of higher humidity levels on material preservation.

The coefficient for light exposure (λ RED ), at −0.0004 is significantly smaller, suggesting a minimal impact of light on the PI in this specific environment. This minimal influence reflects the relative insignificance of light exposure compared to temperature and humidity in affecting the preservation conditions within the Red Label Room.

3.6 White Label Room

Figure 8 presents the Pearson correlation heatmap for the White Label Room, providing insights into the interdependencies among temperature (°F), humidity (%), and light exposure. The heatmap indicates a moderate negative correlation of −0.42 between temperature and humidity, suggesting that humidity tends to decrease in this specific environment as temperature increases. This inverse relationship might be attributed to the air’s reduced capacity to hold moisture at higher temperatures, which is crucial for strategic preservation planning in archival environments.

Figure 8: 
Pearson correlation for White Label collection room.
Figure 8:

Pearson correlation for White Label collection room.

Conversely, the correlation between light exposure and both temperature and humidity are notably weak, with coefficients of −0.03 and 0.11, respectively. These values indicate that light exposure in the White Label Room varies almost independently of temperature and humidity changes. The near-zero correlation between temperature and light exposure (−0.03) implies that fluctuations in temperature do not significantly affect the light conditions within the room, and the positive but slight correlation (0.11) between humidity and light exposure suggests minimal co-variation.

The regression model tailored for assessing the Preservation Index (PI) within the White Label Room is encapsulated by the following equation:

(3) Y W H I T E = 91.75 + 0.7978 θ W H I T E 0.338 η W H I T E 0.0002 λ W H I T E

where,

YWHITE denotes the PI, θ WHITE represents the temperature, η WHITE signifies the humidity, and λ WHITE indicates light exposure in the White Label Room.

The model’s intercept at 91.75 suggests a relatively high baseline PI under controlled environmental conditions where temperature, humidity, and light are set at their baseline values. The temperature coefficient of −0.7978 indicates a significant negative impact on the PI; this implies that an increase in temperature by one unit decreases the PI by approximately 0.798 units, underscoring the critical influence of temperature regulation in the preservation environment.

Similarly, the humidity coefficient of −0.2338 suggests that increases in humidity also detrimentally affect the PI, albeit to a lesser extent than temperature. This coefficient reinforces the importance of maintaining optimal humidity levels to mitigate its negative impact on archival materials.

Interestingly, the coefficient for light exposure at −0.0002 is extremely small, suggesting that within the range of variation observed in this study, light exposure has an almost negligible direct impact on the PI. This minimal influence suggests that, for the White Label Room, other environmental controls might be prioritized over stringent light exposure management.

3.7 Thesis Storage Room

Figure 9 illustrates the Pearson correlation heatmap for environmental variables within the Thesis Storage Room. The analysis reveals a moderate negative correlation between temperature (°F) and humidity (%), with a coefficient of −0.39, indicating that increases in temperature are generally accompanied by decreases in humidity within this environment. This inverse relationship is crucial for understanding the storage conditions of theses, as temperature and humidity are known to significantly influence paper preservation.

Figure 9: 
Pearson correlation for Thesis Storage Room.
Figure 9:

Pearson correlation for Thesis Storage Room.

In contrast, the correlation between temperature and light exposure is also negative, marked at −0.22, suggesting that higher temperatures slightly correlate with lower light levels in this room. However, this relationship is weaker compared to the temperature-humidity interaction. Light exposure shows a minimal positive correlation with humidity, recorded at 0.12, which is not substantial but indicates that slight increases in humidity could coincide with increases in light levels, although this effect is relatively negligible.

The regression model for the Thesis Storage Room, aimed at predicting the Preservation Index (PI), is succinctly formulated as follows:

(4) Y T H E S I S = 147.69 1.2844 θ T H E S I S 0.4971 η T H E S I S 0.0044 λ T H E S I S

in this model, YTHESIS represents the PI, θ THESIS denotes the temperature, η THESIS denotes the humidity, and λ THESIS indicates light exposure within the Thesis Storage Room.

The intercept value of 147.6966 suggests a relatively high baseline PI under controlled conditions, where all environmental variables are at their baseline or reference levels. This high baseline indicates that the preservation conditions are optimal when the environmental factors are maintained at standard levels, typical of controlled archival settings.

The coefficient for temperature (θ THESIS ) at −1.2844 indicates a significant negative impact on the PI. This suggests that each unit increase in temperature results in a substantial decrease in the PI by approximately 1.2844 units, highlighting the critical need to manage temperature variations carefully within the storage area.

Similarly, the humidity’s coefficient (η THESIS ) of −0.4971 reinforces the negative effect of increasing humidity on the PI. Although the impact is less severe than temperature, it underscores the importance of controlling humidity to prevent detrimental effects on the preservation of theses.

Lastly, the coefficient for light exposure (λ THESIS ) at −0.0044, while still negative, indicates a very minor influence on the PI. This minimal impact suggests that light exposure does not play a significant role in the degradation of materials in the Thesis Storage Room, compared to temperature and humidity.

3.8 Environmental Insights Across Rooms

This section presents the collection and analysis of environmental data from multiple storage areas within the library, encompassing the White, Red, and Thesis Storage Rooms.

The consolidated dataset offers a comprehensive view of the interactions among environmental variables – temperature, humidity, and light exposure – across these diverse yet interconnected spaces. By examining this data, we seek to identify underlying patterns that potentially influence material preservation across different storage environments.

The Pearson correlation heatmap shown in Figure 10 assesses the interactions among environmental variables – temperature (suhu_f), humidity (kelembaban_%), and light exposure (cahaya) – in the combined settings of the White, Red, and Thesis Storage Rooms. This visualization indicates generally weak correlations, suggesting that these variables operate independently across the different environments. Specifically, there is a slight negative correlation of −0.09 between temperature and humidity, showing that higher temperatures do not strongly correlate with lower humidity levels. Additionally, the correlation between temperature and light exposure is also weak at −0.03, indicating minimal dependence of light levels on temperature changes. Notably, light exposure has a small positive correlation with humidity at 0.12, implying that increases in humidity might slightly enhance light exposure, although this link is too weak to infer a strong dependency.

Figure 10: 
Pearson correlation across rooms.
Figure 10:

Pearson correlation across rooms.

The regression model developed to predict the Preservation Index (PI) across all room settings – White, Red, and Thesis Storage Rooms – is expressed by the equation:

(5) Y A L L = 123.39 1.0472 θ A L L 0.3882 η A L L 0.0042 λ A L L

here, YALL represents the PI, θ ALL denotes temperature, η ALL signifies humidity, and λ ALL indicates light exposure.

The intercept value of 123.3949 suggests a high baseline PI, assuming that temperature, humidity, and light are at their reference levels. This implies that under optimal conditions, the preservation environment is highly favourable. The coefficient for temperature (θ ALL ) at −1.0472 indicates a substantial negative impact on the PI; specifically, a one-unit increase in temperature correlates with a decrease in the PI by approximately 1.0472 units, underscoring the critical sensitivity of the preservation conditions to temperature changes.

Furthermore, the coefficient for humidity (η ALL ) at −0.3882 also demonstrates a negative effect on the PI, albeit less pronounced than temperature, yet still significant for material preservation strategies.

Lastly, the coefficient for light exposure (λ ALL ) at −0.0042, while negative, indicates a relatively minor impact on the PI compared to temperature and humidity. This minimal coefficient suggests that light exposure, within the observed ranges, does not significantly influence the preservation conditions as much as the other environmental variables.

3.9 Statistical Assessment of Regression Models

We explore the outcomes of the regression analysis by focusing on statistical measures that assess the performance and validity of our models across different environmental settings. These include R2 (R-squared), the F-statistics, and the Durbin-Watson statistics. R2 provides a measure of how much of the variability in the dependent variable can be explained by the independent variables in the model. A higher R2 value indicates a better fit of the model to the data, suggesting that the model explains a large proportion of the variability. The F-statistics evaluate the overall significance of the regression model, testing whether the observed relationships between the dependent and independent variables are statistically significant. A higher F-statistic value signals that the model is statistically meaningful, which implies that the predictors collectively have a significant effect on the outcome variable. Meanwhile, the Durbin-Watson statistic helps detect the presence of autocorrelation in the residuals of the regression model. Values close to 2 indicate no autocorrelation, affirming that the residuals are independent and that the regression model is appropriately specified.

Table 1 provides a comparison of regression models for various environmental settings within the library, specifically the White, Red, and Thesis Storage Rooms, along with a combined model. The R2 values across these models are impressively high, indicating that nearly all the variability in the Preservation Index can be explained by the models for each respective environment. Specifically, the combined room model has the highest R2 value at 0.984, suggesting the most robust fit among the models.

Table 1:

Regression metric summary.

Environment R2 F-statistic Durbin-Watson
Red room 0.983 6.347e+05 0.015
White room 0.971 1.414e+05 0.023
Thesis storage room 0.945 5.738e+04 0.019
Combined rooms 0.984 2.062e+05 0.017

Looking at the F-statistics, which test the overall significance of each regression model, we observe a clear hierarchy in model significance, though all models demonstrate statistically significant relationships between environmental factors and the Preservation Index. The Red Room model exhibits the highest F-statistic at 634,700, indicating an exceptionally strong predictive relationship, likely due to specific environmental sensitivities unique to this room. The Combined Rooms model follows with an F-statistic of 206,200, underscoring its robustness across multiple environments and validating its utility for overarching environmental strategies. The White Room, with an F-statistic of 141,400, and the Thesis Storage Room, at 57,380, still present significant models that effectively capture environmental impacts on preservation. These statistics not only affirm the validity of each model but also highlight the varying degrees of environmental influence across different room settings, suggesting that while some areas may be more sensitive to changes, all areas are significantly affected by their respective environmental conditions.

The Durbin-Watson statistic values for each environment demonstrate that the residuals from the regression models exhibit minimal to no autocorrelation, ensuring the independence of observations. This aspect is crucial for validating the reliability and accuracy of our predictive models. Specifically, the values – 0.015 for the Red Room, 0.023 for the White Room, 0.019 for the Thesis Storage Room, and 0.017 for the Combined Rooms – are all close to 2, indicating that autocorrelation does not significantly affect the outcomes of these models. Despite being slightly below the ideal value of 2, these statistics are within an acceptable range, confirming that the residuals are appropriately random. This evidence supports the robustness of our environmental models across different room settings, suggesting that they provide unbiased and consistent estimations of how environmental variables influence the Preservation Index.

Overall, the analysis of regression models across various library environments highlights that all models are statistically significant and reliable, with each model effectively capturing the influence of environmental variables on the Preservation Index. The Red Room model stands out as particularly strong, evidenced by its highest F-statistic and close-to-ideal Durbin-Watson value, suggesting that it may offer the most accurate predictions within its specific setting. The Combined Rooms model also demonstrates a high level of robustness, making it an excellent candidate for overarching environmental strategies, especially for applications that require a unified approach across different environments. Although the White Room and Thesis Storage Room models have slightly lower metrics, they are still within acceptable ranges and provide valuable insights into their respective settings. Given these findings, all models are deemed reliable, but the Red Room and Combined Rooms models show the most promise for further application.

Moving forward, we will assess the predictive power of these regression models by testing them against a separate dataset designated as the test data. By calculating the Mean Absolute Percentage Error (MAPE) for each model, we will better understand their practical applicability and accuracy. The following subsection will detail this process and the results obtained, providing further insights into the efficacy of these environmental models.

3.10 MAPE Analysis for Room-Specific Regression Models

In this section, we evaluate the predictive accuracy of the regression models for different library environments by comparing their Mean Absolute Percentage Error (MAPE). MAPE is a commonly used measure to assess the accuracy of forecasting models. It expresses the average absolute difference between the predicted values and the actual values as a percentage of the actual values. The formula for calculating MAPE is as follows:

(6) M A P E = 1 n i = 1 n A i F i A i 100

where Ai represents the actual value, Fi represents the forecasted value, and n is the number of observations. A lower MAPE value indicates a more accurate model, as it suggests that the predictions are closer to the actual values. By comparing the MAPE values across the different room-specific models, we can determine which model offers the highest predictive accuracy. This analysis will provide critical insights into the practical applicability of each model for environmental management within the library (Table 2).

Table 2:

The MAPE result.

Environment MAPE
Red room 0.4921
White room 1.1106
Thesis storage room 1.2440
Combined room 2.6515

The MAPE results reveal varying levels of predictive accuracy across the different room models. The Red Room model stands out with the lowest MAPE of 0.4921, indicating it provides the most accurate predictions among all the models. This suggests that the environmental conditions in the Red Room are well understood and effectively modeled, making it a reliable choice for precise environmental management. The White Room and Thesis Storage Room models have slightly higher MAPEs, at 1.1106 and 1.2440 respectively, but these are still within acceptable ranges for practical use. These results suggest that while these models are fairly accurate, there may be opportunities for refinement to improve their predictive capabilities. The Combined Room model, with a MAPE of 2.6515, shows the least accuracy, likely due to the complexity of integrating data from multiple environments. However, the Combined Room model remains a viable option, as it has been designed to account for the varying conditions of temperature, humidity, and light across all rooms. Despite its higher MAPE, this model still achieves a relatively high level of accuracy, making it suitable for general environmental management applications where a broad overview is required. Therefore, while the Red Room model is recommended for precise monitoring, the Combined Room model is also a strong candidate for situations where comprehensive, multi-environment analysis is needed.

4 Conclusions

This study undertook a detailed examination of the environmental factors influencing the Preservation Index across different library rooms – namely, the Red Room, White Room, Thesis Storage Room, and a Combined Room model that integrates data from all environments. By analysing Pearson correlation coefficients, we identified varying degrees of interdependence between temperature, humidity, and light exposure within each setting. These correlations served as a foundation for building robust regression models tailored to predict the Preservation Index based on the environmental conditions in each room.

The regression analysis revealed that all models are statistically significant, with high R2 values indicating that the models effectively capture the variability in the Preservation Index. Among the individual room models, the Red Room demonstrated the strongest predictive power, evidenced by the highest F-statistic and the lowest MAPE of 0.4921, confirming its accuracy in forecasting preservation conditions. The White Room and Thesis Storage Room models, while slightly less precise, still exhibited reliable predictive capabilities with MAPEs of 1.1106 and 1.2440, respectively. The Combined Room model, designed to provide a holistic view across all environments, showed a higher MAPE of 2.6515, which suggests some loss in precision due to the integration of diverse environmental data. Nevertheless, this model remains valuable for broad-based environmental management, as it anticipates the varying conditions across multiple rooms, achieving a high level of accuracy suitable for general applications.

In summary, while each room-specific model offers valuable insights into the environmental dynamics of their respective settings, the Combined Room model stands out as a versatile tool for comprehensive analysis. It effectively balances the complexity of multi-environmental data while maintaining a reasonable level of accuracy, making it the optimal choice for integrated preservation strategies. The findings from this study underscore the importance of tailored environmental models in enhancing the predictive maintenance of library collections, and they provide a strong foundation for further refinement and application of these models in broader contexts.

The novelty of this study lies in the application of IoT technology to create a robust environmental monitoring system specifically tailored for library settings, which has not been previously explored in the literature. Additionally, the development of a Combined Room model provides a new methodological pattern for assessing environmental impacts on library materials.

5 Further Research

Building on the findings of this study, future research should focus on developing a web-based application that automates the calculation of the Preservation Index (PI) using the predictive models established here. This tool would enable real-time monitoring and decision-making for library environments, enhancing the accessibility and utility of the regression models for preservation management. Additionally, further research should explore the application of these models in different environmental contexts within the same city. By analyzing the environmental conditions of various locations, the models could be refined and adapted to capture a broader range of environmental influences. Expanding the analysis to include multiple environments across different neighborhoods or districts would provide a more comprehensive understanding of how microclimates within a city impact the Preservation Index. This could lead to more precise predictive models, ultimately improving the accuracy and effectiveness of preservation strategies across diverse library settings.


Corresponding author: Eka Ratri Noor Wulandari, Faculty of Vocational Studies, Universitas Brawijaya, 65141, Malang, Indonesia, E-mail:

Award Identifier / Grant number: 1711.1/UN10.F16/HK/2024

Acknowledgment

This work was supported and funded by Faculty of Vocational Studies research grant of 2024. In addition, this work was supported by Central Library Universitas Brawijaya.

  1. Research funding: Funding was supported by “Faculty of Vocational Studies, Universitas Brawijaya (1711.1/UN10.F16/HK/2024).

Appendix

See Table A1.

Table A1:

The abbreviation table.

Abbreviation Full term
IoT Internet of things
PI Preservation index
HVAC Heating, ventilation, and air conditioning

References

Abad-Segura, E., M.-D. González-Zamar, J. Infante-Moro, and G. R. García. 2020. “Sustainable Management of Digital Transformation in Higher Education: Global Research Trends.” Sustainability 12 (5): 2107. https://doi.org/10.3390/su12052107.Search in Google Scholar

Ahmad, R., M. Rafiq, and M. Arif. 2024. “Global Trends in Digital Preservation: Outsourcing versus In-House Practices.” Journal of Librarianship and Information Science 56 (4): 1114–25. https://doi.org/10.1177/09610006231173461.Search in Google Scholar

Asinelli, M. G., M. Serra, J. M. Marimòn, and J. S. Espaulella. 2018. “The SmARTS_Museum_V1: An Open Hardware Device for Remote Monitoring of Cultural Heritage Indoor Environments.” HardwareX 4: 1–8, https://doi.org/10.1016/j.ohx.2018.e00028.Search in Google Scholar

Balocco, C., G. Petrone, O. Maggi, G. Pasquariello, R. Albertini, and C. Pasquarella. 2016. “Indoor Microclimatic Study for Cultural Heritage Protection and Preventive Conservation in the Palatina Library.” Journal of Cultural Heritage 22: 956–67. https://doi.org/10.1016/J.CULHER.2016.05.009.Search in Google Scholar

Beskaravainaya, E., and I. Mitroshin. 2024. “The Modern Methods of Data Preservation in Libraries. The Key Aspects and Solutions.” Scientific and Technical Libraries 8: 78–97. https://doi.org/10.33186/1027-3689-2024-8-78-97.Search in Google Scholar

Bhat, W. 2018. “Long-Term Preservation of Big Data: Prospects of Current Storage Technologies in Digital Libraries.” Library Hi Tech 36 (3): 539–55. https://doi.org/10.1108/LHT-06-2017-0117.Search in Google Scholar

Calabrese, M., N. Odisio, L. Appolonia, A. Bernagozzi, J. Christille, A. Glarey, et al.. 2018. “Integrated and Predictive Systems for Preventive Conservation.” Studies in Conservation 63 (Supplement 1): 43–50. https://doi.org/10.1080/00393630.2018.1504436.Search in Google Scholar

Camuffo, D. 2019. “Measuring Air Humidity.” In Microclimate for Cultural Heritage (Third Edition), 3rd ed., 431–57. Amsterdam: Elsevier.10.1016/B978-0-444-64106-9.00018-3Search in Google Scholar

Coppola, F., N. Brown, F. Amicucci, M. Strlič, and A. Modelli. 2020. “Non-Destructive Collection Survey of the Historical Classense Library. Part II: Conservation Scenarios.” Heritage Science 8 (89): 1–10, https://doi.org/10.1186/s40494-020-00430-y.Search in Google Scholar

Deepika, Nagarathna, and Channegowda. 2020. “Monitoring and Detection of Heat Stress of Layers in Poultry Farm Using Multiple Linear Regression.” In Proceedings of the 3rd International Conference on Smart Systems and Inventive Technology, ICSSIT 2020, Tirunelveli, India, 1186–91. Piscataway: IEEE Computer Society.10.1109/ICSSIT48917.2020.9214130Search in Google Scholar

Derksen, K., P. Brimblecombe, G. Piñar, M. Waldherr, A. Graf, and M. Haltrich, et al.. 2024. “Fungal Biodeterioration Risk in Monastic Libraries without Climate Control.” Microorganisms 12 (7): 1–20, https://doi.org/10.3390/microorganisms12071450.Search in Google Scholar

Diulio, M. de la Paz, P. Mercader-Moyano, and A. Gómez. 2019. “The Influence of the Envelope in the Preventive Conservation of Books and Paper Records. Case Study: Libraries and Archives in La Plata, Argentina.” Energy and Buildings 183: 727–38. https://doi.org/10.1016/j.enbuild.2018.11.048.Search in Google Scholar

Diulio, M. de la Paz, P. Mercader-Moyano, and A. Gómez. 2022. “Reusing Maintenance Monitoring Data on Preventive Conservation.” Ge-Conservacion 22 (1): 130–8. https://doi.org/10.37558/gec.v22i1.901.Search in Google Scholar

Dolińska, N., G. Wojciechowska, and Ł. Bednarz. 2025. “Monitoring Environmental and Structural Parameters in Historical Masonry Buildings Using IoT LoRaWAN-Based Wireless Sensors.” Buildings 15 (2): 1–28, https://doi.org/10.3390/buildings15020282.Search in Google Scholar

Garside, P., and K. Bradford. 2024. “Revising and Developing the Environmental Policy at the British Library.” Journal of the Institute of Conservation 47 (1): 22–35. https://doi.org/10.1080/19455224.2023.2301347.Search in Google Scholar

Goswami, M. 2018. “Effects of Environmental Factors on Preservation of Library Documents.” International Journal of Library and Information Studies 8 (2): 42–7.Search in Google Scholar

Harim, N. A., S. A. A. Karim, M. Othman, A. Saaban, A. Ghaffar, K. S. Nisar, et al.. 2020. “Positivity Preserving Interpolation by Using Rational Quartic Spline.” AIMS Mathematics 5 (4): 3762–82. https://doi.org/10.3934/math.2020244.Search in Google Scholar

Kareem, A. H. A., B. M. Ahmed, and L. A. Abdul-Rahaim. 2022. “Design and Implementation of Humidity and Temperature Automation and Monitoring for Public Building Comfortable Climate Based on Cloud Platform.” Przeglad Elektrotechniczny 98 (4): 94–100, https://doi.org/10.15199/48.2022.04.21.Search in Google Scholar

Khean, C. C., Z. Leman, N. M. Adam, and E. E. Supeni. 2023. “Indoor Climates and Energy Consumption by Air Conditioning System in a Learning Resource Centre in Malaysia.” Journal of Advanced Research in Fluid Mechanics and Thermal Sciences 102 (2): 33–41. https://doi.org/10.37934/arfmts.102.2.3341.Search in Google Scholar

Kumar, A. 2024. “An Analytical Approach of Preservation and Conservation of Library Materials in Aspects of Public Libraries.” Poonam Shodh Rachna 3 (5): 1–6. https://doi.org/10.56642/psr.v03i05.001.Search in Google Scholar

Lai, L. O., and J. O. Kaplan. 2022. “A Fast Mean-Preserving Spline for Interpolating Interval Data.” Journal of Atmospheric and Oceanic Technology 39 (4): 503–12. https://doi.org/10.1175/jtech-d-21-0154.1.Search in Google Scholar

Li, N., and Y. G. Tang. 2013. “Library Temperature and Humidity Remote Control System Based on Micro Controller Unit.” Applied Mechanics and Materials 341–342 (July 2013): 690–3. https://doi.org/10.4028/www.scientific.net/AMM.341-342.690.Search in Google Scholar

Manernova, O. 2015. “Conservation of Library Collections.” IFLA Journal 41: 63–9. https://doi.org/10.1177/0340035214561732.Search in Google Scholar

Masenya, T. M. 2020. “Application of Modern Technologies in the Management of Records in Public Libraries.” Journal of the South African Society of Archivists 53: 65–79. https://doi.org/10.4314/jsasa.v53i1.5.Search in Google Scholar

Mishra, L. 2017. “Preservation and Conservation of Library Materials.” ACADEMICIA: An International Multidisciplinary Research Journal 7 (2): 23–9. https://doi.org/10.5958/2249-7137.2017.00011.8.Search in Google Scholar

Monti, L., S. Mirri, C. Prandi, and P. Salomoni. 2019. “Preservation in Smart Libraries: An Experiment Involving IoT and Indoor Environmental Sensing.” In Proceedings – IEEE Global Communications Conference, GLOBECOM: Waikoloa, HI, USA, 9–13 December 2019, 1–6. Piscataway: IEEE.10.1109/GLOBECOM38437.2019.9014149Search in Google Scholar

Nagy, A.-C., and P. Kiszl. 2020. “Personal Digital Legacy Preservation by Libraries.” Journal of Librarianship and Information Science 53: 382–97. https://doi.org/10.1177/0961000620948574.Search in Google Scholar

Ni, Z., J. Hupkes, P. Eriksson, G. Leijonhufvud, M. Karlsson, and S. Gong. 2025. “Parametric Digital Twins for Preserving Historic Buildings: A Case Study at Löfstad Castle in Östergötland, Sweden.” IEEE Access 13 (December 2024): 3371–89. https://doi.org/10.1109/ACCESS.2024.3525442.Search in Google Scholar

Pasqui, V. 2024. “Digital Curation and Long-Term Digital Preservation in Libraries.” JLIS.It 15 (1): 109–25. https://doi.org/10.36253/jlis.it-567.Search in Google Scholar

Persson, P. N. 1989. “A Flexible Portable System for Measurement of Air Temperature and Humidity.” Agricultural and Forest Meteorology 45 (3–4): 167–73. https://doi.org/10.1016/0168-1923(89)90041-5.Search in Google Scholar

Renka, R. J. 1987. “Interpolatory Tension Splines with Automatic.” Society for Industrial and Applied Mathematics 8 (3): 393–415. https://doi.org/10.1137/0908041.Search in Google Scholar

Sharath, K. G. V. 2024. “Strategising IoT Platforms for Enhanced Hygrothermal Monitoring and Control in Libraries: A Transition from Passive to Proactive Operations.” Preservation, Digital Technology & Culture 53 (4): 249–58. https://doi.org/10.1515/pdtc-2024-0027.Search in Google Scholar

Turhan, C., Z. Arsan, and G. Akkurt. 2019. “Impact of Climate Change on Indoor Environment of Historic Libraries in Mediterranean Climate Zone.” International Journal of Global Warming 18 (3/4): 206–20. https://doi.org/10.1504/IJGW.2019.10022707.Search in Google Scholar

Wulandari, E. R. N., H. M. Abdullah, and T. M. Asri. 2023. “Internet of Things (IoT) Based Temperature and Humidity Monitoring System for Library Rooms.” In Proceedings of the 2022 Brawijaya International Conference (BIC 2022), Malang, ID, edited by Y. Adeputera Yusran, F. G. M. Fitri, T. A. Wihastuti, F. A. Nugroho, and I. D. Qurbani, 235–41. Dordrecht: Atlantis Press International BV.10.2991/978-94-6463-140-1_23Search in Google Scholar

Yano, C. A., Z. J. M. Shen, and S. Chan. 2013. “Optimising the Number of Copies and Storage Protocols for Print Preservation of Research Journals.” International Journal of Production Research 51 (23–24): 7456–69. https://doi.org/10.1080/00207543.2013.827810.Search in Google Scholar

Yousuf Ali, M. 2017. “Library Book Theft and Audits in University Libraries of Pakistan.” Journal of Library Administration 57 (1): 87–98. https://doi.org/10.1080/01930826.2016.1251252.Search in Google Scholar

Zaveri, P. 2014. “Damage to Libraries Due to Water Related Disasters.” Library Philosophy and Practice 1: 1–14.Search in Google Scholar

Received: 2024-12-20
Accepted: 2025-05-05
Published Online: 2025-06-16
Published in Print: 2025-07-28

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 9.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/pdtc-2024-0085/html
Scroll to top button