Abstract
Bayesian hierarchical models with a spatially smooth conditional autoregressive prior distribution are commonly used to estimate the spatio-temporal pattern in disease risk from areal unit data. However, most of the modeling approaches do not take possible boundaries of step changes in disease risk between geographically neighbouring areas into consideration, which may lead to oversmoothing of the risk surfaces, prevent the detection of high-risk areas and yield biased estimation of disease risk. In this paper, we propose a two-stage method to jointly estimate the disease risk in small areas over time and detect the locations of boundaries that separate pairs of neighbouring areas exhibiting vastly different risks. In the first stage, we use a graph-based optimisation algorithm to construct a set of candidate neighbourhood matrices that represent a range of possible boundary structures for the disease data. In the second stage, a Bayesian hierarchical spatio-temporal model that takes the boundaries into account is fitted to the data. The performance of the methodology is evidenced by simulation, before being applied to a study of respiratory disease risk in Greater Glasgow, Scotland.
Acknowledgment
The authors would like to thank the editors and anonymous reviewers, whose valuable comments have greatly improved the paper.
-
Research ethics: Not applicable.
-
Informed consent: Not applicable.
-
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
-
Use of Large Language Models, AI and Machine Learning Tools: None declared.
-
Conflict of interest: The authors state no conflict of interest.
-
Research funding: None declared.
-
Data availability: The raw data can be obtained from the corresponding author on reasonable request.
References
1. Gelfand, AE, Diggle, P, Guttorp, P, Fuentes, M. Handbook of spatial statistics. Boca Raton, Florida: CRC Press; 2010.10.1201/9781420072884Suche in Google Scholar
2. Tobler, WR. A computer movie simulating urban growth in the Detroit region. Econ Geogr 1970;46(1 Suppl):234–40. https://doi.org/10.2307/143141.Suche in Google Scholar
3. Besag, J, York, J, Mollié, A. Bayesian image restoration, with two applications in spatial statistics. Ann Inst Stat Math 1991;43:1–20. https://doi.org/10.1007/bf00116466.Suche in Google Scholar
4. Leroux, BG, Lei, X, Breslow, N. Estimation of disease rates in small areas: a new mixed model for spatial dependence. In: Statistical models in epidemiology, the environment, and clinical trials. New York, USA: Springer; 2000:179–91 pp.10.1007/978-1-4612-1284-3_4Suche in Google Scholar
5. Stern, HS, Cressie, N. Posterior predictive model checks for disease mapping models. Stat Med 2000;19:2377–97. https://doi.org/10.1002/1097-0258(20000915/30)19:17/18<2377::aid-sim576>3.3.co;2-t.10.1002/1097-0258(20000915/30)19:17/18<2377::AID-SIM576>3.3.CO;2-TSuche in Google Scholar
6. Bernardinelli, L, Clayton, D, Pascutto, C, Montomoli, C, Ghislandi, M, Songini, M. Bayesian analysis of space-time variation in disease risk. Stat Med 1995;14:2433–43. https://doi.org/10.1002/sim.4780142112.Suche in Google Scholar
7. Knorr-Held, L, Besag, J. Modelling risk from a disease in time and space. Stat Med 1998;17:2045–60. https://doi.org/10.1002/(sici)1097-0258(19980930)17:18<2045::aid-sim943>3.0.co;2-p.10.1002/(SICI)1097-0258(19980930)17:18<2045::AID-SIM943>3.0.CO;2-PSuche in Google Scholar
8. Knorr-Held, L. Bayesian modelling of inseparable space-time variation in disease risk. Stat Med 2000;19:2555–67. https://doi.org/10.1002/1097-0258(20000915/30)19:17/18<2555::aid-sim587>3.0.co;2-#.10.1002/1097-0258(20000915/30)19:17/18<2555::AID-SIM587>3.0.CO;2-#Suche in Google Scholar
9. Napier, G, Lee, D, Robertson, C, Lawson, A, Pollock, KG. A model to estimate the impact of changes in MMR vaccine uptake on inequalities in measles susceptibility in Scotland. Stat Methods Med Res 2016;25:1185–200. https://doi.org/10.1177/0962280216660420.Suche in Google Scholar
10. Rushworth, A, Lee, D, Mitchell, R. A spatio-temporal model for estimating the long-term effects of air pollution on respiratory hospital admissions in Greater London. Spat Spatio-Temporal Epidemiol 2014;10:29–38. https://doi.org/10.1016/j.sste.2014.05.001.Suche in Google Scholar
11. Wang, W, Xiao, X, Qian, J, Chen, S, Liao, F, Yin, F, et al.. Reclaiming independence in spatial-clustering datasets: a series of data-driven spatial weights matrices. Stat Med 2022;41:2939–56. https://doi.org/10.1002/sim.9395.Suche in Google Scholar PubMed PubMed Central
12. Knorr-Held, L, Raßer, G. Bayesian detection of clusters and discontinuities in disease maps. Biometrics 2000;56:13–21. https://doi.org/10.1111/j.0006-341x.2000.00013.x.Suche in Google Scholar PubMed
13. Anderson, C, Lee, D, Dean, N. Identifying clusters in Bayesian disease mapping. Biostatistics 2014;15:457–69. https://doi.org/10.1093/biostatistics/kxu005.Suche in Google Scholar PubMed
14. Adin, A, Lee, D, Goicoa, T, Ugarte, MD. A two-stage approach to estimate spatial and spatio-temporal disease risks in the presence of local discontinuities and clusters. Stat Methods Med Res 2019;28:2595–613. https://doi.org/10.1177/0962280218767975.Suche in Google Scholar PubMed
15. Yin, X, Napier, G, Anderson, C, Lee, D. Spatio-temporal disease risk estimation using clustering-based adjacency modelling. Stat Methods Med Res 2022;31:1184–203. https://doi.org/10.1177/09622802221084131.Suche in Google Scholar PubMed PubMed Central
16. Lu, H, Reilly, CS, Banerjee, S, Carlin, BP. Bayesian areal wombling via adjacency modeling. Environ Ecol Stat 2007;14:433–52. https://doi.org/10.1007/s10651-007-0029-9.Suche in Google Scholar
17. Ma, H, Carlin, BP, Banerjee, S. Hierarchical and joint site-edge methods for medicare hospice service region boundary analysis. Biometrics 2010;66:355–64. https://doi.org/10.1111/j.1541-0420.2009.01291.x.Suche in Google Scholar PubMed PubMed Central
18. Lee, D, Mitchell, R. Boundary detection in disease mapping studies. Biostatistics 2012;13:415–26. https://doi.org/10.1093/biostatistics/kxr036.Suche in Google Scholar PubMed
19. Lee, D, Mitchell, R. Locally adaptive spatial smoothing using conditional auto-regressive models. J Roy Stat Soc C Appl Stat 2013;62:593–608. https://doi.org/10.1111/rssc.12009.Suche in Google Scholar
20. Lee, D, Meeks, K, Pettersson, W. Improved inference for areal unit count data using graph-based optimisation. Stat Comput 2021;31:1–17. https://doi.org/10.1007/s11222-021-10025-7.Suche in Google Scholar
21. Wakefield, J, Kim, A. A Bayesian model for cluster detection. Biostatistics 2013;14:752–65. https://doi.org/10.1093/biostatistics/kxt001.Suche in Google Scholar PubMed PubMed Central
22. Anderson, C, Lee, D, Dean, N. Bayesian cluster detection via adjacency modelling. Spat Spatio-Temporal Epidemiol 2016;16:11–20. https://doi.org/10.1016/j.sste.2015.11.005.Suche in Google Scholar PubMed
23. Santafé, G, Adin, A, Lee, D, Ugarte, MD. Dealing with risk discontinuities to estimate cancer mortality risks when the number of small areas is large. Stat Methods Med Res 2021;30:6–21. https://doi.org/10.1177/0962280220946502.Suche in Google Scholar PubMed
24. Salciccioli, JD, Marshall, DC, Shalhoub, J, Maruthappu, M, De Carlo, G, Chung, KF. Respiratory disease mortality in the United Kingdom compared with EU15+ countries in 1985–2015: observational study. Br Med J 2018;363. https://doi.org/10.1136/bmj.k4680.Suche in Google Scholar PubMed PubMed Central
25. Walsh, D, Bendel, N, Jones, R, Hanlon, P. Investigating a Glasgow effect: why do equally deprived UK cities experience different health outcomes? Glasgow, UK: Glasgow Centre for Population Health; 2010.10.1016/j.puhe.2010.02.006Suche in Google Scholar PubMed
26. Moran, PA. Notes on continuous stochastic phenomena. Biometrika 1950;37:17–23. https://doi.org/10.2307/2332142.Suche in Google Scholar
27. Jacquez, GM, Maruca, S, Fortin, MJ. From fields to objects: a review of geographic boundary analysis. J Geogr Syst 2000;2:221–41. https://doi.org/10.1007/pl00011456.Suche in Google Scholar
28. Waller, LA, Carlin, BP, Xia, H, Gelfand, AE. Hierarchical spatio-temporal mapping of disease rates. J Am Stat Assoc 1997;92:607–17. https://doi.org/10.2307/2965708.Suche in Google Scholar
29. Lee, D. A comparison of conditional autoregressive models used in Bayesian disease mapping. Spat Spatio-Temporal Epidemiol 2011;2:79–89. https://doi.org/10.1016/j.sste.2011.03.001.Suche in Google Scholar PubMed
30. R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2013.Suche in Google Scholar
31. Lee, D, Rushworth, A, Sahu, SK. A Bayesian localized conditional autoregressive model for estimating the health effects of air pollution. Biometrics 2014;70:419–29. https://doi.org/10.1111/biom.12156.Suche in Google Scholar PubMed PubMed Central
32. Rushworth, A, Lee, D, Sarran, C. An adaptive spatiotemporal smoothing model for estimating trends and step changes in disease risk. J Roy Stat SocC Appl Stat 2017;66:141–57. https://doi.org/10.1111/rssc.12155.Suche in Google Scholar
33. Muegge, R, Dean, N, Jack, E, Lee, D. National lockdowns in England: the same restrictions for all, but do the impacts on COVID-19 mortality risks vary geographically? Spat Spatio-Temporal Epidemiol 2023;44:100559. https://doi.org/10.1016/j.sste.2022.100559.Suche in Google Scholar PubMed PubMed Central
34. Rue, H, Martino, S, Chopin, N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J Roy Stat Soc B Stat Methodol 2009;71:319–92. https://doi.org/10.1111/j.1467-9868.2008.00700.x.Suche in Google Scholar
35. Geman, S, Geman, D. Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans Pattern Anal Mach Intell 1984:721–41. https://doi.org/10.1109/tpami.1984.4767596.Suche in Google Scholar PubMed
36. Metropolis, N, Rosenbluth, AW, Rosenbluth, MN, Teller, AH, Teller, E. Equation of state calculations by fast computing machines. J Chem Phys 1953;21:1087–92. https://doi.org/10.1063/1.1699114.Suche in Google Scholar
37. Hastings, WK. Monte Carlo sampling methods using Markov chains and their applications. Biometrika 1970;57:97–109. https://doi.org/10.2307/2334940.Suche in Google Scholar
38. Altekar, G, Dwarkadas, S, Huelsenbeck, JP, Ronquist, F. Parallel metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference. Bioinformatics 2004;20:407–15. https://doi.org/10.1093/bioinformatics/btg427.Suche in Google Scholar PubMed
39. Napier, G, Lee, D, Robertson, C, Lawson, A. A Bayesian space-time model for clustering areal units based on their disease trends. Biostatistics 2019;20:681–97. https://doi.org/10.1093/biostatistics/kxy024.Suche in Google Scholar PubMed PubMed Central
40. Eddelbuettel, D, François, R, Allaire, J, Ushey, K, Kou, Q, Russel, N, et al.. Rcpp: seamless R and C++ integration. J Stat Software 2011;40:1–18. https://doi.org/10.18637/jss.v040.i08.Suche in Google Scholar
41. Eddelbuettel, D. Seamless R and C++ integration with Rcpp. New York, UK: Springer; 2013.10.1007/978-1-4614-6868-4Suche in Google Scholar
42. Geweke, J. Evaluating the accuracy of sampling-based approaches to the calculations of posterior moments. Bayesian Stat 1992;4:641–9.10.1093/oso/9780198522669.003.0010Suche in Google Scholar
43. Spiegelhalter, DJ, Best, NG, Carlin, BP, Van Der Linde, A. Bayesian measures of model complexity and fit. J Roy Stat Soc B Stat Methodol 2002;64:583–639. https://doi.org/10.1111/1467-9868.00353.Suche in Google Scholar
44. Bradley, AP. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 1997;30:1145–59. https://doi.org/10.1016/s0031-3203(96)00142-2.Suche in Google Scholar
45. Hanley, JA, McNeil, BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 1982;143:29–36. https://doi.org/10.1148/radiology.143.1.7063747.Suche in Google Scholar PubMed
46. Gelman, A, Rubin, DB. Inference from iterative simulation using multiple sequences. Stat Sci 1992;7:457–72. https://doi.org/10.1214/ss/1177011136.Suche in Google Scholar
47. Lu, H, Carlin, BP. Bayesian areal wombling for geographical boundary analysis. Geogr Anal 2005;37:265–85. https://doi.org/10.1111/j.1538-4632.2005.00624.x.Suche in Google Scholar
Supplementary Material
This article contains supplementary material (https://doi.org/10.1515/ijb-2023-0138).
© 2025 Walter de Gruyter GmbH, Berlin/Boston
Artikel in diesem Heft
- Frontmatter
- Research Articles
- Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials
- Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design
- A review of survival stacking: a method to cast survival regression analysis as a classification problem
- DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data
- A hybrid hazard-based model using two-piece distributions
- Regression analysis of clustered current status data with informative cluster size under a transformed survival model
- Bayesian covariance regression in functional data analysis with applications to functional brain imaging
- Risk estimation and boundary detection in Bayesian disease mapping
- An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach
- Short Communication
- A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles
- Research Articles
- Guidance on individualized treatment rule estimation in high dimensions
- Weighted Euclidean balancing for a matrix exposure in estimating causal effect
- Penalized regression splines in Mixture Density Networks
Artikel in diesem Heft
- Frontmatter
- Research Articles
- Prognostic adjustment with efficient estimators to unbiasedly leverage historical data in randomized trials
- Homogeneity test and sample size of response rates for AC 1 in a stratified evaluation design
- A review of survival stacking: a method to cast survival regression analysis as a classification problem
- DsubCox: a fast subsampling algorithm for Cox model with distributed and massive survival data
- A hybrid hazard-based model using two-piece distributions
- Regression analysis of clustered current status data with informative cluster size under a transformed survival model
- Bayesian covariance regression in functional data analysis with applications to functional brain imaging
- Risk estimation and boundary detection in Bayesian disease mapping
- An improved estimator of the logarithmic odds ratio for small sample sizes using a Bayesian approach
- Short Communication
- A multivariate Bayesian learning approach for improved detection of doping in athletes using urinary steroid profiles
- Research Articles
- Guidance on individualized treatment rule estimation in high dimensions
- Weighted Euclidean balancing for a matrix exposure in estimating causal effect
- Penalized regression splines in Mixture Density Networks