Regression analysis of unmeasured confounding

Brian Knaeble; Braxton Osting; Mark A. Abramson

doi:10.1515/em-2019-0028

Article

Regression analysis of unmeasured confounding

Brian Knaeble , Braxton Osting and Mark A. Abramson

Published/Copyright: May 4, 2020

Published by

Become an author with De Gruyter Brill

Submit Manuscript Author Information Explore this Subject

From the journal Epidemiologic Methods Volume 9 Issue 1

Abstract

When studying the causal effect of x on y, researchers may conduct regression and report a confidence interval for the slope coefficient βx. This common confidence interval provides an assessment of uncertainty from sampling error, but it does not assess uncertainty from confounding. An intervention on x may produce a response in y that is unexpected, and our misinterpretation of the slope happens when there are confounding factors w. When w are measured we may conduct multiple regression, but when w are unmeasured it is common practice to include a precautionary statement when reporting the confidence interval, warning against unwarranted causal interpretation. If the goal is robust causal interpretation then we can do something more informative. Uncertainty, in the specification of three confounding parameters can be propagated through an equation to produce a confounding interval. Here, we develop supporting mathematical theory and describe an example application. Our proposed methodology applies well to studies of a continuous response or rare outcome. It is a general method for quantifying error from model uncertainty. Whereas, confidence intervals are used to assess uncertainty from unmeasured individuals, confounding intervals can be used to assess uncertainty from unmeasured attributes.

Keywords: correlation; model uncertainty; optimization; propensity

Corresponding author: Mark A. Abramson,Department of Mathematics, Utah Valley University, Orem, UT, USA, E-mail: mark.abramson@uvu.edu

A Proof of Proposition 1.1

Let e∈ℝn denote the ones vector, W=[e|w1|⋯|wp],

P=W(WtW)−1Wt, and E=1neet.

Note that P and E are projection matrices, i. e., P2=P and E2=E. Furthermore, since W includes e as a column, we have that EP=PE=E, (I−P)(I−E)=(I−E)(I−P)=I−P, and P−E is also a projection matrix.

We use P and E to write x^=Px, y^=Py,

σx2=1n|(I−E)x|2, σy2=1n|(I−E)y|2,σx^2=1n|(I−E)Px|2, σy^2=1n|(I−E)Py|2,Rwx2=σx^2σx2=|(I−E)Px|2|(I−E)x|2, Rwy2=σy^2σy2=|(I−E)Py|2|(I−E)y|2,ρxy=〈(I−E)x, (I−E)y〉nσxσy, ρx^y^=〈(I−E)Px, (I−E)Py〉nσx^σy^,

and

ρ(x−x^)(y−y^)=〈(I−P)x, (I−P)y〉|(I−P)x| |(I−P)y|.

Using the above expressions, we compute

nσxσyρxy=〈(I−E)x, (I−E)y〉=〈(I−E)Px, (I−E)Py〉+〈(I−P)x, (I−P)y〉=nσx^σy^ρx^y^+nσxσy1−Rwx21−Rwy2ρ(x−x^)(y−y^).

Dividing both sides by nσxσy, we obtain

(8)ρxy=RwxRwyρx^y^+1−Rwx21−Rwy2ρ(x−x^)(y−y^).

Solving for ρ(x−x^)(y−y^), we obtain an expression for the partial correlation,

(9)ρ(x−x^)(y−y^)=ρxy−RwxRwyρx^y^1−Rwx21−Rwy2.

We now consider our model from (1) rewritten as

y=βx|wx+Wβ+ε,

where β=(β0|w, β1, ⋯, βp). Write X=[x|W] and Q=X(XtX)−1Xt. Note that Q is a projection matrix, and since range(W)⊂range(X), we have that PQ=QP=P and (I−P)(I−Q)=(I−Q)(I−P)=(I−Q). In this notation, the fitted values are given by

Qy=βx|wx+Wβ.

We now add and subtract terms as follows:

Q(y−Py+Py)=βx|w(x−Px+Px)+Wβ

and rearrange to obtain:

Q(I−P)y=βx|w(I−P)x+P(Wβ+βx|wx−y).

We now apply I−P to both sides, take the inner product with x on both sides, use Qx=x, and rearrange to obtain

(10a)βx|w=〈x, (I−P)y〉〈x, (I−P)x〉

(10b)=|y−y^||x−x^|ρ(x−x^)(y−y^)

(10c)=σyσx1−Rwy21−Rwx2ρ(x−x^)(y−y^).

Combining (9) and (10), we obtain the desired result.

B Proof of Proposition 2.1

In the setting of the proof of Proposition 1.1, consider (8). Since ρ(y−y^)(x−x^)∈[−1,1], we obtain

(11)ρxy∈[ξ−, ξ+], where ξ±=RwxRwyρx^y^±1−Rwx21−Rwy2.

Manipulating (11) to isolate ρx^y^ gives (4).

Conversely, if n>p+2, then there are sufficient degrees of freedom for ρ(y−y^)(x−x^) to take any desired value in [−1, 1].

C Proof of Proposition 2.2

We assume that the feasible set Ω is non-empty and is defined by the upper and lower inequality constraints in (3a), (3b), (3c), and (4), for eight inequality constraints in total. We do not allow α−=α+ in (4), but we do allow lx=ux, ly=uy, and or lx^y^=ux^y^. We say an inequality constraint is active, when it holds with equality. In what follows for k=1,2,3 we say exactly k constraints are active when exactly k constraints are active and each active constraint is from a different line of (3a), (3b), (3c), or (4), e. g., lx=Rwx2=ux counts as one constraint not two constraints. There is no ambiguity when zero constraints are active. In three dimensions it is impossible or redundant to have four active constraints.

Recalling our notation l=minΩ(βx|w) and u=maxΩ(βx|w), we say a point (Rwx2, Rwy2, ρx^y^) is optimal if it satisfies either

βx|w(Rwx2, Rwy2, ρx^y^)=l or βx|w(Rwx2, Rwy2, ρx^y^)=u.

To prove Proposition 2.2, we show that any optimal points in Ω must also be within the finite subset of points S. The sets Ω and S depend on the values of the parameters {ρxy, σy/σx;lx, ux, ly, uy, lx^y^, ux^y^}, as described in Section 2. Recall, βx|w=σyσxρxy−RwxRwyρx^y^1−Rwx2. Note that

∂βx|w∂Rwx2=0⇔∂βx|w∂Rwx=0 and ∂βx|w∂Rwy2=0⇔∂βx|w∂Rwx=0

wherever Rwx≠0 and Rwy≠0. To simplify algebra in what follows we compute ∂βx|w/∂Rwx in place of ∂βx|w/∂Rwx2 and ∂βx|w/∂Rwy in place of ∂βx|w/∂Rwy2. Likewise, we infer the constancy of βx|w in Rwx2 or Rwy2 from its constancy in Rwx or Rwy, respectfully.

Wherever no constraints are active ∂βx|w∂βx^y^=σyσx−RwxRwy1−Rwx2≠0 and no optimal points exist. If exactly one constraint is active, we have the following cases. If the constraint is from either (3a) or (3b), then ∂βx|w∂ρx^y^=σyσx−RwxRwy1−Rwx2≠0. If that constraint is (3c), then either ∂βx|w∂Rwy=σyσx−Rwxρx^y^1−Rwx2≠0 or, if ρx^y^=0, βx|w is constant in Rwy2. Finally, if that one constraint is (4), then

(12)βx|w=±σyσx1−Rwy2/1−Rwx2,

which is strictly monotonic in both Rwx2 and Rwy2 on both surfaces

(13)ρx^y^=ρxy±1−Rwx21−Rwy2RwxRwy.

We have thus far shown that two or more constraints must be active in order for a point to be optimal.

Suppose exactly two constraints are active. If those two constraints are (3a) and (3b) then βx|w=σyσxρxy−bxbyρx^y^1−bx2 and either ∂βx|w∂ρx^y^≠0 or βx|w is constant in ρx^y^. If those two constraints are (3a) and (3c) then βx|w=σyσxρxy−bxRwybx^y^1−bx2 and either ∂βx|w∂Rwy≠0 or βx|w is constant in Rwy2. If those two constraints are (3b) and (3c) then βx|w=σyσxρxy−Rwxbybx^y^1−Rwx2 and because ∂βx|w∂Rwx=σyσx−bybx^y^Rwx2+2ρxyRwx2−bybx^y^(1−Rwx2)2 optimal points require

Rwx2=q±2(−bybx^y^, 2ρxy, −bybx^y^)

producing the options in line (7a) of Proposition 2.2.

We now consider exactly two active constraints and require one of them to be (4). If those two constraints are (3a) and (4) then as in (12) we have βx|w strictly monotonic in Rwy2. If those two constraints are (3b) and (4) then as in (12) we have βx|w strictly monotonic in Rwx2. If those two constraints are (3c) and (4) then bx^y^=ρxy±1−Rwx21−Rwy2RwxRwy, or written differently

(14)g(Rwx, Rwy):=bx^y^RwxRwy±1−Rwx21−Rwy2=ρxy.

Along a curve g(Rwx, Rwy)=ρxy, via (12),

h(Rwx, Rwy):=βx|w=±σyσx1−Rwy2/1−Rwx2.

At an optimal point, for some real λ, we must have ∇h=λ∇g (Nocedal and Wright 2006, p. 321), implying equality between

∂g∂Rwx/∂g∂Rwy=bx^y^Rwy±2Rwx(1+Rwy2)1/2(1−Rwx2)−1/2bx^y^Rwx±2Rwy(1+Rwx2)1/2(1−Rwy2)−1/2

and

∂h∂Rwx/∂h∂Rwy=Rwx(1−Rwy2)1/2(1−Rwx2)−3/2−Rwy(1−Rwx2)−1/2(1−Rwy)−1/2=Rwx(1−Rwy2)−Rwy(1−Rwx2),

which in turn implies Rwx=Rwy. Solving for R2=Rwx2=Rwy2 within (14) produces R2=ρxy+1bx^y^+1 or R2=ρxy−1bx^y^−1 resulting in lines (7b) and (7c) of Proposition 2.2.

Suppose exactly three constraints are active. If those three constraints are (3a), (3b), and (3c) then any candidate point is of the form (bx2, by2, bx^y^) as in line (7d) of Proposition 2.2. If those three constraints are (3a), (3b), and (4) then via (13) we have ρx^y^=ρxy±1−bx21−by2bxby resulting in line (7e) of Proposition 2.2. If those three constraints are (3a), (3c), and (4) then via (13) we have bx^y^=ρxy±1−bx21−Rwy2bxRwy, which after rearrangement and squaring gives

(15)(bx2bx^y^2+1−bx2)Rwy2−2bxbx^y^ρxyRwy+(bx2−1+ρxy2)=0.

Using the quadratic formula on (15) results in line (7f) of Propositon 2.2. If those three constraints are (3b), (3c), and (4), then likewise, via (13), we have bx^y^=ρxy±1−Rwx21−by2Rwxby, and with analogous rearrangement, squaring, and use of the quadratic formula, we solve for Rwx and derive line (7g) of Proposition 2.2.

Research funding: None declared.
Author contributions: All authors have accepted responsibility for the entire content of this manuscript and approved its submission.
Competing interests: Authors state no conflict of interest.

References

Center for Disease Control. 2017. Public Health Statement on Polybrominated Diphenyl Ethers (PBDEs). URL https://www.atsdr.cdc.gov/ToxProfiles/tp207-c1-b.pdf, (accessed March 20, 2019).Search in Google Scholar

Chatfield, C. 1995. “Model Uncertainty, Data Mining and Statistical Inference.” Journal of the Royal Statistical Society: Series A 158: 419–66.10.2307/2983440Search in Google Scholar

Choi, A. L., S. Cordier, P. Weihe, and P. Grandjean. 2008. “Negative Confounding in the Evaluation of Toxicity: the Case of Methylmercury in Fish and Seafood.” Critical Reviews in Toxicology 38: 877–93.10.1080/10408440802273164Search in Google Scholar PubMed PubMed Central

Cornfield, J., W. Haenszel, E. C. Hammond, A. M. Lilienfeld, M. B. Shimkin, and E. L. Wynder. 2008. “Smoking and Lung Cancer: Recent Evidence and A Discussion of Some Questions.” Journal of the National Cancer Institute 22: 173–203.10.1093/ije/dyp289Search in Google Scholar PubMed

Corraini, P., M. Olsen, L. Pedersen, O. M. Dekkers, and J. P. Vandenbroucke. 2017. “Effect Modification, Interaction and Mediation: An Overview of Theoretical Insights for Clinical Investigators.” Clinical Epidemiology 9: 331–8.10.2147/CLEP.S129728Search in Google Scholar PubMed PubMed Central

Ding, P. and L. Miratrix. 2015. “To Adjust or not to Adjust? Sensitivity Analysis of m-bias and Butterfly-bias.” Journal of Causal Inference 3: 41–57.10.1515/jci-2013-0021Search in Google Scholar

Ding, P. and T. VanderWeele. 2014. “Generalized Cornfield Conditions for the Risk Difference.” Biometrika 101: 971–7.10.1093/biomet/asu030Search in Google Scholar

Ding, P. and T. VanderWeele. 2016. “Sensitivity Analysis without Assumptions.” Epidemiology 27: 368–77.10.1097/EDE.0000000000000457Search in Google Scholar PubMed PubMed Central

Ding, P. and T. VanderWeele. 2017. “Sensitivity Analysis in Observational Research: Introducing the E-value.” Annals of Internal Medicine 167: 268–74.10.7326/M16-2607Search in Google Scholar PubMed

Eskenazi, B., J. Chevrier, S. A. Rauch, K. Kogut, K. G. Harley, C. Johnson, C. Trujillo, A. Sjödin, and A. Bradman. 2013. “In Utero and Childhood Polybrominated Diphenyl Ether (PBDE) Exposures and Neurodevelopment in the CHAMACOS Study.” Environmental Health Perspectives 121: 257–62.10.1201/b18030-18Search in Google Scholar

Fisher, R. A. 1935. Design of Experiments. Edinburgh: Oliver and Boyd.Search in Google Scholar

Frank, K. 2000. “Impact of a Confounding Variable on a Regression Coefficient.” Sociological Methods & Research 29: 147–94.10.1177/0049124100029002001Search in Google Scholar

Hastie, T., R. Tibshirani, and J. Friedman. 2009. The Elements of Statistical Learning, Data Mining, Inference, and Prediction, Springer Series in Statistics, 2nd ed. New York: Springer.10.1007/978-0-387-84858-7Search in Google Scholar

Horton, M. K., S. Bousleiman, R. Jones, A. Sjödin, X. Liu, R. Whyatt, R. Wapner, and P. Factor-Litvak. 2013. “Predictions of Serum Concentrations of Polybrominated Flame Retardants Among Healthy Pregnant Women in an Urban Environment: A Cross-sectional Study.” Environmental Health 12.10.1186/1476-069X-12-23Search in Google Scholar PubMed PubMed Central

Hosman, C., B. Hansen, and P. Holland. 2010. “The Sensitivity of Linear Regression Coefficients’ Confidence Limits to the Omission of a Confounder.” The Annals of Applied Statistics 4: 849–70.10.1214/09-AOAS315Search in Google Scholar

Knaeble, B. 2015. “Regression and Random Confounding.” Electronic Journal of Applied Statistical Analysis 8.Search in Google Scholar

Knaeble, B. 2017. “Adjustment with Three Continuous Variables.” Communications in Statistics – Simulation and Computation 48.10.1080/03610918.2017.1390128Search in Google Scholar

Knaeble, B. 2019. Brian Knaeble’s GitHub Page. URL: https://github.com/bknaeble/ConfoundingIntervals.Search in Google Scholar

Knaeble, B. and J. Chan. 2018. “Odds are the Sign is Right.” Biometrical Journal 60: 1164–71.10.1002/bimj.201700199Search in Google Scholar PubMed

Knaeble, B. and S. Dutter. 2017. “Reversals of Least-Squares Estimates and Model-Invariant Estimation for Directions of Unique Effects.” The American Statistician 71: 97–105.10.1080/00031305.2016.1226951Search in Google Scholar

Kontopantelis, E., T. Doran, D. A. Springate, I. Buchan, and D. Reeves. 2015. “Regression Based Quasi-Experimental Approach when Randomisation is not an Option: Interrupted Time Series Analysis.” BMJ 350: h2750.10.1136/bmj.h2750Search in Google Scholar PubMed PubMed Central

Kuratko, C. N., E. C. Barrett, E. B. Nelson, and N. Salem, Jr. 2013. “The Relationship of Docosahexaenoic Acid (dha) With Learning and Behavior in Healthy Children: A Review.” Nutrients 5: 2777–810.10.3390/nu5072777Search in Google Scholar PubMed PubMed Central

Lee, W. C. 2011. “Bounding the Bias of Unmeasured Factors with Confounding and Effect Modifying Potentials.” Statistics in Medicine 30: 1007–17.10.1002/sim.4151Search in Google Scholar PubMed

Liu, J., A. Raine, P. H. Venables, C. Dalais, and S. A. Mednick. 2003. “Malnutrition at Age 3 Years and Lower Cognitive Ability at Age 11 Years: Independence from Psychosocial Adversity.” Archives of Pediatric & Adolescent Medicine 157: 593–600.10.1001/archpedi.157.6.593Search in Google Scholar PubMed PubMed Central

MacLehose, R. F., S. Kaufman, J. S. Kaufman, and C. Poole. 2005. “Bounding Causal Effects Under Uncontrolled Confounding Using Counterfactuals.” Epidemiology 16: 548–55.10.1097/01.ede.0000166500.23446.53Search in Google Scholar PubMed

McNamee, R. 2005. “Regression Modelling and Other Methods to Control Confounding.” Occupational and Environmental Medicine 62: 500–6.10.1136/oem.2002.001115Search in Google Scholar PubMed PubMed Central

Nocedal, J. and S. J. Wright. 2006. Numerical Optimization, Springer Series in Operations Research, 2nd ed. New York: Springer.Search in Google Scholar

Patel, C. J., B. Burford, and J. P. Ioannidis. 2015. “Assessment of Vibration of Effects due to Model Specification can Demonstrate the Instability of Observational Associations.” Journal of Clinical Epidemiology 68: 1046–58.10.1016/j.jclinepi.2015.05.029Search in Google Scholar PubMed PubMed Central

Pearce, N., J. P. Vandenbroucke, and D. A. Lawlor. 2019. “Casual Inference in Environmental Epidemiology: Old and New Approaches.” Epidemiology 30: 311–6.10.1097/EDE.0000000000000987Search in Google Scholar PubMed PubMed Central

Pearl, J. 2009. “Causal Inference in Statistics: An Overview.” Statistics Surveys 3: 96–146.10.1214/09-SS057Search in Google Scholar

Pearl, J. and E. Bareinboim. 2014. “External Validity: From Do-caluclus to Transportability Across Populations.” Statistical Science 29: 579–95.10.21236/ADA563868Search in Google Scholar

Ramani, G. B., S. M. Jaeggi, E. N. Daubert, and M. Buschkuehl. 2017. “Domain-specific and Domain-general Training to Improve Kindergarten Children’s Mathematics.” Journal of Numerical Cognition 3: 468–95.10.5964/jnc.v3i2.31Search in Google Scholar

Rosenbaum, P. 1995. “Discussion of ‘Causal Diagrams for Empirical Research’ by J. Pearl.” Biometrika 82: 698.10.2307/2337336Search in Google Scholar

Rosenbaum, P. 2002. Observational Studies, Springer Series in Statistics, 2nd ed. New York: Springer.10.1007/978-1-4757-3692-2Search in Google Scholar

Rosenbaum, P. 2010. Design of Observational Studies, Springer Series in Statistics. New York: Springer.10.1007/978-1-4419-1213-8Search in Google Scholar

Rosenbaum, P. and D. Rubin. 1983. “The Central Role of the Propensity Score in Observational Studies for Causal Effects.” Biometrika 70: 41–55.10.21236/ADA114514Search in Google Scholar

Rothman, K. and S. Greenland. 1998. Modern Epidemiology. Philadephia: Lippincott, Williams, & Wilkins.Search in Google Scholar

Rubin, D. 2009. “Should Observational Studies be Designed to Allow Lack of Balance in Covariate Distributions Across Treatment Groups?” Statistics in Medicine 28: 1420–3.10.1002/sim.3565Search in Google Scholar

Schecter, A., D. Haffner, J. Colacino, K. Patel, O. Päpke, M. Opel, and L. Birnbaum. 2010. “Polybrominated Diphenyl Ethers (PBDEs) and Hexacbromocyclodecane (HBCD) in Composite U.S. Food Samples.” Environmental Health Perspectivese 118: 357–62.10.1289/ehp.0901345Search in Google Scholar PubMed PubMed Central

VanderWeele, T. J. 2009. “On the Distinction Between Interaction and Effect Modification. Epidemiology 20: 863–71.10.1097/EDE.0b013e3181ba333cSearch in Google Scholar PubMed

VanderWeele, T. J. 2017. “On a Square-Root Transformation of the Odds Ratio for a Common Outcome.” Epidemiology 28: e58e60.10.1097/EDE.0000000000000733Search in Google Scholar PubMed PubMed Central

VanderWeele, T. J. and I. Shpitser. 2011. “A New Criterion for Confounder Selection.” Biometrics 67:1406–13.10.1111/j.1541-0420.2011.01619.xSearch in Google Scholar PubMed PubMed Central

Supplementary Material

The online version of this article offers supplementary material (https://doi.org/10.1515/em-2019-0028).

Received: 2019-08-29

Accepted: 2020-03-23

Published Online: 2020-05-04

You are currently not able to access this content.

Supplementary Material Details

Articles in the same Issue

https://doi.org/10.1515/em-2019-0028

Keywords for this article

correlation; model uncertainty; optimization; propensity