We prove this using the Law of Iterated Expectations (LIE): \[E[Z] = E_X \big[ E[Z \mid X] \big]\]
Assumptions Required:
- Consistency: \(Y_i = D_i Y_i^{(1)} + (1 - D_i) Y_i^{(0)}\). For treated patients (\(D_i = 1\)), the observed outcome is the potential outcome under treatment: \(Y_i = Y_i^{(1)}\).
- Conditional Independence (Unconfoundedness): Causal potential outcomes are independent of treatment assignment given baseline covariates: \(Y^{(1)}, Y^{(0)} \perp D \mid X\).
Let \(\hat{g}(X)\) and \(\hat{m}_1(X)\) be our estimated models. The expectation of the AIPW estimator under treatment is: \[E\left[ \hat{Y}_{AIPW}^{(1)} \right] = E_X \left[ E\left[ \hat{m}_1(X_i) + \frac{D_i (Y_i - \hat{m}_1(X_i))}{\hat{g}(X_i)} \;\middle|\; X_i \right] \right]\]
Case 1: Propensity Model is Correct (\(\hat{g}(X) = g(X)\)), Outcome Model is Misspecified (\(\hat{m}_1(X) \neq m_1(X)\))
If the propensity score model is correct, we have \(E[D_i \mid X_i] = \hat{g}(X_i) = g(X_i)\).
Let’s evaluate the conditional expectation of the estimator’s terms given \(X_i\): \[E\left[ \hat{m}_1(X_i) + \frac{D_i (Y_i - \hat{m}_1(X_i))}{\hat{g}(X_i)} \;\middle|\; X_i \right] = \hat{m}_1(X_i) + \frac{E\left[ D_i (Y_i - \hat{m}_1(X_i)) \mid X_i \right]}{g(X_i)}\]
Using consistency (\(D_i Y_i = D_i Y_i^{(1)}\)) and conditioning: \[E\left[ D_i (Y_i - \hat{m}_1(X_i)) \mid X_i \right] = E\left[ D_i Y_i^{(1)} - D_i \hat{m}_1(X_i) \mid X_i \right]\]
By the Conditional Independence assumption (\(Y^{(1)} \perp D \mid X\)): \[E\left[ D_i Y_i^{(1)} \mid X_i \right] = E\left[ D_i \mid X_i \right] \cdot E\left[ Y_i^{(1)} \mid X_i \right] = g(X_i) E\left[ Y_i^{(1)} \mid X_i \right]\]
Similarly, since \(\hat{m}_1(X_i)\) is a deterministic function of \(X_i\): \[E\left[ D_i \hat{m}_1(X_i) \mid X_i \right] = E\left[ D_i \mid X_i \right] \cdot \hat{m}_1(X_i) = g(X_i) \hat{m}_1(X_i)\]
Substitute these expectations back into the numerator: \[E\left[ D_i (Y_i - \hat{m}_1(X_i)) \mid X_i \right] = g(X_i) E\left[ Y_i^{(1)} \mid X_i \right] - g(X_i) \hat{m}_1(X_i) = g(X_i) \left( E\left[ Y_i^{(1)} \mid X_i \right] - \hat{m}_1(X_i) \right)\]
Now substitute this back into the estimator: \[\hat{m}_1(X_i) + \frac{g(X_i) \left( E\left[ Y_i^{(1)} \mid X_i \right] - \hat{m}_1(X_i) \right)}{g(X_i)} = \hat{m}_1(X_i) + E\left[ Y_i^{(1)} \mid X_i \right] - \hat{m}_1(X_i) = E\left[ Y_i^{(1)} \mid X_i \right]\]
Taking the outer expectation over the covariate distribution \(X\): \[E\left[ \hat{Y}_{AIPW}^{(1)} \right] = E_X \Big[ E\left[ Y_i^{(1)} \mid X_i \right] \Big] = E\left[ Y_i^{(1)} \right]\]
Result: The estimate is unbiased. The incorrect predictions of the outcome model (\(\hat{m}_1(X_i)\)) are algebraically canceled out by the weighted residual term.
Case 2: Outcome Model is Correct (\(\hat{m}_1(X) = m_1(X)\)), Propensity Model is Misspecified (\(\hat{g}(X) \neq g(X)\))
If the outcome model is correctly specified, we have \(\hat{m}_1(X_i) = m_1(X_i) = E\left[ Y_i^{(1)} \mid X_i \right]\).
Let’s evaluate the conditional expectation of the residual correction term numerator given \(X_i\): \[E\left[ D_i (Y_i - \hat{m}_1(X_i)) \mid X_i \right] = E\left[ D_i (Y_i^{(1)} - E\left[ Y_i^{(1)} \mid X_i \right]) \mid X_i \right]\]
Conditioning further on treatment assignment \(D_i\) using the Law of Iterated Expectations: \[E\left[ D_i (Y_i^{(1)} - E\left[ Y_i^{(1)} \mid X_i \right]) \mid X_i \right] = E_{D \mid X} \left[ E\left[ D_i (Y_i^{(1)} - E\left[ Y_i^{(1)} \mid X_i \right]) \;\middle|\; X_i, D_i \right] \right]\]
Since \(D_i\) is binary, this expectation only has a non-zero value when \(D_i = 1\): \[P(D_i = 1 \mid X_i) \cdot E\left[ 1 \cdot (Y_i^{(1)} - E\left[ Y_i^{(1)} \mid X_i \right]) \;\middle|\; X_i, D_i = 1 \right]\]
By the Conditional Independence assumption (\(Y^{(1)} \perp D \mid X\)): \[E\left[ Y_i^{(1)} - E\left[ Y_i^{(1)} \mid X_i \right] \;\middle|\; X_i, D_i = 1 \right] = E\left[ Y_i^{(1)} \mid X_i \right] - E\left[ Y_i^{(1)} \mid X_i \right] = 0\]
Since the conditional expectation of the numerator is \(0\): \[E\left[ \frac{D_i (Y_i - \hat{m}_1(X_i))}{\hat{g}(X_i)} \;\middle|\; X_i \right] = 0\]
Therefore, the expectation of the estimator simplifies to: \[E\left[ \hat{Y}_{AIPW}^{(1)} \mid X_i \right] = E\left[ \hat{m}_1(X_i) \mid X_i \right] = E\left[ Y_i^{(1)} \mid X_i \right]\]
Taking the outer expectation over the covariate distribution \(X\): \[E\left[ \hat{Y}_{AIPW}^{(1)} \right] = E_X \Big[ E\left[ Y_i^{(1)} \mid X_i \right] \Big] = E\left[ Y_i^{(1)} \right]\]
Result: The estimate is unbiased. Since the outcome model is correct, the expected residual error is \(0\). The incorrect propensity weights (\(\hat{g}(X_i)\)) are multiplied by zero and fail to introduce any bias.