Mock Case — Survival Extrapolation in a Partitioned Survival Model
Treatment Effect Waning and Recursive Hazard Chaining: Isavetinib in Relapsed/Refractory Multiple Myeloma
Author
Xiaoge Zhang, PhD
Published
June 1, 2026
ImportantMock Case Disclosure
This is a methodological demonstration using fully synthetic parameters. The clinical scenario — Isavetinib versus standard of care (SoC) in Relapsed/Refractory Multiple Myeloma (RRMM) — is fictional and designed to expose a real methodological challenge in oncology HTA submissions: how to model treatment effect waning beyond the trial observation window in a biologically defensible way.
1 Executive Summary
Standard oncology HTA submissions in England require a 30-year (lifetime) model horizon, yet pivotal trials for RRMM typically provide 3–5 years of follow-up data. This creates the extrapolation problem: how should the treatment benefit observed during the trial period be projected forward?
A common, computationally simple approach is to apply a constant Hazard Ratio (HR) at the trial cut-off point. This case demonstrates that this approach generates a mathematically discontinuous “Hazard Cliff” at the extrapolation boundary — a structural artefact that NICE Evidence Assessment Groups (EAGs) routinely flag as biologically implausible.
This case study develops and validates an alternative: Recursive Hazard Chaining, which anchors the long-term extrapolation to the trial-period hazard at the cut-off point and applies a clinically motivated linear waning function. The result is a smooth, continuous survival trajectory that is far more defensible in front of an appraisal committee.
Base-case results: ICER = £53,514 / QALY (\(\Delta C\): £34,761 | \(\Delta E\): 0.649 QALYs). The full probabilistic sensitivity analysis (1,000 iterations) yields a 51% probability of cost-effectiveness at the £50,000 severity-adjusted threshold.
2 Decision Problem & Clinical Background
Multiple myeloma is a haematological malignancy of plasma cells with a median age at diagnosis of approximately 70 years. Virtually all patients eventually relapse, and each successive line of therapy is associated with shorter remission and reduced performance status. In the RRMM setting, the key clinical endpoint is Progression-Free Survival (PFS), with Overall Survival (OS) as the co-primary or secondary endpoint.
Isavetinib (fictional) is a novel oral kinase inhibitor being evaluated against a SoC backbone in the third-line RRMM setting. A company-sponsored trial provided 60 months of individual patient follow-up. The modelled base-case parameters are calibrated to reflect a realistic RRMM scenario:
SoC median OS: ~24 months
Isavetinib median OS: ~50 months (HR ≈ 0.55)
Model horizon: 30 years (360 monthly cycles)
The central methodological challenge is that the treatment benefit observed at Month 60 cannot plausibly be assumed to persist unchanged at its trial-period magnitude into Years 10, 20, and 30.
3 The Partitioned Survival Model: Mathematical Framework
3.1 Three-State Structure
A Partitioned Survival Model (PSM) is defined by two independent survival functions — PFS and OS — that jointly determine the proportion of the cohort in each of three mutually exclusive health states at every model cycle.
Notation. Let \(S_{\text{PFS}}(t)\) and \(S_{\text{OS}}(t)\) denote the survival functions for progression-free survival and overall survival respectively, where \(S(t) \in [0,1]\) is the probability of not having experienced the corresponding event by time \(t\). Let \(\pi_k(t)\) denote the state occupancy probability for health state \(k\) at time \(t\) — the proportion of the cohort residing in state \(k\) at cycle \(t\). The key structural feature of a PSM is that state occupancy is read directly off the survival curves, without requiring an explicit transition probability matrix:
Two constraints are non-negotiable. First, the three states must sum to exactly 1 at every cycle — this follows directly from the definitions above and requires no additional assumption. Second, OS cannot fall below PFS at any cycle, as a patient cannot be progression-free and dead simultaneously:
Violations of the second constraint — which can arise from naive extrapolation — must be corrected programmatically. Any model submitted to NICE without this safeguard will be immediately flagged by the EAG.
3.2 Weibull Survival Function
Each survival curve is parameterised as a Weibull function calibrated to the monthly cycle structure:
where \(t\) is time in months, \(\lambda > 0\) is the scale parameter, and \(\alpha > 0\) is the shape parameter. When \(\alpha > 1\), the hazard is monotonically increasing — consistent with the progressive nature of RRMM. All parameters are specified on the monthly scale to ensure dimensional consistency; a common submission error is to use parameters calibrated to years applied to monthly cycle lengths without conversion.
3.3 QALY and Cost Accumulation
QALY gains and costs are accumulated each cycle using the area-under-the-curve method, with continuous discounting at an annual rate \(r\):
SoC (standard of care) Weibull parameters on the monthly scale. Calibrated to a median OS of approximately 24 months.
2
Isavetinib Weibull parameters. Calibrated to a median OS of approximately 50 months, reflecting an HR of ~0.55 versus SoC.
3
Trial data cut-off at Month 60: the point at which the extrapolation period begins and treatment effect waning is applied.
4
Full waning by Month 120: clinical assumption that Isavetinib benefit fully dissipates 5 years after the trial horizon — consistent with observed TKI durability in haematological malignancies.
5
Thirty-year model horizon aligned with NICE reference case.
6
Health state utilities from published RRMM utility literature (illustrative).
7
Isavetinib monthly cost includes drug acquisition plus administration, reflecting a realistic oral oncology price point.
8
Post-progression costs are assumed equal between arms, consistent with assumed equivalent subsequent therapy access.
Model parameters: Weibull survival (monthly scale) and economic inputs
Parameter
Value
SoC PFS: alpha / lambda
1.20 / 0.040
SoC OS: alpha / lambda
1.10 / 0.018
Isavetinib PFS: alpha / lambda
1.10 / 0.025
Isavetinib OS: alpha / lambda
1.05 / 0.010
Waning window (months)
Month 60 → 120
Model horizon (months)
360 (30 years)
Utility: PFS / PD
0.750 / 0.500
Monthly cost PFS: Isavetinib / SoC (£)
£4,700 / £1,200
Monthly cost PD, both arms (£)
£1,800
Annual discount rate
3.5%
4 The Extrapolation Challenge: Beyond Trial Data
At Month 60, the trial ends. The model must continue for another 300 months. A seemingly natural solution is to take the HR observed at Month 60 and apply it as a fixed constant for the remainder of the time horizon. This is the Naive HR Switching approach.
The problem is mathematical. In the Weibull framework, applying a constant HR at the cut-off is equivalent to writing:
\[S_{\text{trt}}(t) = \left[S_{\text{soc}}(t)\right]^{\text{HR}} \quad \text{for } t > t^*\]
where \(t^*\) is the cut-off point. But the trial-period curve at \(t^*\) is:
This equality holds only when the HR equals the ratio of the Weibull scale parameters — which is almost never the case in a real trial. The mismatch creates a jump discontinuity in the survival function at \(t^*\), which is biologically impossible: survival probability cannot instantaneously change at a single point in time.
5 Naive HR Switching vs. Recursive Hazard Chaining
5.1 The Naive Approach: Implied HR Calculation
To make a fair comparison, the HR used in the Naive approach is not an arbitrary constant but the implied HR derived from the trial curves at the cut-off:
This anchors both approaches at the same empirical HR value. The divergence arises entirely from how that HR is applied going forward.
5.2 Recursive Hazard Chaining: Algorithm
The Recursive Hazard Chaining algorithm replaces the discontinuous HR switch with a smooth, monotonically increasing function. Three components define it:
Step 1 — Anchor. Compute the implied HR at the cut-off:
Step 2 — Linear waning. At each post-trial cycle \(t > t^*\), the HR decays linearly toward 1.0 (no benefit) over a clinically justified waning window \([t^*, t_{\text{end}}]\):
Step 3 — Recursive chaining. Rather than computing the absolute survival level from SoC, the treatment arm is advanced by the incremental SoC hazard raised to the current HR:
Anchor HR computed at index 60 (corresponding to Month 60). The + 1 offset is because R vectors are 1-indexed.
3
Naive formula: \(S_{\text{trt}}(t) = S_{\text{soc}}(t)^{\widehat{\text{HR}}}\). Applied as a constant from Month 61 onward — this is what creates the jump.
4
Linear progress of waning from 0.0 at Month 60 to 1.0 at Month 120. min(1, ...) ensures the HR does not fall below 1.0 after full waning.
5
Current HR interpolated between the anchored trial HR and 1.0 (no treatment benefit).
6
The recursive chaining formula. Each step advances the treatment arm using the incremental SoC hazard scaled by the current waning HR. Continuity is guaranteed because \(S_{\text{trt}}(t)\) is always derived from \(S_{\text{trt}}(t-1)\).
7
Hard floor ensuring OS \(\geq\) PFS. Without this, extrapolation artefacts can produce an impossible state where more patients are progression-free than are alive.
Implied HR at anchor (Month 60): OS = 0.432 | PFS = 0.381
HR at Month 120 (end of waning): OS = 1.000 | PFS = 1.000
Implied OS Hazard Ratio trajectory: Naive HR Switching applies a constant HR from Month 61 (flat line), while Recursive Chaining smoothly interpolates from the anchored HR back to 1.0 at Month 120.
The base-case ICER of approximately £53,500/QALY exceeds the standard £30,000 threshold but falls within the range where NICE Severity Weighting may apply. Under the 2022 NICE methods update, if the proportional QALY shortfall for RRMM patients exceeds 0.85, a weighting factor of 1.7 applies — effectively reducing the severity-adjusted ICER to approximately £31,500/QALY, which would cross the actionable threshold.
7 Probabilistic Sensitivity Analysis
Parameter uncertainty is propagated through 1,000 PSA iterations, sampling from distributions defined by the standard errors of the Weibull parameters and the literature-derived utility and cost inputs.
Cost-effectiveness plane (1,000 PSA iterations). The majority of iterations fall in the north-east quadrant, indicating Isavetinib is more effective and more costly than SoC. The dashed lines show the £30,000 and £50,000 willingness-to-pay thresholds.
Cost-Effectiveness Acceptability Curve (CEAC). The curve shows the probability that Isavetinib is cost-effective across a range of willingness-to-pay thresholds. At £50,000 (severity-adjusted threshold), the probability of cost-effectiveness exceeds 50%.
8 NICE Submission Considerations
Three strategic levers are available to improve the submission position for Isavetinib:
Severity Weighting. Under the NICE 2022 methods update, treatments for conditions with a high absolute QALY shortfall qualify for a severity weighting multiplier (1.2x or 1.7x). RRMM patients accumulate a substantial QALY shortfall relative to the age-matched general population. Demonstrating that the shortfall exceeds the 0.85 proportional threshold would move the effective cost-effectiveness threshold to £51,000/QALY, at which point the base-case ICER becomes approvable without Patient Access Scheme discounts.
Patient Access Scheme. If severity weighting is not granted, a confidential simple discount of approximately 17–20% on the Isavetinib list price would be sufficient to bring the ICER below £30,000/QALY. The PSA results show that at a £30,000 threshold, approximately 17% of iterations are already cost-effective, meaning even modest price reductions would generate a favourable CEAC profile.
Subsequent Therapy Mix Sensitivity. The model assumes equivalent post-progression therapy access in both arms. If the EAG proposes that Isavetinib patients access more expensive fourth-line agents (due to better performance status at progression), the incremental cost would increase. A scenario analysis with a 10% shift toward high-cost monoclonal antibody regimens in the post-progression state should be pre-prepared as part of the submission.
9 Auditor Stress Test: Strategic Q&A
Q1: Why does Recursive Hazard Chaining use a linear waning function rather than a more sophisticated parametric form?
The choice of waning shape involves a fundamental tension between statistical identifiability and biological plausibility. In the absence of mature long-term data, more complex waning models (e.g., exponential, log-logistic) add free parameters that cannot be estimated from the trial data and whose uncertainty is difficult to characterise in a PSA. A linear waning function requires only two pre-specified anchor points — the start and end of the waning window — which can be justified clinically by reference to the mechanism of action and the expected durability of the drug class. In a NICE submission, the waning window should be defined in the statistical analysis plan and defended by clinical advisory board consensus, not post-hoc curve fitting.
Q2: What happens to the ICER if the waning window is shortened from 5 years to 2 years?
Shortening the waning window reduces the duration over which Isavetinib provides benefit beyond the trial period, which reduces the incremental QALY gain and increases the ICER. This is the primary scenario where an EAG would challenge the extrapolation assumptions — if the clinical advisory board believes TKI benefit wanes quickly in RRMM, a 2-year window is a defensible alternative. The submission should include a scenario analysis varying the waning end point from 1 year to 10 years post-trial cut-off, alongside the base-case 5-year assumption.
Q3: Why is the OS \(\geq\) PFS constraint applied as a hard floor rather than re-parameterising the survival functions to guarantee the constraint analytically?
Re-parameterising the Weibull functions to analytically enforce the OS \(\geq\) PFS constraint would require either a shared-parameter model or the use of copula functions to jointly model PFS and OS — both of which are substantially more complex and less transparent in a NICE submission context. The hard floor is a pragmatic, auditable correction that is visible in the code and easy for the EAG to verify. Where violations occur (typically in the very long tail of the extrapolation), they represent a negligible proportion of the surviving cohort and have a trivial impact on the ICER.
Analysis code available at github.com/Qsev/XGZ-HEOR-Portfolio. Methodological standards: NICE DSU Technical Support Document 19 (Partitioned Survival Models); NICE Technology Appraisals Methods Guide (2022).