Background

My previous blog concerned an observational cohort study reporting a hazard ratio (HR) for all-cause mortality of 0.35 (95% CI 0.21–0.58) in patients with breast cancer (GLP-1 RA vs. no treatment) and 0.09 (95% CI 0.06–0.15) in patients with type 2 diabetes (GLP-1 RA vs. insulin or metformin). These effect sizes are an order of magnitude larger than what has been observed with adjunctive therapy in randomised controlled trials and a simulation study confirmed that this magnitude of bias can be explained completely by immortal time.
This post concerns GLP-1 receptor agonist use and cancer risk in obese nondiabetic adults, where the effect sizes are similarly implausible and the immortal time bias is likely driving the results. The recurrent nature of this bias and its ability to surface in journals with an impact factor of 65 merits another post on this pernicious bias.

Immortal time bias

Immortal time bias may have both misclassification and selection bias components as discussed in detail in my previous blog. The exposure definition in this study is identical to the previous study, classified as GLP-1RA users if they had ≥2 prescriptions, with follow-up starting at the index date (first prescription). This lack of alignment leads to thesame structural error, but even arguably more severe than previous. Consider a patient who fills their first prescription in January 2023 and their second in March 2023 has two months of immortal time — they could not have developed cancer in that window and still been classified as exposed. Critically, the follow-up is only a median of 2 years with an IQR of 1–2 years. This is a very short window, which means the immortal time between prescription 1 and prescription 2 represents a much larger fraction of total follow-up time than it would in a 10-year study.

A target trial emulation claim

The authors prominently invoke the target trial emulation framework(1) as a means to control for immortal time, yet they do not actually implement it correctly. A genuine target trial emulation requires explicit alignment of eligibility, treatment assignment, and time zero. Here, as with the previous study, patients are classified as exposed after the index date based on accumulating a second prescription. The authors cite the framework as a methodological strength while committing the exact error the framework is designed to prevent. This is more than an oversight — invoking target trial emulation as a quality marker while not implementing it correctly misleads readers and reviewers.

The effect sizes again fail the sniff test — spectacularly

The overall HR of 0.59 is implausible enough. But the subgroup results are extraordinary:
• Men: HR 0.32 (PSM), 0.27 (IPTW)
• Tirzepatide: HR 0.31 (PSM), 0.26 (IPTW)
• Tirzepatide in IPTW: HR 0.26 (95% CI 0.17–0.39)
An HR of 0.26 means a 74% reduction in cancer incidence. No chemoprevention agent in the history of oncology has ever demonstrated anything approaching this magnitude for a composite of 13 cancers in a 2-year follow-up window. Tamoxifen reduces breast cancer incidence by roughly 38% in high-risk women after 5 years of use. Aspirin reduces colorectal cancer incidence by perhaps 20–30% after a decade. The claim that tirzepatide reduces all obesity-associated cancer incidence by 74% in 2 years, in a non-diabetic population, is not biologically credible and should immediately signal methodological artefact.