Abstract
The proportional subdistribution hazards (PSH) model has been widely employed for analyzing competing risks data which have mutually exclusive events with multiple causes and commonly occur in clinical research. With the rapid development of healthcare industry, massively sized survival data sets are becoming increasingly prevalent and classical PSH models are computationally intensive with large data sets. In this article, we propose the optimal subsampling estimators and two-step algorithm for the Fine-Gray model. Asymptotic properties of the proposed estimators are established and an extensive simulation study is conducted to demonstrate the efficiency of the estimators. Our proposed methodology is then illustrated with the large dataset from the SEER (Surveillance, Epidemiology, and End Results) database.
Original language | English |
---|---|
Pages (from-to) | 361-377 |
Number of pages | 17 |
Journal | Statistics and Its Interface |
Volume | 18 |
Issue number | 3 |
Early online date | 17 Jan 2025 |
DOIs | |
Publication status | E-pub ahead of print - 17 Jan 2025 |
Keywords
- Big data
- Competing risks data
- Optimal subsampling