Academic Paper

Development and validation of risk prediction model for colorectal cancer in patients with gastrointestinal symptoms
Document Type
Electronic Thesis or Dissertation
Author
Source
Subject
colorectal cancer
gastrointestinal symptoms
CRC
Early CRC diagnosis
cancer deterioration
CRC risk
Language
English
Abstract
Background: Colorectal cancer (CRC) is a malignant tumour that grows in the colon and/or rectum. According to global burden of disease estimates, in 2020 CRC was the third most common cancer and the second leading cause of cancer-related death worldwide, with approximately 1.9 million incident cases and 0.9 million deaths. The overall 5-year survival rate of CRC is approximately 60%, with an increasing trend over time. Early CRC diagnosis and timely treatment could largely prevent cancer deterioration. Early diagnosis could be enhanced through risk prediction models. Prediction model research includes model development (selecting influential predictor variables and estimating regression coefficients for a multivariable prediction model), validation (internal and external), and impact studies (evaluation of the model's clinical validity and utility). It is of great clinical importance to identify risk factors with substantial predictive value, to develop and validate risk prediction models with strong prediction performance, and to improve the clinical impact and usefulness of risk prediction models. There is a clear need for systematic investigation and appraisal of risk prediction models and risk factors for CRC.
Therefore, this thesis 1) conducted an umbrella review to summarise and evaluate risk factors and risk prediction models for CRC prognosis, specifically metastasis and recurrence, and to examine the extent to which prediction models include the most influential factors; 2) examined the association between CRC risk and a constellation of demographic features, clinical features, and genetic risk correlates in patients with symptoms and signs related to CRC; explored the predictive value of risk factors as a group in forecasting CRC risk; and developed, validated, and evaluated risk prediction models that incorporated significant predictors for individual prediction of CRC risk in patients with symptoms; and 3) evaluated the prognostic factors for rectal cancer survival outcomes and estimated the probabilities of rectal cancer patients surviving over follow-up time.
Methods: An umbrella review was conducted to synthesise and evaluate risk factors and risk prediction models of CRC metastasis and recurrence. The umbrella review summarised the magnitude, direction, and significance of identified associations and effects, evaluated the credibility of the evidence for each risk factor, and categorised the evidence as convincing, highly suggestive, suggestive, or weak. In addition, a comparative cross-assessment between risk factors evaluated in the umbrella review and risk predictors included in existing prediction models was performed to investigate the extent to which prediction models include the most influential factors. The cross-assessment compared the magnitude of the summary relative risk and noted how many of those represented at least 3-fold changes in the odds of the outcome and how many had convincing or highly suggestive evidence in the assessment. The methodological quality and risk of bias were assessed using the Assessment of Multiple Systematic Reviews 2.0 (AMSTAR 2.0) checklist.
A CRC risk prediction analysis was conducted in the Study of Colorectal Cancer in Scotland (SOCCS) (N=834) and the Lothian Bowel Symptom Study (LABSS) (N=820). SOCCS is a case-control study that started in 1999, recruiting incident CRC cases (aged 16 years and over) and healthy controls matched on age, sex, and health board across the regions of Scotland. Only symptomatic CRC cases from SOCCS were used in the analysis discussed herein. LABSS is a multi-centre prospective cohort study that started in 2017, recruiting patients (aged 18 years and over) with bowel symptoms through the endoscopy, CT scanning, colorectal surgery, and gastroenterology units within NHS recruiting centres across Scotland. To conduct the risk prediction analysis, I summarised the basic characteristics of SOCCS and LABSS, compared CRC cases and controls, and conducted univariable and multivariable logistic regression analyses for CRC risk. Following that, I explored the predictive value of variables as a group in forecasting CRC risk by building multivariable risk prediction models. CRC prediction models were developed with internal validation (N=1352; cases: n=818; controls: n=534). Candidate predictors included age, sex, BMI, a weighted genetic risk score (wGRS) of 113 single nucleotide polymorphisms (SNPs), family history, and symptoms (change of bowel habit, rectal bleeding, weight loss, anaemia, abdominal pain). The two main strategies for developing the final model are predictor selection and the full model approach (Royston et al., 2009). In the predictor selection approach, models A (baseline model + wGRS) and B (baseline model) were developed using the least absolute shrinkage and selection operator (LASSO) regression algorithm to select predictors. In the full model approach, models C (baseline model + wGRS) and D (baseline model) were built using all the variables.
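As a rough illustration of the wGRS predictor mentioned above, the sketch below computes a weighted genetic risk score in the commonly used way, as the sum of risk-allele counts multiplied by the log odds ratio of each SNP. The abstract does not state the weighting scheme used in the thesis, so the log odds ratio weights, the function name `weighted_grs`, and the three-SNP example values are all assumptions made for illustration only.

```python
import math


def weighted_grs(allele_counts, odds_ratios):
    """Sum of risk-allele counts weighted by per-SNP log odds ratios.

    Assumption: weights are natural-log odds ratios; the thesis abstract
    does not specify its weighting scheme.
    """
    if len(allele_counts) != len(odds_ratios):
        raise ValueError("one odds ratio is required per SNP")
    score = 0.0
    for count, odds_ratio in zip(allele_counts, odds_ratios):
        if count not in (0, 1, 2):
            raise ValueError("risk-allele counts must be 0, 1, or 2")
        score += count * math.log(odds_ratio)
    return score


# Hypothetical three-SNP example (the thesis score uses 113 SNPs).
example_counts = [0, 1, 2]
example_odds_ratios = [1.10, 1.20, 1.05]
example_score = weighted_grs(example_counts, example_odds_ratios)
print(round(example_score, 4))
```

In a model such as A or C, a score of this kind would enter the logistic regression as a single continuous predictor alongside the baseline variables.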
The models' prediction performance (calibration and discrimination) was evaluated using the Hosmer-Lemeshow (HL) test, with calibration curves plotted, and Harrell's C-statistic, with receiver operating characteristic curves plotted. Corrected C-statistics were calculated based on bootstrap validation (1,000 bootstrap resamples). The models' prediction performance was cross-assessed in the sensitivity analysis. An online nomogram for the final model was built using Shiny.apps. The clinical usefulness of the risk prediction nomogram was tested by decision curve and clinical impact curve analyses.
Survival analysis of rectal cancer was conducted using data from the Rectal Cancer cohort study (2008-2012), which prospectively recruited patients (N=287) who underwent surgical resection for a primary rectal adenocarcinoma via the Lothian Colorectal Cancer MDM at the Western General Hospital. All patients underwent regular follow-up until 5 years after surgery. Baseline characteristics of the study were described: demographic characteristics (age, sex), cancer stage, cancer histopathology (tumour differentiation, extramural vascular invasion [EMVI], lymph node, CRM involvement), clinical treatment (radiotherapy, chemotherapy, surgery), number of deaths, and number of local or distant recurrences were summarised. Univariable and multivariable Cox regression models were fitted to estimate the effects of prognostic factors (the covariates listed above) on rectal cancer outcomes, including local recurrence, distant recurrence, recurrence-free survival (RFS), and overall survival (OS). Hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated. Finally, Kaplan-Meier estimates (probabilities of rectal cancer patients in this cohort study surviving over follow-up time) were calculated and survival curves were simulated.
Results: The umbrella review comprised 51 unique meta-analyses of observational studies investigating 34 risk factors for CRC metastasis and 17 risk factors for recurrence.
Twelve of 34 risk factors were estimated to change the odds of the outcome at least 3-fold for CRC metastasis with P<0.05. Only one risk factor (vascular invasion for lymph node metastasis [LNM] in pT1 CRC) presented convincing evidence. Five risk factors presented highly suggestive evidence for CRC metastasis. Four of 17 risk factors were estimated to change the odds of the outcome at least 3-fold for CRC recurrence with P<0.05. No risk factor presented convincing evidence and four risk factors presented highly suggestive evidence for CRC recurrence. This study updated the synthesis of risk prediction models for CRC metastasis (n=12) and recurrence (n=12) and then conducted a cross-assessment of individual risk factors evaluated in the umbrella review and of risk predictors included in existing prediction models.
Conclusion: This thesis presents a comprehensive and thorough investigation of CRC risk factors and risk prediction models. The umbrella review investigated 34 risk factors for CRC metastasis and 17 risk factors for recurrence. Convincing evidence exists for the association between vascular invasion and LNM in pT1 tumours. Cross-assessment between individual risk factors and risk predictors applied in existing prediction models (metastasis: n=12; recurrence: n=12) suggests that future risk prediction model research would benefit from applying a more rigorous and systematic model construction process to integrate influential risk factors following evidence-based methods. In the CRC risk prediction modelling study, prediction models were developed with internal validation, showing good performance in both calibration and discrimination. However, due to limited data availability, the CRC prediction models have not been externally validated. The sensitivity analysis demonstrated that integrating genetic architecture into the classical CRC prediction model could improve prediction performance.
This could help identify a subpopulation with higher CRC risk due to genetic susceptibility. The findings merit further investigation through external validation of the models and assessment of their clinical impact. The survival analysis of rectal cancer verified that adverse tumour pathology features (tumour differentiation, positive lymph nodes, EMVI) were the main prognostic factors for rectal cancer recurrence and survival outcomes. In summary, the research work in this thesis could help with clinical decision-making on the relative priority of risk factors and predictors in terms of their impact on CRC development and prognosis. In addition, with the dedicated CRC prediction model, patients and clinicians can be informed about individualised prediction of CRC risk, which could guide personalised clinical care to improve patients' cancer outcomes.
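For readers unfamiliar with the discrimination and decision curve measures referred to in the abstract, the sketch below shows one simple way to compute an apparent C-statistic for a binary outcome and the net benefit at a single threshold probability. It is an illustrative assumption rather than the thesis code: the function names, the toy outcomes and predicted risks are hypothetical, and no bootstrap optimism correction is applied here, unlike the corrected C-statistics reported in the thesis.

```python
def c_statistic(outcomes, risks):
    """Apparent C-statistic for a binary outcome.

    Proportion of case/control pairs in which the case has the higher
    predicted risk; tied pairs count as one half.
    """
    cases = [r for y, r in zip(outcomes, risks) if y == 1]
    controls = [r for y, r in zip(outcomes, risks) if y == 0]
    if not cases or not controls:
        raise ValueError("need at least one case and one control")
    concordant = 0.0
    for case_risk in cases:
        for control_risk in controls:
            if case_risk > control_risk:
                concordant += 1.0
            elif case_risk == control_risk:
                concordant += 0.5
    return concordant / (len(cases) * len(controls))


def net_benefit(outcomes, risks, threshold):
    """Net benefit at one threshold probability, as used in decision curves.

    net benefit = TP/n - FP/n * threshold / (1 - threshold)
    """
    if not 0 < threshold < 1:
        raise ValueError("threshold must lie strictly between 0 and 1")
    n = len(outcomes)
    true_pos = sum(1 for y, r in zip(outcomes, risks) if r >= threshold and y == 1)
    false_pos = sum(1 for y, r in zip(outcomes, risks) if r >= threshold and y == 0)
    return true_pos / n - false_pos / n * threshold / (1 - threshold)


# Hypothetical toy data: 1 = CRC case, 0 = control.
toy_outcomes = [1, 1, 1, 0, 0, 0]
toy_risks = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2]
toy_c = c_statistic(toy_outcomes, toy_risks)
toy_nb = net_benefit(toy_outcomes, toy_risks, threshold=0.5)
print(round(toy_c, 3), round(toy_nb, 3))
```

A decision curve repeats the net benefit calculation over a range of thresholds and compares the model with the "refer all" and "refer none" strategies.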
