Skip to main navigation Skip to search Skip to main content

Development and validation of a predictive model for detection of colorectal cancer in primary care by analysis of complete blood counts: A binational retrospective study

Yaron Kinar, Nir Kalkstein, Pinchas Akiva, Bernard Levin, Elizabeth E. Half, Inbal Goldshtein, Gabriel Chodick, Varda Shalev

Research output: Contribution to journalArticlepeer-review

Abstract

Objective The use of risk prediction models grows as electronic medical records become widely available. Here, we develop and validate a model to identify individuals at increased risk for colorectal cancer (CRC) by analyzing blood counts, age, and sex, then determine the model's value when used to supplement conventional screening.Materials and Methods Primary care data were collected from a cohort of 606 403 Israelis (of whom 3135 were diagnosed with CRC) and a case control UK dataset of 5061 CRC cases and 25 613 controls. The model was developed on 80% of the Israeli dataset and validated using the remaining Israeli and UK datasets. Performance was evaluated according to the area under the curve, specificity, and odds ratio at several working points.Results Using blood counts obtained 3-6 months before diagnosis, the area under the curve for detecting CRC was 0.82 ± 0.01 for the Israeli validation set. The specificity was 88 ± 2% in the Israeli validation set and 94 ± 1% in the UK dataset. Detecting 50% of CRC cases, the odds ratio was 26 ± 5 and 40 ± 6, respectively, for a false-positive rate of 0.5%. Specificity for 50% detection was 87 ± 2% a year before diagnosis and 85 ± 2% for localized cancers. When used in addition to the fecal occult blood test, our model enabled more than a 2-fold increase in CRC detection.Discussion Comparable results in 2 unrelated populations suggest that the model should generally apply to the detection of CRC in other groups. The model's performance is superior to current iron deficiency anemia management guidelines, and may help physicians to identify individuals requiring additional clinical evaluation.Conclusions Our model may help to detect CRC earlier in clinical practice.

Original languageEnglish
Pages (from-to)879-890
Number of pages12
JournalJournal of the American Medical Informatics Association
Volume23
Issue number5
DOIs
StatePublished - 1 Sep 2016

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Keywords

  • Colorectal cancer
  • Early detection of cancer
  • Electronic medical records
  • Machine learning
  • Primary health care
  • Risk prediction

All Science Journal Classification (ASJC) codes

  • Health Informatics

Fingerprint

Dive into the research topics of 'Development and validation of a predictive model for detection of colorectal cancer in primary care by analysis of complete blood counts: A binational retrospective study'. Together they form a unique fingerprint.

Cite this