TY - JOUR
T1 - A high positive predictive value algorithm using hospital administrative data identified incident cancer cases
AU - Baldi, Ileana
AU - Vicari, Piera
AU - Di Cuonzo, Daniela
AU - Zanetti, Roberto
AU - Pagano, Eva
AU - Rosato, Rosalba
AU - Sacerdote, Carlotta
AU - Segnan, Nereo
AU - Merletti, Franco
AU - Ciccone, Giovannino
PY - 2008/4
Y1 - 2008/4
N2 - Objective: We have developed and validated an algorithm based on Piedmont hospital discharge abstracts for ascertainment of incident cases of breast, colorectal, and lung cancer. Study Design and Setting: The algorithm training and validation sets were based on data from 2000 and 2001, respectively. The validation was carried out at an individual level by linkage of cases identified by the algorithm with cases in the Piedmont Cancer Registry diagnosed in 2001. Results: The sensitivity of the algorithm was higher for lung cancer (80.8%) than for breast (76.7%) and colorectal (72.4%) cancers. The positive predictive values were 78.7%, 87.9%, and 92.6% for lung, colorectal, and breast cancer, respectively. The high values for colorectal and breast cancers were due to the model's ability to distinguish prevalent from incident cases and to the accuracy of surgery claims for case identification. Conclusions: Given its moderate sensitivity, this algorithm is not intended to replace cancer registration, but it is a valuable tool to investigate other aspects of cancer surveillance. This method provides a valid study base for timely monitoring cancer practice and related outcomes, geographic and temporal variations, and costs.
AB - Objective: We have developed and validated an algorithm based on Piedmont hospital discharge abstracts for ascertainment of incident cases of breast, colorectal, and lung cancer. Study Design and Setting: The algorithm training and validation sets were based on data from 2000 and 2001, respectively. The validation was carried out at an individual level by linkage of cases identified by the algorithm with cases in the Piedmont Cancer Registry diagnosed in 2001. Results: The sensitivity of the algorithm was higher for lung cancer (80.8%) than for breast (76.7%) and colorectal (72.4%) cancers. The positive predictive values were 78.7%, 87.9%, and 92.6% for lung, colorectal, and breast cancer, respectively. The high values for colorectal and breast cancers were due to the model's ability to distinguish prevalent from incident cases and to the accuracy of surgery claims for case identification. Conclusions: Given its moderate sensitivity, this algorithm is not intended to replace cancer registration, but it is a valuable tool to investigate other aspects of cancer surveillance. This method provides a valid study base for timely monitoring cancer practice and related outcomes, geographic and temporal variations, and costs.
KW - Breast cancer
KW - Colorectal cancer
KW - Hospital discharges
KW - Lung cancer
KW - Positive predictive value
KW - Sensitivity
UR - https://www.scopus.com/pages/publications/39649120542
U2 - 10.1016/j.jclinepi.2007.05.017
DO - 10.1016/j.jclinepi.2007.05.017
M3 - Article
SN - 0895-4356
VL - 61
SP - 373
EP - 379
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
IS - 4
ER -