TY - JOUR
T1 - Machine learning as an adjunct to expert observation in classification of radiographic knee osteoarthritis
T2 - findings from the Hertfordshire Cohort Study
AU - Westbury, Leo D.
AU - Fuggle, Nicholas R.
AU - Pereira, Diogo
AU - Oka, Hiroyuki
AU - Yoshimura, Noriko
AU - Oe, Noriyuki
AU - Mahmoodi, Sasan
AU - Niranjan, Mahesan
AU - Dennison, Elaine M.
AU - Cooper, Cyrus
N1 - Funding Information:
The Hertfordshire Cohort Study was supported by the Medical Research Council University Unit Partnership grant number MRC_MC_UP_A620_1014. CC, EMD and LDW are supported by the UK Medical Research Council [MC_PC_21003; MC_PC_21001]. The funders had no role in the study design, collection, analysis and interpretation of data; in the writing of the manuscript; or in the decision to submit the manuscript for publication. For the purpose of open access, the author has applied a Creative Commons attribution license (CC BY) to any Author Accepted Manuscript version arising from this submission.
Publisher Copyright:
© 2023, The Author(s).
PY - 2023/7
Y1 - 2023/7
N2 - Background: Osteoarthritis is the most prevalent type of arthritis. Many approaches exist for characterising radiographic knee OA, including machine learning (ML). Aims: To examine Kellgren and Lawrence (K&L) scores from ML and expert observation, minimum joint space and osteophyte in relation to pain and function. Methods: Participants from the Hertfordshire Cohort Study, comprising individuals born in Hertfordshire from 1931 to 1939, were analysed. Radiographs were assessed by clinicians and ML (convolutional neural networks) for K&L scoring. Medial minimum joint space and osteophyte area were ascertained using the knee OA computer-aided diagnosis (KOACAD) program. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) was administered. Receiver operating characteristic analysis was implemented for minimum joint space, osteophyte, and observer- and ML-derived K&L scores in relation to pain (WOMAC pain score > 0) and impaired function (WOMAC function score > 0). Results: 359 participants (aged 71–80) were analysed. Among both sexes, discriminative capacity regarding pain and function was fairly high for observer-derived K&L scores [area under curve (AUC): 0.65 (95% CI 0.57, 0.72) to 0.70 (0.63, 0.77)]; results were similar among women for ML-derived K&L scores. Discriminative capacity was moderate among men for minimum joint space in relation to pain [0.60 (0.51, 0.67)] and function [0.62 (0.54, 0.69)]. AUC < 0.60 for other sex-specific associations. Discussion: Observer-derived K&L scores had higher discriminative capacity regarding pain and function compared to minimum joint space and osteophyte. Among women, discriminative capacity was similar for observer- and ML-derived K&L scores. Conclusion: ML as an adjunct to expert observation for K&L scoring may be beneficial due to the efficiency and objectivity of ML.
AB - Background: Osteoarthritis is the most prevalent type of arthritis. Many approaches exist for characterising radiographic knee OA, including machine learning (ML). Aims: To examine Kellgren and Lawrence (K&L) scores from ML and expert observation, minimum joint space and osteophyte in relation to pain and function. Methods: Participants from the Hertfordshire Cohort Study, comprising individuals born in Hertfordshire from 1931 to 1939, were analysed. Radiographs were assessed by clinicians and ML (convolutional neural networks) for K&L scoring. Medial minimum joint space and osteophyte area were ascertained using the knee OA computer-aided diagnosis (KOACAD) program. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) was administered. Receiver operating characteristic analysis was implemented for minimum joint space, osteophyte, and observer- and ML-derived K&L scores in relation to pain (WOMAC pain score > 0) and impaired function (WOMAC function score > 0). Results: 359 participants (aged 71–80) were analysed. Among both sexes, discriminative capacity regarding pain and function was fairly high for observer-derived K&L scores [area under curve (AUC): 0.65 (95% CI 0.57, 0.72) to 0.70 (0.63, 0.77)]; results were similar among women for ML-derived K&L scores. Discriminative capacity was moderate among men for minimum joint space in relation to pain [0.60 (0.51, 0.67)] and function [0.62 (0.54, 0.69)]. AUC < 0.60 for other sex-specific associations. Discussion: Observer-derived K&L scores had higher discriminative capacity regarding pain and function compared to minimum joint space and osteophyte. Among women, discriminative capacity was similar for observer- and ML-derived K&L scores. Conclusion: ML as an adjunct to expert observation for K&L scoring may be beneficial due to the efficiency and objectivity of ML.
KW - Artificial intelligence
KW - Epidemiology
KW - Kellgren and Lawrence
KW - Musculoskeletal
UR - http://www.scopus.com/inward/record.url?scp=85159697579&partnerID=8YFLogxK
U2 - 10.1007/s40520-023-02428-5
DO - 10.1007/s40520-023-02428-5
M3 - Article
C2 - 37202598
AN - SCOPUS:85159697579
SN - 1594-0667
VL - 35
SP - 1449
EP - 1457
JO - Aging Clinical and Experimental Research
JF - Aging Clinical and Experimental Research
IS - 7
ER -