Advanced Search

CN 34-1304/RISSN 1674-3679

Volume 28 Issue 9
Sep.  2024
Turn off MathJax
Article Contents
DING Hongmei, ZHANG Mingya, XU Xiaoqin, ZHANG Hongxiu. Machine learning and Cox proportional hazards regression model for warning of persistent infection with high-risk HPV type[J]. CHINESE JOURNAL OF DISEASE CONTROL & PREVENTION, 2024, 28(9): 1083-1089. doi: 10.16462/j.cnki.zhjbkz.2024.09.014
Citation: DING Hongmei, ZHANG Mingya, XU Xiaoqin, ZHANG Hongxiu. Machine learning and Cox proportional hazards regression model for warning of persistent infection with high-risk HPV type[J]. CHINESE JOURNAL OF DISEASE CONTROL & PREVENTION, 2024, 28(9): 1083-1089. doi: 10.16462/j.cnki.zhjbkz.2024.09.014

Machine learning and Cox proportional hazards regression model for warning of persistent infection with high-risk HPV type

doi: 10.16462/j.cnki.zhjbkz.2024.09.014
Funds:

Scientific Research Project of Jiangsu Maternal and Child Health Care Association FYX202345

More Information
  • Corresponding author: ZHANG Hongxiu, E-mail: hongxiuz@njmu.edu.cn
  • Received Date: 2023-08-31
  • Rev Recd Date: 2024-03-16
  • Available Online: 2024-10-24
  • Publish Date: 2024-09-10
  •   Objective  A prediction model of human papillomavirus based on machine learning was established to determine the factors associated with the persistent infection of high-risk human papilloma virus(HR-HPV), so as to provide early warning for the persistent infection of HR-HPV.  Methods  Clinical data of 4 407 women who participated in HPV testing at four health centers in Taizhou City from September 2017 to September 2019 and participated in HPV follow-up from September 2020 to September 2022 were collected. The demographic characteristics of total 4 407 subjects in this cohort study were used as the input of the machine learning model, and the change process of the results of the two HPV inspections as the output, a prediction model based on machine learning was established, including random forest and multi-layer perceptron, to predict the HPV follow-up results of the research object. Univariate Cox risk proportion regression model and multivariate Cox risk proportion regression model were used to statistically analyze 583 primary screening HR-HPV positive cases.  Results  The accuracy of the random forest prediction model was 84.3%, and the accuracy of the multi-layer perceptron was 80.5%. The top five viral types with persistent positive rate of HR-HPV were HPV58, multiple infections, HPV31, HPV33, and HPV52. The multivariate Cox regression analysis showed that the conversion risk of HR-HPV infection in those with junior high school education or below was 1.72 times that of those with high school education and above (HR=1.72, 95% CI: 1.03-2.87, P=0.037), and the conversion risk of HR-HPV infection in non-menopausal individuals was 2.11 times higher than that in menopausal individuals (HR=2.11, 95% CI: 1.10-4.06, P=0.025).  Conclusions  Machine learning and Cox regression analysis models can provide an early warning of the HR-HPV persistent infection population, which has an important clinical value for the subsequent management of HR-HPV-infected women and the prevention and control of cervical cancer.
  • loading
  • [1]
    Singh D, Vignat J, Lorenzoni V, et al. Global estimates of incidence and mortality of cervical cancer in 2020: a baseline analysis of the WHO global cervical cancer elimination initiative[J]. Lancet Glob Health, 2023, 11(2): e197-e206. DOI: 10.1016/S2214-109X(22)00501-0.
    [2]
    中国子宫颈癌综合防控路径建设专家共识编写组, 中华预防医学会肿瘤预防与控制专业委员会. 中国子宫颈癌综合防控路径建设专家共识[J]. 中国预防医学杂志, 2022, 23(10): 721-726. DOI: 10.16506/j.1009-6639.2022.10.001.

    Expert Consensus Writing Group for Comprehensive Prevention and Control of Cervical Cancer in China, Cancer Prevention and Control Professional Committee of the Chinese Preventive Medicine Association. Expert consensus on the path construction toward comprehensive prevention and control for cervical cancer in China[J]. Chin Prev Med, 2022, 23(10): 721-726. DOI: 10.16506/j.1009-6639.2022.10.001.
    [3]
    李文先, 曹秀菁. 中国妇幼健康的进展与成效[J]. 中华疾病控制杂志, 2022, 26(9): 993-995, 1107. DOI: 10.16462/j.cnki.zhjbkz.2022.09.001.

    Li WX, Cao XJ. Progress and achievements in maternal and child health in China[J]. Chin J Dis Control Prev, 2022, 26(9): 993-995, 1107. DOI: 10.16462/j.cnki.zhjbkz.2022.09.001.
    [4]
    Perkins RB, Guido RS, Castle PE, et al. Erratum: 2019 ASCCP risk-based management consensus guidelines for abnormal cervical cancer screening tests and cancer precursors[J]. J Low Genit Tract Dis, 2021, 25(4): 330-331. DOI: 10.1097/LGT.0000000000000628.
    [5]
    Karamveer K, Tiwary BK. CarcinoHPVPred: an ensemble of machine learning models for HPV carcinogenicity prediction using genomic data[J]. Carcinogenesis, 2022: bgac079. DOI: 10.1093/carcin/bgac079.
    [6]
    Usyk M, Zolnik CP, Castle PE, et al. Cervicovaginal microbiome and natural history of HPV in a longitudinal study[J]. PLoS Pathog, 2020, 16(3): e1008376. DOI: 10.1371/journal.ppat.1008376.
    [7]
    Nayar R, Wilbur DC. The pap test and Bethesda 2014[J]. Cancer Cytopathol, 2015, 123(5): 271-281. DOI: 10.1002/cncy.21521.
    [8]
    卢朝辉, 陈杰. WHO女性生殖器官肿瘤学分类(第4版)解读[J]. 中华病理学杂志, 2014, 43(10): 649-650. DOI: 10.3760/cma.j.issn.0529-5807.2014.10.001.

    Lu ZH, Chen J. Interpretation of WHO classification of oncology of female genital organs (4th edition)[J]. Chin J Pathol, 2014, 43(10): 649-650. DOI: 10.3760/cma.j.issn.0529-5807.2014.10.001.
    [9]
    国家统计局. 国家统计局关于印发《三次产业划分规定》的通知[EB/OL]. (2003-05-14)[2024-06-13]. https://www.gov.cn/gongbao/content/2003/content_62360.htm.
    [10]
    周婕, 吴延莉, 王艺颖, 等. BMI水平及动态变化与高血压、糖尿病、血脂异常共病发生风险的前瞻性队列研究[J]. 中华疾病控制杂志, 2023, 27(12): 1421-1429. DOI: 10.16462/j.cnki.zhjbkz.2023.12.010.

    Zhou J, Wu YL, Wang YY, et al. A prospective cohort study of the BMI level and dynamic changes with the risk of comorbidities of hypertension, diabetes mellitus and dyslipidemia[J]. Chin J Dis Control Prev, 2023, 27(12): 1421-1429. DOI: 10.16462/j.cnki.zhjbkz.2023.12.010.
    [11]
    Stensen S, Kjaer SK, Jensen SM, et al. Factors associated with type-specific persistence of high-risk human papillomavirus infection: a population-based study[J]. Int J Cancer, 2016, 138(2): 361-368. DOI: 10.1002/ijc.29719.
    [12]
    Luo Q, Zeng X, Luo H, et al. Epidemiologic characteristics of high-risk HPV and the correlation between multiple infections and cervical lesions[J]. BMC Infect Dis, 2023, 23(1): 667. DOI: 10.1186/s12879-023-08634-w.
    [13]
    Lin HH, Zhang QR, Kong XG, et al. Machine learning prediction of antiviral-HPV protein interactions for anti-HPV pharmacotherapy[J]. Sci Rep, 2021, 11(1): 24367. DOI: 10.1038/s41598-021-03000-9.
    [14]
    Cotton SC, Sharp L, Seth R, et al. Lifestyle and socio-demographic factors associated with high-risk HPV infection in UK women[J]. Br J Cancer, 2007, 97(1): 133-139. DOI: 10.1038/sj.bjc.6603822.
    [15]
    Haddad J, Hasan F, Roumeih AH, et al. The psychosocial burden of anogenital warts on Syrian patients: study of quality of life[J]. Heliyon, 2022, 8(7): e09816. DOI: 10.1016/j.heliyon.2022.e09816.
    [16]
    Shen Y, Xia J, Li HH, et al. Human papillomavirus infection rate, distribution characteristics, and risk of age in pre- and postmenopausal women[J]. BMC Womens Health, 2021, 21(1): 80. DOI: 10.1186/s12905-021-01217-4.
    [17]
    韦晓宁, 徐馨宇, 王少为. 2018—2020年广西某医院就诊的中老年女性人乳头瘤病毒感染情况及分布特点[J]. 中华预防医学杂志, 2022, 56(4): 468-473. DOI: 10.3760/cma.j.cn112150-20210929-00931.

    Wei XN, Xu XY, Wang SW. Infection and distribution characteristics of HPV of middle-aged and elderly women from a certain hospital in Guangxi Zhuang Autonomous Region from 2018 to 2020[J]. Chin J Prev Med, 2022, 56(4): 468-473. DOI: 10.3760/cma.j.cn112150-20210929-00931.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(2)  / Tables(2)

    Article Metrics

    Article views (136) PDF downloads(29) Cited by()
    Proportional views
    Related

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return