Statistical analysis process of health index construction and its implementation

SHI Jie; TIAN Xiang; WANG Ya-qian; LU Wei; WANG Han; WANG Yue; CHI Wei-wei

doi:10.16462/j.cnki.zhjbkz.2022.10.009


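SHI Jie^{1, 2},
TIAN Xiang⁴,
WANG Ya-qian²,
LU Wei²,
WANG Han⁵,
WANG Yue²,
CHI Wei-wei^{1, 2, 3}

1. Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Jinan 250012, China
2. National Institute of Health Data Science of China, Jinan 250003, China
3. National Administration of Health Data, Jinan 250002, China
4. North China Digital Health Technology Co., Ltd., Jinan 250117, China
5. Gynecology Center, Qingdao Women and Children's Hospital, Qingdao 266034, China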

Funding: National Key Research and Development Program of China (2020YFC2003500)

Corresponding author: CHI Wei-wei, E-mail: nahdyw@shandong.cn

CLC number: R181
Publication history
- Received: 2022-05-09
- Revised: 2022-08-22
- Published: 2022-10-10


Abstract

Objective: Drawing on comprehensive evaluation theory, to explore and summarize the statistical analysis process of health index construction and to develop an R package that implements it, giving health index research a fast, efficient evaluation tool that produces a highly integrated composite index and evaluation results in a single step, and thereby providing regulatory authorities with scientific, reliable evaluation information and a basis for decision-making. Methods: The EvaModels package was developed in R 4.1.3. Following the statistical analysis process of health index construction, the functions in the package were introduced as a whole and their parameters explained, the applicable scenarios of the various methods were compared, and the construction of a sustainable development index for public hospitals was used as a worked example. Results: Health index construction comprises four stages: determining the research theme of the index, constructing the evaluation indicator system, multi-indicator comprehensive evaluation, and visualization of the evaluation results. The EvaModels package has nine built-in functions that implement indicator screening, data standardization, indicator weighting, and comprehensive evaluation through a variety of methods; it meets the analysis needs of many evaluation problems and covers most of the statistical analysis process of health index construction. Conclusion: Through a set of functions, the EvaModels package automates and simplifies the statistical analysis workflow in health index construction. Its procedures and code are easy to interpret and call, which improves the convenience and operability of health index construction.

Keywords: Health index construction; Statistical analysis process; Comprehensive evaluation; R package


Figure 1. Flow chart of health index construction

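Table 1. Introduction to different weighting methods

Category	Method	Description	Applicable scenarios
Subjective weighting	Analytic hierarchy process (AHP)	A decision analysis method that combines qualitative and quantitative reasoning: the problem is decomposed into factors, and weights are derived from pairwise comparisons based on the decision-makers' experience.	Little or no concrete data are available; qualitative human judgment plays a major role and the decision outcome is hard to measure precisely.
Objective weighting	Entropy weight method	Measures the information carried by each indicator through the information entropy of its observed values; the less an indicator varies, the less information it conveys and the lower its weight.	Data are available, preferably all quantitative; the bottom-level indicators are finely divided and their weights are hard to set directly.
Objective weighting	Coefficient of variation weighting	Uses the raw indicator values: the ratio of the standard deviation to the mean measures each indicator's variability, and indicators with greater variability receive larger weights.	Indicators differ greatly in units and orders of magnitude, so their variances are not directly comparable.
Objective weighting	Principal component weighting	Reduces many original indicators to a few representative ones through linear combinations of the original variables.	Large data sets with many records and many dimensions.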
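
The entropy weight method described in Table 1 can be illustrated with a short sketch. It is written in Python rather than R, is independent of the EvaModels package, and assumes one common formulation: the input is already normalized and non-negative, each column is converted to shares, and the weight is proportional to one minus the normalized entropy. The function name `entropy_weights` and the toy data are hypothetical.

```python
import math

def entropy_weights(rows):
    """Entropy weights for non-negative, already normalized data.

    rows: one list per evaluation object, one value per indicator.
    Returns one weight per indicator; the weights sum to 1.
    Assumes at least one indicator actually varies across objects.
    """
    n = len(rows)
    m = len(rows[0])
    divergences = []
    for j in range(m):
        column = [row[j] for row in rows]
        total = sum(column)
        # Share of each object in indicator j; zero shares add nothing to the entropy.
        shares = [x / total for x in column]
        entropy = -sum(p * math.log(p) for p in shares if p > 0) / math.log(n)
        # Low entropy means high variation, hence more information.
        divergences.append(1.0 - entropy)
    total_divergence = sum(divergences)
    return [d / total_divergence for d in divergences]

# Hypothetical toy data: 4 evaluation objects x 3 indicators.
toy = [
    [0.2, 0.5, 0.9],
    [0.4, 0.5, 0.1],
    [0.6, 0.5, 0.8],
    [0.8, 0.5, 0.2],
]
weights = entropy_weights(toy)
print([round(w, 4) for w in weights])
```

In this toy data the second indicator is constant across objects, so it carries no information and its weight is essentially zero, which matches the behavior described in the table.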

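Table 2. Introduction to different comprehensive evaluation methods

Category	Method	Description	Applicable scenarios
Conventional evaluation	Technique for order preference by similarity to ideal solution (TOPSIS)	Based on the ideal-point principle: alternatives are ranked by their relative closeness to the ideal solution, and the best alternative is selected.	Data are available, preferably all quantitative.
Fuzzy mathematics	Fuzzy comprehensive evaluation	Built on fuzzy mathematics: factors that are hard to quantify are quantified, and the membership grade of the evaluated object is assessed across multiple indicators.	Evaluation indicators are subjective and hard to quantify.
Grey comprehensive evaluation	Grey relational analysis (GRA)	Compares and ranks the evaluation objects by the degree of relation between each alternative and the optimal alternative.	No strict sample size requirement and no distributional assumption; suited to problems with only a few observations.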
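
As an illustration of the fuzzy comprehensive evaluation entry in Table 2, the following Python sketch evaluates a single object. It assumes the standard weighted-average operator with a membership matrix that has one row per indicator and one column per grade; this layout, the function name `fuzzy_evaluate`, and all numbers are assumptions for illustration and are not taken from the EvaModels package.

```python
def fuzzy_evaluate(membership, weights, grades, scores):
    """Weighted-average fuzzy comprehensive evaluation for one object.

    membership: one row per indicator, one column per grade (rows sum to 1).
    weights:    indicator weights (sum to 1).
    grades:     grade labels; scores: numeric value attached to each grade.
    """
    n_grades = len(grades)
    # B = w . R: overall membership of the object in each grade.
    overall = [
        sum(w * row[k] for w, row in zip(weights, membership))
        for k in range(n_grades)
    ]
    # Maximum-membership principle picks the grade label.
    best_grade = grades[overall.index(max(overall))]
    # A single composite score is the membership-weighted grade score.
    total_score = sum(b * s for b, s in zip(overall, scores))
    return overall, best_grade, total_score

# Hypothetical example: 3 indicators, 3 grades.
membership = [
    [0.5, 0.3, 0.2],
    [0.2, 0.6, 0.2],
    [0.1, 0.3, 0.6],
]
weights = [0.5, 0.3, 0.2]
grades = ["good", "fair", "poor"]
scores = [90, 70, 50]
overall, best_grade, total_score = fuzzy_evaluate(membership, weights, grades, scores)
print(overall, best_grade, total_score)
```

The grade label and the composite score answer different questions: the label reports the most plausible grade, while the score allows objects with the same label to be ranked.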

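Table 3. List of functions and their descriptions in the "EvaModels" package

Package	Function	Method	Purpose
EvaModels	cluster_CV()	R-type clustering with coefficient of variation	Indicator screening
EvaModels	norm()	Normalization of positive and negative indicators	Data standardization
EvaModels	AHP()	Analytic hierarchy process	Indicator weighting
EvaModels	EM()	Entropy weight method	Indicator weighting
EvaModels	CV()	Coefficient of variation weighting	Indicator weighting
EvaModels	PCA()	Principal component weighting	Indicator weighting
EvaModels	TOPSIS()	TOPSIS	Comprehensive evaluation
EvaModels	Fuzzy()	Fuzzy comprehensive evaluation	Comprehensive evaluation
EvaModels	GRA()	Grey relational analysis	Comprehensive evaluation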

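Table 4. Parameter interpretation of functions in the "EvaModels" package

Function	Parameter	Type	Description
cluster_CV(data, k)	data	Data frame	Raw data to be screened: rows are indicators, columns are evaluation objects
cluster_CV(data, k)	k	Integer	Number of clusters
norm(data, type)	data	Data frame	Raw data to be standardized: rows are evaluation objects, columns are indicators
norm(data, type)	type	Numeric vector	Indicator direction: 1 for a positive indicator, 2 for a negative indicator
AHP(data)	data	Numeric matrix	Judgment matrix to be weighted: rows are evaluation objects, columns are indicators
EM(data, type)	data	Data frame	Raw data to be weighted: rows are evaluation objects, columns are indicators
EM(data, type)	type	Numeric vector	Indicator direction: 1 for a positive indicator, 2 for a negative indicator
CV(data)	data	Data frame	Raw data to be weighted: rows are evaluation objects, columns are indicators
PCA(data)	data	Data frame	Raw data to be weighted: rows are evaluation objects, columns are indicators
TOPSIS(data, w, type)	data	Data frame	Raw data to be evaluated: rows are evaluation objects, columns are indicators
TOPSIS(data, w, type)	w	Numeric vector	Indicator weights: may be obtained with any of the weighting methods
TOPSIS(data, w, type)	type	Numeric vector	Indicator direction: 1 for a positive indicator, 2 for a negative indicator
Fuzzy(r, w, v, s)	r	Numeric matrix	Membership matrix to be evaluated: rows are evaluation objects, columns are indicators
Fuzzy(r, w, v, s)	w	Numeric vector	Indicator weights: may be obtained with any of the weighting methods
Fuzzy(r, w, v, s)	v	Vector	Evaluation grades
Fuzzy(r, w, v, s)	s	Numeric vector	Scores assigned to the grades
GRA(data, r, w)	data	Numeric matrix	Raw matrix to be evaluated: rows are evaluation objects, columns are indicators; the first row is the reference sequence and the remaining rows are comparison sequences
GRA(data, r, w)	r	Decimal	Resolution coefficient
GRA(data, r, w)	w	Numeric vector	Indicator weights: may be obtained with any of the weighting methods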
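
To make the role of the judgment matrix passed to an AHP routine more concrete, here is a minimal Python sketch. It assumes the conventional AHP setup, which may differ from what `AHP()` in EvaModels does internally: a square pairwise-comparison matrix over indicators, weights from the row geometric-mean approximation, and a consistency ratio based on Saaty's random index. The function name `ahp_weights` and the example matrix are hypothetical.

```python
import math

# Saaty's random consistency index for matrix orders 1 to 9.
RANDOM_INDEX = {1: 0.0, 2: 0.0, 3: 0.58, 4: 0.90, 5: 1.12,
                6: 1.24, 7: 1.32, 8: 1.41, 9: 1.45}

def ahp_weights(judgment):
    """Return (weights, consistency_ratio) for a pairwise-comparison matrix."""
    n = len(judgment)
    # The geometric mean of each row approximates the principal eigenvector.
    geo_means = [math.prod(row) ** (1.0 / n) for row in judgment]
    total = sum(geo_means)
    weights = [g / total for g in geo_means]
    # Estimate lambda_max as the average of (A w)_i / w_i.
    aw = [sum(a * w for a, w in zip(row, weights)) for row in judgment]
    lambda_max = sum(x / w for x, w in zip(aw, weights)) / n
    ci = (lambda_max - n) / (n - 1) if n > 1 else 0.0
    ri = RANDOM_INDEX[n]
    cr = ci / ri if ri > 0 else 0.0
    return weights, cr

# Hypothetical, perfectly consistent matrix built from weights 0.6, 0.3, 0.1.
judgment = [
    [1.0, 2.0, 6.0],
    [0.5, 1.0, 3.0],
    [1 / 6, 1 / 3, 1.0],
]
weights, cr = ahp_weights(judgment)
print([round(w, 4) for w in weights], round(cr, 4))
```

A consistency ratio below 0.1 is the usual threshold for accepting a judgment matrix; larger values suggest the pairwise comparisons should be revisited.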

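Table 5. Indicator screening results

Indicator	Meaning	Cluster	Coefficient of variation	Retained
CCM_physician	Proportion of critical care physicians in the hospital	1	0.8146	Yes
pathologist	Proportion of pathologists in the hospital	1	0.2819	No
anesthetist	Proportion of anesthesiologists in the hospital	2	0.3591	No
pediatrist	Proportion of pediatricians in the hospital	2	0.2432	No
TCM_physicain	Proportion of traditional Chinese medicine physicians in the hospital	2	0.4565	Yes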
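
The retention pattern in Table 5 is consistent with a simple rule: within each cluster, keep the indicator with the largest coefficient of variation. The Python sketch below applies that assumed rule to the cluster labels and coefficients reported in the table. The clustering step itself is taken as given, and the function name `keep_highest_cv` is hypothetical.

```python
# (indicator, cluster, coefficient of variation) as reported in Table 5.
table5 = [
    ("CCM_physician", 1, 0.8146),
    ("pathologist", 1, 0.2819),
    ("anesthetist", 2, 0.3591),
    ("pediatrist", 2, 0.2432),
    ("TCM_physicain", 2, 0.4565),
]

def keep_highest_cv(records):
    """Keep, for each cluster, the indicator with the largest CV."""
    best = {}
    for name, cluster, cv in records:
        if cluster not in best or cv > best[cluster][1]:
            best[cluster] = (name, cv)
    return {name for name, _ in best.values()}

kept = keep_highest_cv(table5)
print(sorted(kept))
```

Applied to these five rows, the rule keeps CCM_physician and TCM_physicain, the same two indicators marked as retained in the table.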

Table 6. Partial data normalization results

Hospital	CCM_physician	TCM_physicain	doctor_nurse	exam	funds
A	0.0219	0.4876	0.6304	0.7347	1.0000
B	0.5578	0.4876	0.9348	0.8333	0.3658
C	0.3446	0.0854	0.8696	0.9091	0.2665
D	0.3606	0.2115	0.8043	0.9048	0.4082
E	0.5418	0.2701	0.6087	0.7143	0.0160
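
The exact normalization formula behind Table 6 is not given here. A common choice for direction-aware normalization is min-max scaling, sketched below in Python under that assumption. The direction codes follow Table 4 (1 for positive, 2 for negative indicators); the function name `normalize` and the raw values are hypothetical.

```python
def normalize(rows, directions):
    """Min-max normalization with indicator direction.

    rows:       one list per evaluation object, one value per indicator.
    directions: 1 for positive indicators (larger is better),
                2 for negative indicators (smaller is better).
    """
    columns = list(zip(*rows))
    scaled_columns = []
    for column, direction in zip(columns, directions):
        lo, hi = min(column), max(column)
        span = hi - lo
        if span == 0:
            # A constant indicator cannot discriminate between objects.
            scaled_columns.append([0.0] * len(column))
        elif direction == 1:
            scaled_columns.append([(x - lo) / span for x in column])
        elif direction == 2:
            scaled_columns.append([(hi - x) / span for x in column])
        else:
            raise ValueError("direction must be 1 or 2")
    return [list(row) for row in zip(*scaled_columns)]

# Hypothetical raw data: a positive and a negative indicator.
raw = [
    [10, 30],
    [20, 10],
    [40, 20],
]
normalized = normalize(raw, [1, 2])
print(normalized)
```

After this step every indicator lies in [0, 1] and larger always means better, so later weighting and evaluation steps can treat all columns the same way.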

Table 7. Indicator weights calculated by three methods

Indicator	w_EM	w_PCA	w_CV
CCM_physician	0.1412	0.1925	0.1919
TCM_physicain	0.1146	0.1950	0.1075
doctor_nurse	0.0296	0.1739	0.0327
exam	0.0401	0.2118	0.0829
funds	0.6746	0.2268	0.5850
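
For comparison with the w_CV column, coefficient of variation weighting can be sketched in a few lines of Python. The sketch assumes each weight is the indicator's coefficient of variation divided by the sum over all indicators. It uses only the five hospitals shown in Table 6 as illustrative input, so its output is not expected to reproduce Table 7, which was presumably computed on the full data set. The function name `cv_weights` is hypothetical.

```python
import statistics

def cv_weights(rows):
    """Weights proportional to each indicator's coefficient of variation."""
    columns = list(zip(*rows))
    # The choice between sample and population standard deviation does not
    # matter here: the constant factor cancels when the weights are normalized.
    cvs = [statistics.stdev(col) / statistics.mean(col) for col in columns]
    total = sum(cvs)
    return [cv / total for cv in cvs]

indicators = ["CCM_physician", "TCM_physicain", "doctor_nurse", "exam", "funds"]
# The five rows shown in Table 6 (hospitals A to E), used for illustration only.
rows = [
    [0.0219, 0.4876, 0.6304, 0.7347, 1.0000],
    [0.5578, 0.4876, 0.9348, 0.8333, 0.3658],
    [0.3446, 0.0854, 0.8696, 0.9091, 0.2665],
    [0.3606, 0.2115, 0.8043, 0.9048, 0.4082],
    [0.5418, 0.2701, 0.6087, 0.7143, 0.0160],
]
weights = cv_weights(rows)
for name, weight in zip(indicators, weights):
    print(name, round(weight, 4))
```

Even on this subset, funds receives the largest weight because it varies most relative to its mean, which agrees in direction with the w_CV and w_EM columns.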

Table 8. TOPSIS comprehensive evaluation

Hospital	EM-TOPSIS Index	EM-TOPSIS Rank	PCA-TOPSIS Index	PCA-TOPSIS Rank	CV-TOPSIS Index	CV-TOPSIS Rank
A	0.7816	1	0.5913	2	0.7231	1
B	0.4424	2	0.6245	1	0.4731	2
C	0.3004	5	0.4872	6	0.3350	4
D	0.4106	3	0.5348	5	0.4321	3
E	0.1649	11	0.4178	12	0.2215	10
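
The closeness index reported in Table 8 follows the general TOPSIS idea of distance to an ideal and an anti-ideal solution. The Python sketch below shows one textbook formulation (vector normalization, weighting, Euclidean distances); whether `TOPSIS()` in EvaModels uses exactly these steps is an assumption. The input is the five rows of Table 6 with the w_EM weights from Table 7, treating every indicator as positive, so the resulting scores and ranks will differ from Table 8, which covers more hospitals.

```python
import math

def topsis(rows, weights, directions):
    """Relative closeness to the ideal solution for each evaluation object."""
    m = len(rows[0])
    # Vector-normalize each column, then apply the indicator weights.
    norms = [math.sqrt(sum(row[j] ** 2 for row in rows)) for j in range(m)]
    weighted = [[weights[j] * row[j] / norms[j] for j in range(m)] for row in rows]
    ideal, anti_ideal = [], []
    for j in range(m):
        column = [row[j] for row in weighted]
        if directions[j] == 1:   # positive indicator: larger is better
            ideal.append(max(column))
            anti_ideal.append(min(column))
        else:                    # negative indicator: smaller is better
            ideal.append(min(column))
            anti_ideal.append(max(column))
    scores = []
    for row in weighted:
        d_ideal = math.dist(row, ideal)
        d_anti = math.dist(row, anti_ideal)
        scores.append(d_anti / (d_ideal + d_anti))
    return scores

hospitals = ["A", "B", "C", "D", "E"]
rows = [
    [0.0219, 0.4876, 0.6304, 0.7347, 1.0000],
    [0.5578, 0.4876, 0.9348, 0.8333, 0.3658],
    [0.3446, 0.0854, 0.8696, 0.9091, 0.2665],
    [0.3606, 0.2115, 0.8043, 0.9048, 0.4082],
    [0.5418, 0.2701, 0.6087, 0.7143, 0.0160],
]
w_em = [0.1412, 0.1146, 0.0296, 0.0401, 0.6746]
scores = topsis(rows, w_em, [1, 1, 1, 1, 1])
for name, score in sorted(zip(hospitals, scores), key=lambda pair: -pair[1]):
    print(name, round(score, 4))
```

Because the ideal and anti-ideal points are taken from the data being evaluated, adding or removing hospitals changes every score, which is one reason a subset cannot reproduce the published ranks.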

Table 9. GRA comprehensive evaluation

Hospital	EM-GRA Index	EM-GRA Rank	PCA-GRA Index	PCA-GRA Rank	CV-GRA Index	CV-GRA Rank
A	0.9098	1	0.8494	1	0.8847	1
B	0.5241	2	0.7588	3	0.5643	2
C	0.4780	6	0.7179	8	0.5187	6
D	0.5159	3	0.7343	5	0.5521	3
E	0.4474	11	0.7079	11	0.4948	10
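
The weighted grey relational grade behind an index such as the one in Table 9 can be sketched as follows. The Python code assumes the usual formulation of the grey relational coefficient with a resolution coefficient (named `rho` here, corresponding to `r` in Table 4) and follows the Table 4 convention that the first row is the reference sequence. The function name `grey_relational_grades` and the data are hypothetical.

```python
def grey_relational_grades(matrix, rho, weights):
    """Weighted grey relational grade of each comparison sequence.

    matrix:  first row is the reference sequence, the rest are comparisons.
    rho:     resolution coefficient, commonly 0.5.
    weights: indicator weights (sum to 1).
    Assumes at least one comparison value differs from the reference.
    """
    reference, comparisons = matrix[0], matrix[1:]
    # Absolute difference of every comparison value from the reference.
    diffs = [[abs(x - ref) for x, ref in zip(row, reference)] for row in comparisons]
    flat = [d for row in diffs for d in row]
    d_min, d_max = min(flat), max(flat)
    grades = []
    for row in diffs:
        coefficients = [(d_min + rho * d_max) / (d + rho * d_max) for d in row]
        grades.append(sum(w * c for w, c in zip(weights, coefficients)))
    return grades

# Hypothetical normalized data: the reference is the ideal object (all ones).
matrix = [
    [1.0, 1.0, 1.0],   # reference sequence
    [1.0, 1.0, 1.0],   # identical to the reference
    [0.5, 0.5, 0.5],
    [0.0, 0.0, 0.0],
]
grades = grey_relational_grades(matrix, rho=0.5, weights=[0.5, 0.3, 0.2])
print([round(g, 4) for g in grades])
```

A smaller resolution coefficient spreads the grades further apart, while a larger one compresses them toward 1, so the same value should be used for every object being compared.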
