将优势比公式应用于多个变量

2024-05-23 17:23:24 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个拥有1000多条记录的数据集(见下文)。就优势比计算而言,新冠病毒是“因变量”。所有其他变量都是“自变量”,也称为症状变量。除了无症状、发烧和咳嗽变量外,我还有15个额外的症状变量。以下是所有症状变量:

Symptoms =['Cough',"Nasal_Congestion","Wheezing_Asthma","Abdominal_Pain","Diarrhea","Vomiting","Rash","Fever","MED_ALERT_CPR_SHOCK_SEPSIS","Lymph_Node_Neck","Ear","Mouth Sores","Eye","SOB_WOB_Hyp_Desat","PNA","Nausea","Weak_Fatigue","Bodyaches","Dizziness","Fussy","Poor_PO_Dehydration","Tachycardia","COVID Exposure","COVID Test","COVID PUI" ,"COVID MIS","COVID Kawasaki","CP","ST","HA","Loss_Taste_Smell"]

ID   Covid_pos  Asymptomatic  Fever  Cough ...
0          1             0      1      0
1          0             0      0      1
2          1             1      0      1
3          1             0      1      0
4          0             1      1      0
5          1             0      1      0
6          0             1      1      0
7          1             0      0      1
8          0             0      0      0
9          0             0      0      0

我想用COVID_POS变量对每个症状变量进行未调整的优势比。为了进行手术室,我有以下准则:

NCOdds=pd.crosstab(LABS_TAT.Nasal_Congestion,LABS_TAT.Covid_pos)
NCO = sm.stats.Table2x2(NCOdds)
print(NCO .summary())

enter image description here

我想知道是否可以应用for循环,而不是在更改症状变量时复制、粘贴和编辑这段代码15次以上。第二,也许更适合单独的问题,我希望输出如下:

enter image description here


Tags: 数据pos症状记录feverlabs优势tat