我有一个保险数据集,如下所示。为此,我需要建立一个模型来计算费用。
age sex bmi children smoker region charges
0 19 female 27.900 0 yes southwest 16884.92400
1 18 male 33.770 1 no southeast 1725.55230
2 28 male 33.000 3 no southeast 4449.46200
3 33 male 22.705 0 no northwest 21984.47061
4 32 male 28.880 0 no northwest 3866.85520
但我不确定是否可以删除“区域”列。是否可以进行任何测试以仅考虑重要变量?