我的数据具有在我的工具上执行的操作,我想预测准备从免费/跟踪转换为付费类别的客户。
我的数据如下所示:
dummy<-data.frame(license=sample(c("Free","Trail","Paid"),10000,replace = T,prob = c(0.6,0.35,0.05)),
plan_type=sample(1:5,10000,replace=T),
action1=sample(0:100,10000,replace = T),
action2=sample(0:1000,10000,replace = T),
action3=sample(0:10,10000,replace = T),
num_days_in_product=sample(0:500,10000,replace = T))
head(dummy)
license plan_type action1 action2 action3 num_days_in_product
1 Paid 1 100 71 5 285
2 Free 5 75 438 1 2
3 Free 1 5 555 7 389
4 Free 3 4 105 0 150
5 Free 1 16 348 7 423
6 Free 5 15 866 8 270
> table(dummy$license)
Free Paid Trail
6016 516 3468
> prop.table(table(dummy$license))
Free Paid Trail
0.6016 0.0516 0.3468
让我知道是否需要任何额外信息。