This is what my DataFrame looks like:
Number Age Famous_for
1 35 "businessman chairman of IBM (1973–1981)"
2 42 "musician (House of Freaks Gutterball)"
3 87 "baseball player (Oakland Athletics)"
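I want to create an additional column containing a dummy variable that indicates whether or not the person is in the entertainment business. Something like this: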
Number Age Famous_for Entertaining
1 35 "businessman chairman of IBM (1973–1981)" 0
2 42 "musician (House of Freaks Gutterball)" 1
3 87 "baseball player (Oakland Athletics)" 0
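How can I create a column based on whether certain words (e.g. "musician", "club", "actor", etc.) appear in the Famous_for column? I tried the following: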
df['entertaining'] = np.where(df['famous_for']>="musician", 1, 0)
But this doesn't work. How can I do this in Python?
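For reference, here is a minimal sketch of one way this might be done, assuming the column is named `Famous_for` as in the table and that a plain substring match is good enough. The `>=` comparison in the attempt compares strings lexicographically rather than testing whether a word is contained in the text, so the sketch uses pandas' `Series.str.contains` instead. The `keywords` list and `pattern` variable are illustrative names, not part of the original data.

```python
import re

import pandas as pd

# Rebuild the sample data from the question.
df = pd.DataFrame({
    "Number": [1, 2, 3],
    "Age": [35, 42, 87],
    "Famous_for": [
        "businessman chairman of IBM (1973–1981)",
        "musician (House of Freaks Gutterball)",
        "baseball player (Oakland Athletics)",
    ],
})

# Assumed keyword list; extend it as needed.
keywords = ["musician", "club", "actor"]

# Join the keywords into one regex alternation, escaping any
# characters that would otherwise have a special regex meaning.
pattern = "|".join(re.escape(word) for word in keywords)

# True where any keyword occurs (case-insensitive); missing values
# count as no match. Casting to int gives the 0/1 dummy variable.
df["Entertaining"] = (
    df["Famous_for"].str.contains(pattern, case=False, na=False).astype(int)
)

print(df[["Number", "Age", "Entertaining"]])
```

Note that this matches substrings, so a keyword like "actor" would also match "contractor"; wrapping each keyword in `\b` word boundaries would be one way to tighten that if it matters for the data.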