这是我为预处理数据集编写的代码。有用
import numpy as np
import pandas as pd
from sklearn import svm
%matplotlib inline
import matplotlib.pyplot as plt
from sklearn.impute import SimpleImputer
import seaborn as sns; sns.set(font_scale=1.2)
stock=pd.read_csv("C:/Users/Dulangi/Downloads/winequality-red.csv")
stock.head()
X= stock.iloc[:,0:5].values
y= stock.iloc[:,5].values
g=sns.lmplot('alcohol','quality',data=stock,height=7, truncate=True, scatter_kws={"s":100})
imputer = SimpleImputer( strategy = "mean")
imputer = imputer.fit(X[:,1:2])
imputer.fit_transform(X[:,1:2])
imputer = imputer.fit(X[:,4:5])
imputer.fit_transform(X[:,4:5])
我想知道如果我在一列中同时包含字符串和数字数据怎么办,如何预处理这些数据以包含所有数字数据?