oaf_x = oaf[['Administrative', 'Administrative_Duration', 'Informational', 'Informational_Duration', 'ProductRelated', 'ProductRelated_Duration', 'BounceRates', 'ExitRates', 'PageValues', 'SpecialDay', 'Month', 'OperatingSystems', 'Browser', 'Region', 'TrafficType', 'VisitorType','Weekend']]
oaf_y = oaf[['Revenue']]
from sklearn.model_selection import train_test_split
X_train,X_test,y_train,y_test = train_test_split(oaf_x,oaf_y,test_size =
0.3,random_state = 52)
# Function for showing a dataset of 2 classes
def plot_dataset(X, y, axes):
plt.plot(X[:, 0][y==0], X[:, 1][y==0], "bs")
plt.plot(X[:, 0][y==1], X[:, 1][y==1], "g^")
plt.axis(axes)
plt.grid(True, which='both')
plt.xlabel(r"$x_1$", fontsize=20)
plt.ylabel(r"$x_2$", fontsize=20, rotation=0)
plot_dataset(X_train.values, y_train.values, [-1.5, 2.5, -1, 1.5])
plt.show()
It results in the error message:
IndexError: too many indices for array
How can I solve this problem? The dataset is really big that i can not show all the values here, some columns type are object, some are float or int, some are bool.
与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…