Today was the second meeting of our BI class. Students had done a simple exercise in SAS EM with the Churn data set, which has 3351 records. It's the same telecom Churn data I used for my professional Master's class, but this version is clean.
The topic was data preprocessing and some basic EDA. We talked about correlated variables, Chi-square, Cramer's V, variable worth, missing values, normalization, outliers, mean, mode, median, skewness, and kurtosis. The PPT slides are here. A student reminded me that I needed to explain what interval, nominal, ordinal, and binary variables are, which was a timely reminder. Phone numbers, area codes, and zip codes are always good examples: they look numeric, but they are really nominal.
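As a small illustration of how Chi-square and Cramer's V relate, here is a minimal Python sketch (not what we did in SAS EM). It computes Cramer's V for two nominal variables from a contingency table of counts. The function name `cramers_v` and the counts in the example table are made up for illustration; they are not taken from the Churn data.

```python
import math

def cramers_v(table):
    """Cramer's V for a contingency table given as a list of rows of counts.

    Assumes every row and column has at least one observation,
    otherwise an expected count of zero would cause a division error.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)

    # Pearson chi-square statistic: sum of (observed - expected)^2 / expected
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected

    # Scale by n * min(rows - 1, cols - 1) so the result lies between 0 and 1
    k = min(len(table), len(table[0])) - 1
    return math.sqrt(chi2 / (n * k))

# Hypothetical counts: rows = plan (no, yes), columns = churn (no, yes)
example_table = [[90, 10], [30, 20]]
print(round(cramers_v(example_table), 3))
```

A value near 0 suggests the two variables are close to independent, and a value near 1 suggests a strong association.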
The class activity for today is here, and the concept check is on the second page of the same link.