MC Given the following five transactions:

T1 {K, A, D, B}
T2 {D, A, C, E, B}
T3 {C, A, B, D}
T4 {B, A, E}
T5 {B, E, D}

Consider the association rule R: A -> BD.
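
As a hedged worked check, the support and confidence of R can be computed directly from the five transactions. The sketch below is only an illustration: the helper names `count_containing`, `rule_support`, and `rule_confidence` are hypothetical and not part of the question. It assumes the usual definitions, support = share of all transactions containing A, B, and D, and confidence = share of the transactions containing A that also contain B and D.

```python
# Transactions copied from the question above.
transactions = {
    "T1": {"K", "A", "D", "B"},
    "T2": {"D", "A", "C", "E", "B"},
    "T3": {"C", "A", "B", "D"},
    "T4": {"B", "A", "E"},
    "T5": {"B", "E", "D"},
}


def count_containing(itemset, transactions):
    """Number of transactions that contain every item in `itemset` (hypothetical helper)."""
    return sum(1 for items in transactions.values() if itemset <= items)


antecedent = {"A"}
consequent = {"B", "D"}

# Support: transactions containing antecedent and consequent, over all transactions.
both = count_containing(antecedent | consequent, transactions)
rule_support = both / len(transactions)

# Confidence: the same count, over transactions containing the antecedent only.
rule_confidence = both / count_containing(antecedent, transactions)

print(f"support = {rule_support:.0%}, confidence = {rule_confidence:.0%}")
```

T1, T2, and T3 contain A, B, and D, while T4 contains A without D and T5 does not contain A, so the counts are 3 of 5 for support and 3 of 4 for confidence.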

Which statement is correct?
The support of R is 100% and the confidence is 75%. incorrect
The support of R is 75% and the confidence is 60%. incorrect
The support of R is 60% and the confidence is 100%. incorrect
The support of R is 60% and the confidence is 75%. correct

MC Featurization in the context of neural networks refers to...
selecting the most predictive features. incorrect
making features (=inputs) out of the network characteristics. correct
adding more local features to the data set. incorrect
adding more nodes to the network. incorrect

MC Which of the following strategies can be used to deal with missing values?
Keep incorrect
Delete incorrect
Replace/impute incorrect
All of these strategies can be applied correct

MC What statement about the adjacency matrix representing a social network is not true?
It is a symmetric matrix. incorrect
It has the same number of rows and columns. incorrect
It can include weights. incorrect
It is sparse since it contains a lot of non-zero elements. correct

MC Which statement is NOT CORRECT?
Although the benefit component is usually not that difficult to approximate, the costs are much harder to precisely quantify. correct
ROI analysis offers a common firm-wide language to compare multiple investment opportunities and decide which one(s) to go for. incorrect
For companies like Facebook, Amazon, Netflix and Google a positive ROI is obvious since they essentially thrive on data and analytics. incorrect
Negative ROI of analytics often boils down to the lack of good quality data, management support and a company-wide data driven decision culture. incorrect

MC Consider a data set with a multiclass target variable as follows: 25% bad payers, 25% poor payers, 25% medium payers and 25% good payers. In this case, the entropy will be:
Minimal incorrect
Maximal correct

MC Given the following decision tree:



According to the decision tree, an applicant with Income > $50.000 and High Debt=Yes is classified as:
Bad Risk correct
Good Risk incorrect

MC Which of the following measures cannot be used to make the splitting decision in a regression tree?
ANOVA/F-test incorrect
Entropy correct
Mean Squared Error (MSE) incorrect