MC Which of the following is not an advantage of open-source software for analytics? A worldwide network of developers can work on it. incorrect It is available for free. incorrect It can be used in combination with commercial software. incorrect It has been thoroughly engineered and extensively tested, validated and completely documented. correct
MC Featurization in the context of neural networks refers to... adding more nodes to the network. incorrect selecting the most predictive features. incorrect adding more local features to the data set. incorrect making features (=inputs) out of the network characteristics. correct
MC Which statement is CORRECT? Although the benefit component is usually not that difficult to approximate, the costs are much harder to precisely quantify. incorrect All given success factors of an analytical model, i.e. relevance, performance, interpretability, efficiency, economic cost and regulatory compliance, are always equally important. incorrect On-premise solutions catalyze improved collaboration across business departments and geographical locations. incorrect The aim of clustering is to come up with clusters such that the homogeneity within a cluster is maximized and the heterogeneity between clusters is maximized. correct
MC Given the following decision tree:

According to the decision tree, an applicant with Income > $50,000 and High Debt=Yes is classified as: Bad Risk correct Good Risk incorrect
MC Outlying observations which represent erroneous data are treated using... missing value procedures. correct truncation or capping. incorrect
MC Featurization in the context of neural networks refers to... making features (=inputs) out of the network characteristics. correct adding more nodes to the network. incorrect selecting the most predictive features. incorrect adding more local features to the data set. incorrect
MC Consider a data set with a multiclass target variable as follows: 25% bad payers, 25% poor payers, 25% medium payers and 25% good payers. In this case, the entropy will be: Maximal correct Minimal incorrect
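The entropy answer above can be checked numerically. The following is a minimal sketch, assuming Shannon entropy in bits (base-2 logarithm); the function name `entropy` and the skewed comparison distribution are illustrative and not part of the question.

```python
import math

def entropy(proportions):
    """Shannon entropy (in bits) of a class distribution given as proportions."""
    return -sum(p * math.log2(p) for p in proportions if p > 0)

# The four equally sized payer classes from the question.
uniform = [0.25, 0.25, 0.25, 0.25]
# Hypothetical skewed distribution, used only for comparison.
skewed = [0.70, 0.10, 0.10, 0.10]

# With k classes, entropy cannot exceed log2(k); here k = 4.
max_entropy = math.log2(len(uniform))

print(entropy(uniform))  # 2.0, equal to max_entropy
print(entropy(skewed))   # strictly below 2.0
```

With four equally likely classes the entropy reaches its upper bound of log2(4) = 2 bits, which is why the uniform split is the maximal case.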
MC Which statement is CORRECT? The graph theoretic center is the node with the highest minimum distance to all other nodes. incorrect The betweenness counts the number of times that a node or edge occurs in the geodesics of the network. correct The geodesic represents the longest path between two nodes. incorrect The closeness is always higher than the betweenness. incorrect
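To make the geodesic and betweenness definitions concrete, here is a minimal sketch on a small hypothetical network (the node names and edges are assumptions for illustration). It finds geodesics (shortest paths) with breadth-first search and counts how often each node lies strictly inside one. The example graph is a tree, so every pair of nodes has exactly one geodesic; with tied shortest paths, the usual betweenness centrality weights each path by its share instead of counting it fully.

```python
from collections import deque
from itertools import combinations

# Hypothetical undirected network stored as adjacency lists (a connected tree).
graph = {
    "A": ["B"],
    "B": ["A", "C", "E"],
    "C": ["B", "D"],
    "D": ["C"],
    "E": ["B"],
}

def geodesic(graph, source, target):
    """Return one shortest path (fewest edges) from source to target via BFS.

    Assumes the graph is connected, so the target is always reachable.
    """
    parent = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            break
        for neighbor in graph[node]:
            if neighbor not in parent:
                parent[neighbor] = node
                queue.append(neighbor)
    # Walk back from the target to the source using the parent links.
    path = []
    node = target
    while node is not None:
        path.append(node)
        node = parent[node]
    return path[::-1]

# Count, for every unordered node pair, the interior nodes of its geodesic.
betweenness = {node: 0 for node in graph}
for source, target in combinations(graph, 2):
    for node in geodesic(graph, source, target)[1:-1]:
        betweenness[node] += 1

print(geodesic(graph, "A", "D"))  # ['A', 'B', 'C', 'D']
print(betweenness)  # {'A': 0, 'B': 5, 'C': 3, 'D': 0, 'E': 0}
```

Node B sits on the most geodesics, so it has the highest betweenness in this toy network.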
MC Which of the following activities are part of the post-processing step? Model interpretation and validation incorrect Sensitivity analysis incorrect Model representation incorrect All of these activities correct
MC What statement about the adjacency matrix representing a social network is not true? It is sparse since it contains a lot of non-zero elements. correct It is a symmetric matrix. incorrect It can include weights. incorrect It has the same number of rows and columns. incorrect
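The adjacency matrix properties named in the options can be illustrated with a minimal sketch. The people, ties and weights below are hypothetical, and the network is assumed to be undirected, which is what makes the matrix symmetric.

```python
# Hypothetical undirected social network with weighted ties.
nodes = ["Ann", "Bob", "Cem", "Dee", "Eva"]
edges = [("Ann", "Bob", 2), ("Bob", "Cem", 1), ("Dee", "Eva", 3)]

index = {name: i for i, name in enumerate(nodes)}
n = len(nodes)

# Square n x n matrix, initialised to zero (no tie).
matrix = [[0] * n for _ in range(n)]
for a, b, weight in edges:
    i, j = index[a], index[b]
    matrix[i][j] = weight  # weights are allowed instead of plain 0/1 entries
    matrix[j][i] = weight  # mirror the entry because the tie is undirected

is_square = all(len(row) == n for row in matrix)
is_symmetric = all(
    matrix[i][j] == matrix[j][i] for i in range(n) for j in range(n)
)
# Share of zero entries: a sparse matrix consists mostly of zeros.
zero_share = sum(value == 0 for row in matrix for value in row) / (n * n)

print(is_square, is_symmetric, zero_share)  # True True 0.76
```

Sparsity here means that most entries are zero, which is the opposite of what the false option in the question claims.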